Gene Information |
Locus tag | GM21_3508 |
Symbol | |
ID | 8138880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4049211 |
End bp | 4050350 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644871127 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003023287 |
Protein GI | 253702098 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
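The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that the reported 1140 bp length supports) and that the gene length includes the terminal stop codon, as the listed sequence ending in TGA suggests. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4049211
end_bp = 4050350

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon, which encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Because the gene lies on the minus strand, the start and end values describe its position on the replicon rather than the direction of transcription.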
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1.87435e-20 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGCTTC ACAGGCTTTT GAGCGGCTTG GCGTCTACCA GCTACTGTCT TATCTCAAGT GGCCGAAAAC ACGATGCCGG GGATGCCGCC TGTGAGCGCT TGGATGCACC GTACTTCTAT CTGCCAAAGG TGAGGCAGTT GCCGCCGGTA GCGATCCCCG GCCTTTCTGC CCTGTTCGTG TGTATCAACC TGTTGTGGGT GGTGTTGAGG CGATCCCGGC AGATAGAAGA TATGGCGAGG CGCGAAGGAT GCGAGGCTAT CGTGGCGTGC ACAGGCGATT TTTATGATCT CCCTGCAGCG TTTCTGGCCT GCAGGCGCAT GAAGATACCA TTCGTTCCCT ACATCTTCGA CGACTACGGT TACCAGTGGC TCGGGTTCAG GCGCAGTATC GCCAAGCGGT TGGAGCGTGT CCTGCTCTCC TTTGCAGCGG CGGTCATTGT ACCGAACGAA TACCTGCAGA GAGAGTACGC AACCAGACAC GGCATCGACA GCACTGTCAT CCATAATCCC TGCTCTTTGC CGGACCTGGA GCGCCTGGAC CAGGGGCCGA AGATGTTCGG GGAGGGGGTG AACATCGTCT ATACGGGGTC GATTTACCAC GCCCATTACG ATGCCTTCGC AAACCTCATC GCTGCTCTCA GGCTTTTGGG CAGGCCGGAG GTGAAACTGC ACCTGTTCAC CGCACAGTCT GAAAGAGAGT TGGCTGGACA GGGGATAGGT GGGCCGCAGG TTGTCCACCA CCCTCACGTT CCACAGCGCG AGGTGGAGCG TATCCTGCGG CAGGCGGACC TGCTTTTTCT CCCGTTGGCG TTTCGTTCCC CGATACCCGA GGTGATAAGA ACTTCGGCGC CAGGCAAAAT GGGTGAATAT CTCGCTGTGG GGCGTCCTGT GCTCGTTCAT GCCCTGCCCG ATTCCTTCAT CGCCTGGTAC TTCCGGGCGA ATGGATGCGG GATAGTCGTA GATCAGCATG ATGCAGGAGT TCTGTCACAG GCAATCGAGG CACTGTTATC GGATCCGCAG GCGCTGACAG ATATGGGACT TAAAGCAAGA AAGAGAGCTC AAGTTGATTT CGACGTTACC GTCGTCCGTT CGCAATTCCT TGCTTTACTA AAACGGATCG GCTTTAGGTG TGGTGCATGA
|
Protein sequence | MVLHRLLSGL ASTSYCLISS GRKHDAGDAA CERLDAPYFY LPKVRQLPPV AIPGLSALFV CINLLWVVLR RSRQIEDMAR REGCEAIVAC TGDFYDLPAA FLACRRMKIP FVPYIFDDYG YQWLGFRRSI AKRLERVLLS FAAAVIVPNE YLQREYATRH GIDSTVIHNP CSLPDLERLD QGPKMFGEGV NIVYTGSIYH AHYDAFANLI AALRLLGRPE VKLHLFTAQS ERELAGQGIG GPQVVHHPHV PQREVERILR QADLLFLPLA FRSPIPEVIR TSAPGKMGEY LAVGRPVLVH ALPDSFIAWY FRANGCGIVV DQHDAGVLSQ AIEALLSDPQ ALTDMGLKAR KRAQVDFDVT VVRSQFLALL KRIGFRCGA
|
| |
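The relationship between the gene sequence and the protein sequence can be checked by translating codons. The sketch below is illustrative rather than the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so no reverse complement is taken despite the minus-strand annotation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, so a standard codon table is used here. The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks used on this page can be
    pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_30 = "ATGGTGCTTC ACAGGCTTTT GAGCGGCTTG"
print(translate(first_30))  # compare with the first block of the protein sequence
```

Pasting the full gene sequence into `translate` should give a string that can be compared with the protein sequence once the spaces between blocks are removed.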