Gene Information |
Locus tag | GM21_1643 |
Symbol | |
ID | 8136974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1912767 |
End bp | 1913741 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644869256 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003021456 |
Protein GI | 253700267 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 196 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCGGG TCAGCATCAT AGTCGTCAAC TGGAACGGCT GGCGCGATAC CCTGGAATGC CTGGAGAGCC TGGAGAAGCT GGCCTATCCG GATTTCGGCA TCGTCCTATG CGATAACGGC TCAACCGACG AATCGCTGGA GCGGATCAGG GAGTGGGCCC GCAGGCATCA GGTCTCTTTC GCAGAGTTGG ATCGGGCCGA GCTGGAGGCG GGAGGGGGGT TGCGGGAGGT TAAGCTTGCC TTGTTGCGGG TTGGCGAGAA TCTCGGCTTT GCCGGAGGGA ACAACGTCGG CTTGAGCTAT GCGCTGGCGC AGGATGTGAG TTTTTGTTGG CTGCTAAACA ACGATACGGT GGTTGAACCC GACGCCCTGA CCGAGCTTGT GGCCCGGATG GAACAGGACC AGGGGATAGG CATCTGCGGC TCCAGTATCC TTTACTACCA CGACCGGGAC CGGATCCAGG CGCTGGGTGG GGGATATTAC CATCCATGGC TCGGTCTTCC CTTGCATTAC GGGCGTTTTA CCCCTTGGCG CATGAGCGGC AGAGTGCAAG GTATGGCGGC GTCCAAGATG AATTACGTCG AGGGTGCCTC CATGCTGGTA TCGCGCAGGT TCCTGCTTGA GATAGGAGTT CTCAGCGAGG AGTATTTCCT GTACTTCGAG GAAGCGGATT GGGCCATGCG CGCGCGCGGA CGCTTCTCCT TGGGGTATGC GGCGCGCAGT GTCGTCTACC ACAAGGTTGG CGGGAGCATC GGCACCAGCA GCAACCCAGG CAGAAAGAGC CTCGTCTGCG ACTTCTACAA CATCCGTAAC CGGATCAGGT TTGCCAGGCG CTACCACCCC TTGACCCTCC CCACCGTTTA TCTTGTGCTA TTGGGGGAGA TCGTGCTGCG CGCCCTGTAC GGCAGATGGG ATCGGGTCGC GATGATCCTG GGACTCATGA TGAAGGGCGC CAGCGGACCG GAGCTACGTC CGTGA
|
Protein sequence | MSRVSIIVVN WNGWRDTLEC LESLEKLAYP DFGIVLCDNG STDESLERIR EWARRHQVSF AELDRAELEA GGGLREVKLA LLRVGENLGF AGGNNVGLSY ALAQDVSFCW LLNNDTVVEP DALTELVARM EQDQGIGICG SSILYYHDRD RIQALGGGYY HPWLGLPLHY GRFTPWRMSG RVQGMAASKM NYVEGASMLV SRRFLLEIGV LSEEYFLYFE EADWAMRARG RFSLGYAARS VVYHKVGGSI GTSSNPGRKS LVCDFYNIRN RIRFARRYHP LTLPTVYLVL LGEIVLRALY GRWDRVAMIL GLMMKGASGP ELRP
|
| |
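The coordinate, length, and sequence fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the first codon is assumed to be read as methionine (the gene sequence starts with GTG while the protein sequence starts with M). The codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares.

```python
# Standard codon table built from the conventional T, C, A, G ordering.
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def feature_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of residues for an unspliced CDS that ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def translate_cds(dna):
    """Translate a CDS, ignoring the spaces between 10-base blocks.

    Assumption: the first codon is always emitted as M, whatever it encodes
    internally. Translation stops at the first stop codon.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    protein = []
    for index in range(0, usable, 3):
        residue = CODON_TABLE[dna[index:index + 3]]
        if residue == "*":
            break
        protein.append("M" if index == 0 else residue)
    return "".join(protein)


# Values copied from the record above.
gene_length = feature_length(1912767, 1913741)
protein_length = expected_protein_length(gene_length)
first_block = translate_cds("GTGAGCCGGG TCAGCATCAT AGTCGTCAAC")

print(gene_length)     # 975, matching "Gene Length"
print(protein_length)  # 324, matching "Protein Length"
print(first_block)     # MSRVSIIVVN, the first block of "Protein sequence"
```

Passing the full "Gene sequence" value to `translate_cds` and comparing the result with the "Protein sequence" value (spaces removed) extends the same check to the whole record.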