Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2598 |
Symbol | |
ID | 8137940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3029866 |
End bp | 3031122 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644870205 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003022395 |
Protein GI | 253701206 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 154 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAACA ATAAAACTGT AATAGCACTT TTAGGTAACA GTTTCATCGG TTGGGGCGGC GGCATAGATT TTTTGCGATT CTGCGCTAAT GCTTTGGCGC TTATCTGCAA GGGCAATAAC ACAAGAATAG TTATTTTGTT GCCAGACCCA GAAAATTGCA CCTTAATTAT TAAGACACGA GCCTTTTTGT CTGCTTGCAA ACAAATAGCA ATTGCGATTC TTGAGCGTAG AAAACCTATT TCCCGTCGAC ATAAGCCATT CTTCAAAAAA CAACTCACAG ATTCATTCCA AAATATAGAG GGAAATGTAG AAATATTATT CTACCCGCAA GGCAAAAATA TTGCGTCTGT AGTTGTCAGT ATACAAGCTG ATGTCGTTAT ACCCTGTGCT TTCTCACTTG GTTCGTCATT TCCTGTGCCA TGGGTTGGTT ATTTGTATGA TTTTCAGCAT AAGTATTTTC CAGATTATTT CTCAGATAAA GAAATAAACA CGCGTGATGC TCTGTTTTCT CAGATGCTTG GAGAAGCAAG TGCTGTTATC GTAAATGCAG CTGATGTTAA AAAAGATATT CAGAAGTTTT ATCCTCAAAC AAAATGTAAA GTGTTTGACC TGCCTTTTTC CGCAACACCA ATTGAATCCT GGTTTGAACC CGCCTCGGAA GATCTTTCAC AAAAATACGA CCTTCCCAGA ATATACTTTG TAATGTGCAA TCAGTTTTGG ATTCACAAGG ACCATGCAAC TGCTTTCAAG GCACTTGCTA TATATATGGA AGCGACAGGT CAACAGGATG TTCATATTGT GTGTACAGGT AGCACGGTTG ACTTTAGGCA TCCCGACTAT TTTTCCAATC TGAAAAATTA TGTTAATACA CTTGGACTAA CTGACAGAGT GCATTTTCTA GGTCATATTC CCAAAAAAGA TCAGATAGAC ATTATGTGCG GCTCGATTGC AGTTCTTCAG CCAACACTTT TTGAAGGTGG CCCCGGTGGT TTTGCTGTTT TTGATGCTAT TTCACTGGCA ATACCAGTGA TTTTGTCTGA TATCCCAGTA AATAGAGAGA TTGAAGGATA TAACGGTCTA CTATTTTTTA AGGCTGGCGA TGCGGATGAT ATGGCAGCAA AGATGATTGC CATTCAAAAC TTCACTCATG TTAAGCAAGG TAAAGAATTA TTGTTAACCA CCGGTAGAGA GAGAACAAAA ACATTCGGCC TGCGATTACT TGAAGCGGCC GAGTATGCCA TGAATCAACA AAACTAG
|
Protein sequence | MINNKTVIAL LGNSFIGWGG GIDFLRFCAN ALALICKGNN TRIVILLPDP ENCTLIIKTR AFLSACKQIA IAILERRKPI SRRHKPFFKK QLTDSFQNIE GNVEILFYPQ GKNIASVVVS IQADVVIPCA FSLGSSFPVP WVGYLYDFQH KYFPDYFSDK EINTRDALFS QMLGEASAVI VNAADVKKDI QKFYPQTKCK VFDLPFSATP IESWFEPASE DLSQKYDLPR IYFVMCNQFW IHKDHATAFK ALAIYMEATG QQDVHIVCTG STVDFRHPDY FSNLKNYVNT LGLTDRVHFL GHIPKKDQID IMCGSIAVLQ PTLFEGGPGG FAVFDAISLA IPVILSDIPV NREIEGYNGL LFFKAGDADD MAAKMIAIQN FTHVKQGKEL LLTTGRERTK TFGLRLLEAA EYAMNQQN
|
| |