Gene GM21_2598 details


Gene Information

Locus tag: GM21_2598
Symbol:
ID: 8137940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3029866
End bp: 3031122
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 39%
IMG OID: 644870205
Product: glycosyl transferase group 1
Protein accession: YP_003022395
Protein GI: 253701206
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
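
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3029866
end_bp = 3031122

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1257 bp
print(protein_length, "aa")  # 418 aa
```

Both results agree with the Gene Length and Protein Length fields in the record.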


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 154
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAACA ATAAAACTGT AATAGCACTT TTAGGTAACA GTTTCATCGG TTGGGGCGGC 
GGCATAGATT TTTTGCGATT CTGCGCTAAT GCTTTGGCGC TTATCTGCAA GGGCAATAAC
ACAAGAATAG TTATTTTGTT GCCAGACCCA GAAAATTGCA CCTTAATTAT TAAGACACGA
GCCTTTTTGT CTGCTTGCAA ACAAATAGCA ATTGCGATTC TTGAGCGTAG AAAACCTATT
TCCCGTCGAC ATAAGCCATT CTTCAAAAAA CAACTCACAG ATTCATTCCA AAATATAGAG
GGAAATGTAG AAATATTATT CTACCCGCAA GGCAAAAATA TTGCGTCTGT AGTTGTCAGT
ATACAAGCTG ATGTCGTTAT ACCCTGTGCT TTCTCACTTG GTTCGTCATT TCCTGTGCCA
TGGGTTGGTT ATTTGTATGA TTTTCAGCAT AAGTATTTTC CAGATTATTT CTCAGATAAA
GAAATAAACA CGCGTGATGC TCTGTTTTCT CAGATGCTTG GAGAAGCAAG TGCTGTTATC
GTAAATGCAG CTGATGTTAA AAAAGATATT CAGAAGTTTT ATCCTCAAAC AAAATGTAAA
GTGTTTGACC TGCCTTTTTC CGCAACACCA ATTGAATCCT GGTTTGAACC CGCCTCGGAA
GATCTTTCAC AAAAATACGA CCTTCCCAGA ATATACTTTG TAATGTGCAA TCAGTTTTGG
ATTCACAAGG ACCATGCAAC TGCTTTCAAG GCACTTGCTA TATATATGGA AGCGACAGGT
CAACAGGATG TTCATATTGT GTGTACAGGT AGCACGGTTG ACTTTAGGCA TCCCGACTAT
TTTTCCAATC TGAAAAATTA TGTTAATACA CTTGGACTAA CTGACAGAGT GCATTTTCTA
GGTCATATTC CCAAAAAAGA TCAGATAGAC ATTATGTGCG GCTCGATTGC AGTTCTTCAG
CCAACACTTT TTGAAGGTGG CCCCGGTGGT TTTGCTGTTT TTGATGCTAT TTCACTGGCA
ATACCAGTGA TTTTGTCTGA TATCCCAGTA AATAGAGAGA TTGAAGGATA TAACGGTCTA
CTATTTTTTA AGGCTGGCGA TGCGGATGAT ATGGCAGCAA AGATGATTGC CATTCAAAAC
TTCACTCATG TTAAGCAAGG TAAAGAATTA TTGTTAACCA CCGGTAGAGA GAGAACAAAA
ACATTCGGCC TGCGATTACT TGAAGCGGCC GAGTATGCCA TGAATCAACA AAACTAG
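
The GC content field can be recomputed from the gene sequence as laid out above, in rows of space-separated 10-base blocks. This is a minimal sketch: `gc_content` is a hypothetical helper, not part of any database tooling, and it assumes the sequence contains only whitespace and the letters A, C, G, T.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (block separators and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above; paste all rows to check the whole gene.
first_row = "ATGATTAACA ATAAAACTGT AATAGCACTT TTAGGTAACA GTTTCATCGG TTGGGGCGGC"
print(f"{gc_content(first_row):.1f}% GC in the first row")
```

Passing the full 1257-base sequence to the function gives a value that can be compared with the GC content listed under Gene Information.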
 
Protein sequence
MINNKTVIAL LGNSFIGWGG GIDFLRFCAN ALALICKGNN TRIVILLPDP ENCTLIIKTR 
AFLSACKQIA IAILERRKPI SRRHKPFFKK QLTDSFQNIE GNVEILFYPQ GKNIASVVVS
IQADVVIPCA FSLGSSFPVP WVGYLYDFQH KYFPDYFSDK EINTRDALFS QMLGEASAVI
VNAADVKKDI QKFYPQTKCK VFDLPFSATP IESWFEPASE DLSQKYDLPR IYFVMCNQFW
IHKDHATAFK ALAIYMEATG QQDVHIVCTG STVDFRHPDY FSNLKNYVNT LGLTDRVHFL
GHIPKKDQID IMCGSIAVLQ PTLFEGGPGG FAVFDAISLA IPVILSDIPV NREIEGYNGL
LFFKAGDADD MAAKMIAIQN FTHVKQGKEL LLTTGRERTK TFGLRLLEAA EYAMNQQN
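
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand; NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. `translate` is a hypothetical helper that assumes the input begins at the start codon, is in frame, and contains only A, C, G, T.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATTAACA ATAAAACTGT AATAGCACTT"))  # MINNKTVIAL
```

The output matches the first 10-residue block of the protein sequence; translating the full gene sequence the same way allows a residue-by-residue comparison.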