Gene GM21_0884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0884 
Symbol 
ID8136205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1053594 
End bp1054583 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content65% 
IMG OID644868500 
Producthopanoid-associated sugar epimerase 
Protein accessionYP_003020709 
Protein GI253699520 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR03466] hopanoid-associated sugar epimerase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0000000000222195 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGGCAT TCGTCACAGG CGCCACCGGG TTCATCGGCG CGAGCATCGT GCGCGAACTG 
TTGAAGGACG GCTGGGAGGT ACGGGTCTTG GCCCGGCCCG GCTCGGACCG TCGCAACCTC
TCCGGGCTCG ACATAGAGAT CAGGGAAGGG GACCTGAGCG ACCGGGAGGC GCTGGTGCAG
GCGCTCAGCG GCTGCCGGGC GCTGTTCCAC GCCGCCGCCG ATTATCGGCT CTGGACACCC
ACGCCGGAGG CCATGTACGA TGTCAACGTC AAAGGGACCC GGGCGATACT GTCGGCGGCT
CTCGCGGCGG GCATCGAGAA GGTGGTCTAC ACAAGCAGCG TCGGGACCCT GGGGAACCCC
GGCGACGGCA CCCCCGGAGA CGAGAGCACA CCGGTGGACT TCCGCCACAT GGTGGGGGAC
TACAAGAAGA GCAAGTTCCT CGCCGAGCGG GCGGCGGAGT CGTTCCTGGC AAAGGGGTTG
CCGCTCGTGA TCGTGAACCC GTCGACCCCG GTGGGCCCGA TGGATGTGAA GCCTACGCCG
ACGGGAAAGA TCATCGTCGA CTTCCTGAAC GGCCGGATGC CCGCCTACCT GGACACGGGG
CTGAACCTGA TAGACGTGGA GGCTTGCGCG CGGGGGCATG TCCTGGCGGC GCGCAAGGGG
CGGGTCGGGG AAAAGTACAT CCTTGGGAAC CGCAACCTGA CCCTGGCCGA GATATTCGAG
ATGCTGTCCG GCATCACCGG GCTCAAGGCG CCGCGGGTGA AGCTCCCCTA CTATCCGATA
CTTATGGCCG CATACGTGAA CCATGCGCTG TCGGCCGTGA CAGGGAAAGA GCCGCTGATA
CCGCTTGCCG GCGTGCAGAT GGCGGCGAAG TTCATGTATT TCGATGCGGG GAAGGCGGTG
AGCGAGTTGG GGTTGCCGCT CTCCCCCGTG GAAGGCGCGC TGGATCGCGC CGTACAGTGG
TTCCGCAGCA ACGGCTACGT TAACCGATAA
 
Protein sequence
MKAFVTGATG FIGASIVREL LKDGWEVRVL ARPGSDRRNL SGLDIEIREG DLSDREALVQ 
ALSGCRALFH AAADYRLWTP TPEAMYDVNV KGTRAILSAA LAAGIEKVVY TSSVGTLGNP
GDGTPGDEST PVDFRHMVGD YKKSKFLAER AAESFLAKGL PLVIVNPSTP VGPMDVKPTP
TGKIIVDFLN GRMPAYLDTG LNLIDVEACA RGHVLAARKG RVGEKYILGN RNLTLAEIFE
MLSGITGLKA PRVKLPYYPI LMAAYVNHAL SAVTGKEPLI PLAGVQMAAK FMYFDAGKAV
SELGLPLSPV EGALDRAVQW FRSNGYVNR