Gene Gdia_1830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1830 
Symbol 
ID6975252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2032976 
End bp2034067 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content69% 
IMG OID643391355 
Producthopanoid-associated sugar epimerase 
Protein accessionYP_002276205 
Protein GI209543976 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR03466] hopanoid-associated sugar epimerase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCCGC ATCGTTTCGC CGCCAACCGG CCGCCCCATG GGGGTTTCCC CCGGGACGTC 
CGCCTGCTGA CATATACCGG AGCATCCGAT CGGATCATGA CTGCCCCCAC GCTTGTTACC
GGCGCGACCG GTTTTGTCGG TTCGGCCGTT GCCCGTACGC TTCTCCAGCG GGGGCATTCG
CTGCGCCTGA TGGCGCGCAA GGGGGCGGAC CTGACCAATA TCCGCGACCT GCCGGCGGAA
CTGGTCGAAG GCGACCTGTC CGCGCCCGCC ACCTTCGCCG ACGCGGTGCG GGGGTGTCGC
TACGTCTTCC ATGTCGCCGC CGACTATCGG CTGTGGGTGC CCGACCCCGC GCCCATGATG
ACCGCGAATG TCGAGGGAAC GCGCCGCCTG ATGCTGGCGG CGCAGGACGC GGGGGTGGAA
CGGATCGTCT ATTGCTCGTC GGTCGCGGCG CTGGGGCTGA TCGGCGACGG CACCGTGTCG
GACGAGGACA CGCCGGTTCA CGAGCACGCG GTGATCGGGA TCTACAAGCG GTCCAAATAC
CGGGCGGAGC AGGAGGTCCT GCGCCTGGTC CGCGAACGCG GCCTGCCGGC GGTGATCGTC
AACCCGTCCA CCCCCGTGGG CCCGCGCGAC ATCAAGCCGA CGCCGACGGG CCAGATGATC
CTGGATTGCG CGGCGGGGCG CATGCCGGCC TATGTCGATA CCGGGGTGAA CATCGTCCAT
GTCGACGACG TGGCCGAGGG CCACGTCCTG GCGCTGGAAC GCGGCCGGGC GGGTGAGAAA
TACATCCTGG GCGGCCAGAA TTTCCTGCTG CGCGACCTGT TCGCCATGAC GGCGGACATC
GCGGGCGTGC GGCCGCCGCG CGTCAGCCTG CCGCAATCGG TGATCTGGCC GGTGGCGGTG
GTGTCGGAAT GGCTGTCGCG CGGCTTCGGC ATCGCCCCGC GCGTCACGCG CGAGATGCTG
GCCATGTCGC ACAAGAAGAT GTTCTTTTCC TCGGCCAAGG CCGAACGGGA GCTGGGCTAT
GCCCCGCGCC CGGCGCGCGA CGCGGTGGCG GATGCCGTGG CCTGGTTCCG CCAGAACGGC
ATGCTGGGCT AG
 
Protein sequence
MVPHRFAANR PPHGGFPRDV RLLTYTGASD RIMTAPTLVT GATGFVGSAV ARTLLQRGHS 
LRLMARKGAD LTNIRDLPAE LVEGDLSAPA TFADAVRGCR YVFHVAADYR LWVPDPAPMM
TANVEGTRRL MLAAQDAGVE RIVYCSSVAA LGLIGDGTVS DEDTPVHEHA VIGIYKRSKY
RAEQEVLRLV RERGLPAVIV NPSTPVGPRD IKPTPTGQMI LDCAAGRMPA YVDTGVNIVH
VDDVAEGHVL ALERGRAGEK YILGGQNFLL RDLFAMTADI AGVRPPRVSL PQSVIWPVAV
VSEWLSRGFG IAPRVTREML AMSHKKMFFS SAKAERELGY APRPARDAVA DAVAWFRQNG
MLG