Gene Gdia_3322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3322 
Symbol 
ID6976762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3631832 
End bp3633310 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content76% 
IMG OID643392833 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_002277664 
Protein GI209545435 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0650972 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGTA CCTGTACGTC GCTTCTTCTT CCCGATCCCA CGCAGATGGG GCTGATCGAC 
CGCGCCGCCA GCCGCACCGT GCCGGTGCGC GACCTGATGG AAAACGCGGG CCGGGCCGTG
GCGCGGGCCG TGCTGCGGCA TGTCAGGCCG TGCCGGGTGC TGGTCCTATG CGGCCCGGGG
AATAACGGCG GCGACGGCTA TGTCGCCGCC CGCCGGCTGG TCCAGGCCGG CTGGCCGGTC
GCGGTCGCCG CCCTGGCGCC GCCCCGTCCG GGCGGGGACG CGGCCGCCGC CGCCGCCGAC
TGGCGCGGCC CCCGCGTACC CTTCACGGCG GAAGAGGCCG CGCGGGCCGA CCTCGTGGTC
GATGCGGTGT TCGGCGCCGG CCTCAGCCGC GATATCGATG CCGGCCTGGC ATCGGTGCTG
GCGGCGGCCC GCCGCGTGGT CGCGGTGGAC ATGCCCAGCG GCATCGACGG CGGCACCGGC
GCGGTCCGGG GCTATGCCCC GCACGCCGAA ATGACGGTCA CGTTCGTGCG CGCCAAGCCG
GGCCATCTGC TGCTGCCGGG GCGCGAGCGG CTGGGGCGGC TGGTGGTGGC GGATATCGGC
ATGCCCGACA CGGTCTGGGA CGCGGTGGAC ATCCGCACCT GGCGGAACGA GCCAGGGCTG
TGGCGGGTGC CGGGCGACGC GGTGGACAGT CACAAATACG CGCGCGGCGT CGTCAGCATC
TGCGGCGGCG CCACGATGCC GGGCGCCGCC CGCCTGTCCG CCGCCGGGGC CCGGGCGGCG
GGCGCGGGGC TGGTGCGTCT GGCGGCGGGG GCGGCAGCGG ATCTGTACCG GCTGGGCGAA
CCCGGGCTGG TGGTCGATGG CGGCGACCTG ACCGACCTGC TGGCCGACCG GCGCCGCGCG
GTCTGGGTCT GCGGCCCCGG CCTGACGGTG GACGAGGTCG ATCATACATT GCCGATCCTG
GTCGGGGCGG GGCGCACGGT GCTGGCCGAT GCCGGGGCCT TTGCCGTGGC GGCGGACCAG
CCGGACCGGC TGCGGGGGGT GGCGGTGATC ACCCCGCATG CCGGGGAATT CGCCCGCGTG
TTCGGCCATC CCGGCGACGA CAGGCCGGCG GCCGCGCGCG CCGCCGCCGC GCGAACCGGC
GCGGTGGTGG TGCTGAAGGG CCCCGATACC GTGGTCGCCG CGCCGGACGG GCGGGTCGCG
ATCAATGCCC ACGCGTCCTC GGCGCTGGCC ACCGCCGGGT CGGGCGACAC GTTGACCGGG
GTGATCGCGG CATTGCTGGC GGCGGGGATG GAGCCGTGGC CGGCCGCCTG CGCCGGGGTG
TGGATGCATG GCGAAGCCGG GATGCGGGCC GGCCCCTGGC CGCTGGCCGA ACAGTTCGAC
CAGCATCTGG GCGCCGCGCG GGCGCGGGCG GTGGAACTGG GCGCGCGGCG CCCGGCCTAT
CGGCATTGTC GGCTTGCCGG ATCGGGTCTC GTGCGCTAA
 
Protein sequence
MDGTCTSLLL PDPTQMGLID RAASRTVPVR DLMENAGRAV ARAVLRHVRP CRVLVLCGPG 
NNGGDGYVAA RRLVQAGWPV AVAALAPPRP GGDAAAAAAD WRGPRVPFTA EEAARADLVV
DAVFGAGLSR DIDAGLASVL AAARRVVAVD MPSGIDGGTG AVRGYAPHAE MTVTFVRAKP
GHLLLPGRER LGRLVVADIG MPDTVWDAVD IRTWRNEPGL WRVPGDAVDS HKYARGVVSI
CGGATMPGAA RLSAAGARAA GAGLVRLAAG AAADLYRLGE PGLVVDGGDL TDLLADRRRA
VWVCGPGLTV DEVDHTLPIL VGAGRTVLAD AGAFAVAADQ PDRLRGVAVI TPHAGEFARV
FGHPGDDRPA AARAAAARTG AVVVLKGPDT VVAAPDGRVA INAHASSALA TAGSGDTLTG
VIAALLAAGM EPWPAACAGV WMHGEAGMRA GPWPLAEQFD QHLGAARARA VELGARRPAY
RHCRLAGSGL VR