Gene Gdia_1094 details

Gene Information

Locus tag: Gdia_1094
Symbol:
ID: 6974497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1229127
End bp: 1230104
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 69%
IMG OID: 643390622
Product: ADP-L-glycero-D-manno-heptose-6-epimerase
Protein accession: YP_002275492
Protein GI: 209543263
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase
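
The start, end, gene length and protein length fields above are tied together by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 1229127
end_bp = 1230104

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which is not counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

With the values above this should reproduce the listed 978 bp and 325 aa.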


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0181056
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCATCA TCACCGGCGG GGCCGGCTTC ATCGGATCGT GCCTGCAGGC GGCGCTTCAG 
GCGCGGGGCG AGCAGACGGT CGTCGTCGAC TGGCTGGGCA GCCAGGGCAA GTGGCGCAAC
ATCGCGCGCC ACCCGCCCGA CCACCTGCTG ACGCCCGAGG CACTGGACGA TTTCCTGGCC
GGCCGGCCCG CCGTGTCCGC CATCCTGCAT ATGGGCGCGA TCAGCGAGAC GACGGCCTGT
GACGGCGACC TGGCCTGGCG CACCAATGTC GACCTGTCGG CGCGGCTGTG GGCGTGGTGC
GCGCGGCATG GCGTGCGCTT CATCTACGCG TCCTCGGCCG CGACCTATGG CGCGGCCGGG
GACGAATCGT TTTCCGACGA TCCCGCGGGG CTGGAGGCAT TGCGGCCGCT GAACCTGTAC
GGCTGGTCGA AGCATGTGTT CGACCGGCAG GTCGTGGCCG GCCTGGCCCG CGGCGCGTCG
TCGCCGCCGC AATGGGCGGG ACTGAAATTC TTCAACGTCT ATGGCCCGAA CGAATATCAC
AAGGGCCCGA TGGTCTCGGT CGTGAAGGTC AAGTACGACG AGGTCCGCCG GGGCCAGCCG
GCGCGGCTGT TCCGCTCGGA CGTTCCCGGC CTGGCCGATG GGGCGCAGGC GCGGGATTTC
ATCTGGGTCG GCGACGTGGT GGACGTGACG CTGTGGCTGC TGGACAGCCC GCATGTCAGC
GGCCTGTTCA ATTGCGGCAC CGGGGTCGCG CGCAGCTACC TGGACCTGGC CCATGCGGTC
TGCGACGCCG CCGGCCGGCC GCGCCAGGTC GAATTCGTCG ACATGCCTGA CGCGCTGCGC
GGCCATTACC AGTCCTATAC CCGCGCCGAC ATGACGCGGC TGCGCCAGGC GGGATATGCC
CGGCCCTTCA CGTCGCTGGA AGACGGCATC CGTCGCTACG TCCAGGATTA CCTGGCCACC
GACGACGCCT ACCTGTAA
 
Protein sequence
MIIITGGAGF IGSCLQAALQ ARGEQTVVVD WLGSQGKWRN IARHPPDHLL TPEALDDFLA 
GRPAVSAILH MGAISETTAC DGDLAWRTNV DLSARLWAWC ARHGVRFIYA SSAATYGAAG
DESFSDDPAG LEALRPLNLY GWSKHVFDRQ VVAGLARGAS SPPQWAGLKF FNVYGPNEYH
KGPMVSVVKV KYDEVRRGQP ARLFRSDVPG LADGAQARDF IWVGDVVDVT LWLLDSPHVS
GLFNCGTGVA RSYLDLAHAV CDAAGRPRQV EFVDMPDALR GHYQSYTRAD MTRLRQAGYA
RPFTSLEDGI RRYVQDYLAT DDAYL
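
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, estimate GC content, and translate the coding sequence. It assumes that translation table 11 uses the same amino acid assignments as the standard genetic code (the two differ in permitted start codons, not in codon meanings). The helper names `clean_sequence`, `gc_fraction` and `translate` are hypothetical and introduced only for illustration.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a cleaned DNA sequence codon by codon, stopping at the first stop.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = clean_sequence(
    "ATGATCATCA TCACCGGCGG GGCCGGCTTC ATCGGATCGT GCCTGCAGGC GGCGCTTCAG"
)
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

Translating the first 60 bases should give MIIITGGAGFIGSCLQAALQ, matching the first 20 residues of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.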