Gene Gdia_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2040 
Symbol 
ID6975467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2259502 
End bp2260494 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content73% 
IMG OID643391570 
Product4-hydroxythreonine-4-phosphate dehydrogenase 
Protein accessionYP_002276415 
Protein GI209544186 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1995] Pyridoxal phosphate biosynthesis protein 
TIGRFAM ID[TIGR00557] 4-hydroxythreonine-4-phosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGCG GGGCGCGGAC GCTGCTGGCC CTGACGATGG GCGATCCGGC CGGGATCGGG 
CCGGAAATCA CCACTGCCGC CTGGCGTGCC CTGCGCGCGG GCGGCGGGCC GGCCTTCGTC
GTGCCCGGCG ATGCCGCCCT GCTGGCGGCC CATGCCCCGG TCCGGATCGT GGCGGACGTG
GCGCAGGCAG TGGCGGCCTT CACCGATGCG ATCCCGGTGC TGGCGGTCGA TCTGCCCGGC
CCGGTGCAAT CGGGGCGGCC GGACCCCGCC AACGTGCCCG CCATCACCGG CAGCATCGAC
TGCGCCGTGG CGCTGGCGCT GGCCGGGCAG GTGGACGGTG TTGTCACCAA TCCGATCAGC
AAGCTGGTGC TGAAGCAGGC CGGCTTCCGC CATCCGGGCC ATACCGAATA CCTGGCCGAA
CTCTGCGCCG TGCCGGGGCA GGAGGTCATG ATGCTGGCCT GCCCCGAATT GCGGGTGGTG
CCGGTGACCA TCCACATGGC GCTGCGCCGG GCGCTGGACA GCCTGACGAC GGCGGAAATC
GTGCGCTGCG GCCGCACCGC CGCCGCTGCC CTGCGCCGCG ATTTCGGGAT CGCGGCGCCG
CGCGTGGCCG TCGCCGGCCT GAACCCGCAT GCCGGCGAGG GCGGGGTGAT GGGGGACGAG
GAACGGACGA TCATCGCCCC CGCGCTGGAC GCATTGCGCG CCGACGGGAT CGCGGTCAGC
GGCCCCTGGC CGCCGGATAC GATGTTCACC CCCCTGGCCC GCGCCCGATA CGATGTGGCG
CTGTGCATGT ATCACGACCA GGCGCTGATC CCGCTGAAGA CGATCGACAT GGCGGGCGGG
GTCAACGTCA CGCTGGGCCT GCCCATCATC CGCACCTCGC CCGACCACGG CACCGCCTTC
GACATCGCGG GACAGGGCCT GGCCGACCCG TCCAGCCTGC TGGCCGCCCT GCGCCTGGCC
GACGAAATGA CCCGCAACCA GAGGGCCGCA TGA
 
Protein sequence
MEGGARTLLA LTMGDPAGIG PEITTAAWRA LRAGGGPAFV VPGDAALLAA HAPVRIVADV 
AQAVAAFTDA IPVLAVDLPG PVQSGRPDPA NVPAITGSID CAVALALAGQ VDGVVTNPIS
KLVLKQAGFR HPGHTEYLAE LCAVPGQEVM MLACPELRVV PVTIHMALRR ALDSLTTAEI
VRCGRTAAAA LRRDFGIAAP RVAVAGLNPH AGEGGVMGDE ERTIIAPALD ALRADGIAVS
GPWPPDTMFT PLARARYDVA LCMYHDQALI PLKTIDMAGG VNVTLGLPII RTSPDHGTAF
DIAGQGLADP SSLLAALRLA DEMTRNQRAA