Gene EcolC_3603 details

Gene Information

Locus tag: EcolC_3603
Symbol: pdxA
ID: 6065044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3944774
End bp: 3945763
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 56%
IMG OID: 641603021
Product: 4-hydroxythreonine-4-phosphate dehydrogenase
Protein accession: YP_001726544
Protein GI: 170021590
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1995] Pyridoxal phosphate biosynthesis protein
TIGRFAM ID: [TIGR00557] 4-hydroxythreonine-4-phosphate dehydrogenase


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.179437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0001268
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTTAAAA CCCAACGTGT TGTGATCACT CCCGGCGAGC CCGCCGGGAT TGGCCCGGAC 
TTAGTTGTCC AGCTTGCACA GCGTGAGTGG CCGGTCGAAC TGGTTGTTTG TGCCGATGCC
ACTCTCCTTA CCGACCGGGC AGCGATGCTC GGTTTGCCGC TCACCCTCCG CCCTTATTCC
CCCAACTCCC CTGCACAACC GCAAACTACG GGCACATTAA CGCTACTTCC TGTCGCGCTA
CGTGAATCTG TCACTGCGGG GCAGTTAGCG ATTGAAAATG GACATTACGT GGTGGAGACG
CTGGCGCGAG CGTGTGATGG CTGTCTGAAC GGTGAATTTG CTGCGCTGAT CACAGGCCCC
GTGCATAAAG GCGTCATTAA CGACGCAGGC ATTCCGTTTA CCGGTCATAC CGAGTTTTTC
GAAGAGCGTT CGCAGGCGAA AAAAGTGGTG ATGATGCTGG CGACCGAAGA ACTTCGCGTG
GCGCTGGCAA CGACGCATTT ACCGCTGCGC GATATCGCAG ATGCTATCAC CCCTGCACTT
TTGCACGAAG TGATTGCTAT TTTGCATCAC GATTTGCGTA CAAAATTTGG TATTGCCGAA
CCGCGCATTC TGGTCTGCGG GCTGAATCCG CACGCGGGCG AAGGCGGTCA TATGGGTACG
GAAGAGATAG ACACCATTAT TCCGGTGCTC GACGAGCTGC GGGCGCAGGG GATGAAACTC
AACGGGCCGC TGCCTGCCGA TACCCTGTTT CAGCCGAAAT ATCTCGATAA CGCCGACGCC
GTGCTGGCGA TGTACCACGA TCAGGGTCTT CCCGTGCTAA AATACCAGGG CTTCGGGCGC
GGTGTGAACA TTACGCTGGG CCTGCCCTTT ATTCGCACAT CAGTGGACCA CGGCACCGCG
CTTGAACTGG CGGGACGTGG CGAAGCCGAT GTCGGCAGTT TTATTACGGC GCTTAATCTC
GCCATCAAAA TGATTGTTAA CACCCAATGA
 
Protein sequence
MVKTQRVVIT PGEPAGIGPD LVVQLAQREW PVELVVCADA TLLTDRAAML GLPLTLRPYS 
PNSPAQPQTT GTLTLLPVAL RESVTAGQLA IENGHYVVET LARACDGCLN GEFAALITGP
VHKGVINDAG IPFTGHTEFF EERSQAKKVV MMLATEELRV ALATTHLPLR DIADAITPAL
LHEVIAILHH DLRTKFGIAE PRILVCGLNP HAGEGGHMGT EEIDTIIPVL DELRAQGMKL
NGPLPADTLF QPKYLDNADA VLAMYHDQGL PVLKYQGFGR GVNITLGLPF IRTSVDHGTA
LELAGRGEAD VGSFITALNL AIKMIVNTQ
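
Consistency check (illustrative)

The listed coordinates, gene length, and protein length can be checked against each other with simple arithmetic: the span from start to end is inclusive, and the coding sequence holds one codon per residue plus a stop codon. The following is a minimal sketch of that check, not part of the original record. The function names (clean_sequence, coordinate_length, expected_protein_length, gc_percent) are hypothetical helpers introduced only for this example, and the input is assumed to be a plain nucleotide string in the 10-character block layout used above.

```python
def clean_sequence(raw: str) -> str:
    """Join the space-separated 10-character blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def coordinate_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for an unspliced CDS: codon count minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values taken from the record above.
gene_length = coordinate_length(3944774, 3945763)
print(gene_length)                           # 990, matching the listed gene length
print(expected_protein_length(gene_length))  # 329, matching the listed protein length

# First line of the gene sequence, used here only as a short example input.
first_line = "ATGGTTAAAA CCCAACGTGT TGTGATCACT CCCGGCGAGC CCGCCGGGAT TGGCCCGGAC"
seq = clean_sequence(first_line)
print(len(seq))                              # 60
```

Passing the full cleaned gene sequence to gc_percent gives a value that can be compared with the listed GC content.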