Gene EcSMS35_2476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2476 
SymbolpdxB 
ID6143872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2523580 
End bp2524716 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID641617349 
Producterythronate-4-phosphate dehydrogenase 
Protein accessionYP_001744521 
Protein GI170681474 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAATCC TTGTTGATGA AAATATGCCT TATGCCCGCG ACTTATTTAG CCGTTTGGGT 
GAGGTGACCG CGGTTCCCGG GCGTCCAATT CCCGTCGCTC AACTGGCCGA CGCGGATGCG
CTGATGGTGC GTTCGGTCAC GAAAGTGAAT GAATCTTTGC TGGCAGGAAA ACCCATTAAA
TTTGTTGGCA CTGCCACAGC GGGGACCGAC CATGTCGATG AAGCGTGGTT AAAGCAGGCG
GGAATTGGTT TTTCCGCTGC ACCTGGCTGT AATGCGATTG CGGTGGTGGA ATATGTTTTC
TCCTCCCTGC TGATGCTTGC CGAACGCGAT GGATTTTCAC TGCACGAGCG TACCGTGGGG
ATCGTGGGCG TTGGTAACGT TGGTCGCCGT TTACAGGCAC GACTGGAAGC GTTAGGGATT
AAAACCTTAC TTTGCGATCC GCCTCGCGCC GACCGTGGAG ATGAGGGTGA TTTCCGCTCG
CTGGATGAAT TAGTTCAGCA TGCGGATATT CTGACTTTCC ATACGCCACT CTTTAAAGAC
GGGCCGTACA AAACGCTGCA TCTGGCGGAT GAAAAACTGA TCCGCAGCCT GAAACCAGGA
GCGATTCTGA TTAACGCCTG CCGTGGCGCA GTTGTCGATA ATACCGCCCT GCTGACCTGC
CTGAACGAAG GCCAGAAGTT AAGCGTAGTG CTGGATGTCT GGGAAGGCGA ACCGGAACTT
AACGTGGAAT TGCTGAAAAA AGTGGATATC GGCACGCCGC ATATCGCAGG CTATACCCTG
GAAGGTAAAG CACGCGGTAC TACGCAAGTG TTTGAAGCTT ATAGCAAGTT TATTGGGCAT
GAACAGCACG TTGCGCTGGA TACATTACTG CCCGCGCCAG AGTTTGGTCG CATTACGCTG
CATGGCCCGC TCGATCAGCC AACGCTGAAA AGGCTGGTGC ATTTGGTGTA TGATGTGCGC
CGCGATGACG CGCCACTGCG TAAAGTCGCC GGGATACCGG GTGAGTTCGA TAAGCTGCGC
AAAAACTATC TTGAGCGCCG TGAATGGTCA TCTCTGTATG TAATTTGTGA TGACGCCAGT
GCGGCATCAT TGCTGTGTAA ACTGGGTTTT AACGCCGTCC ATCATCCGGC ACGTTAA
 
Protein sequence
MKILVDENMP YARDLFSRLG EVTAVPGRPI PVAQLADADA LMVRSVTKVN ESLLAGKPIK 
FVGTATAGTD HVDEAWLKQA GIGFSAAPGC NAIAVVEYVF SSLLMLAERD GFSLHERTVG
IVGVGNVGRR LQARLEALGI KTLLCDPPRA DRGDEGDFRS LDELVQHADI LTFHTPLFKD
GPYKTLHLAD EKLIRSLKPG AILINACRGA VVDNTALLTC LNEGQKLSVV LDVWEGEPEL
NVELLKKVDI GTPHIAGYTL EGKARGTTQV FEAYSKFIGH EQHVALDTLL PAPEFGRITL
HGPLDQPTLK RLVHLVYDVR RDDAPLRKVA GIPGEFDKLR KNYLERREWS SLYVICDDAS
AASLLCKLGF NAVHHPAR