Gene ECH74115_4373 details

Gene Information

Locus tag: ECH74115_4373
Symbol: ddtA
ID: 6966744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4049076
End bp: 4049987
Gene Length: 912 bp
Protein Length: 303 aa
Translation table: 11
GC content: 54%
IMG OID: 643388096
Product: tartrate dehydratase subunit alpha
Protein accession: YP_002272534
Protein GI: 209396443
COG category: [C] Energy production and conversion
COG ID: [COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain
TIGRFAM ID: [TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0282958
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 80
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAGCG AAAGTAATAA GCAACAGGCA GTGAATAAGT TGACGGAGAT TGTCGCTAAC 
TTTACCGCCA TGATTTCTAC CCGAATGCCT GATGACGTGG TGGATAAACT AAAACAGCTA
AAGGATGCCG AAACGTCGTC GATGGGGAAA ATTATCTACC ATACGATGTT CGACAACATG
CAAAAAGCGA TTGACCTGAA TCGTCCTGCC TGTCAGGACA CCGGGGAGAT TATGTTCTTC
GTTAAAGTCG GTTCCCGCTT CCCACTGCTT GGCGAGCTGC AAAGCATACT CAAACAAGCC
GTGGAAGAGG CGACCATCAA AGCGCCGCTG CGTCACAATG CGGTAGAAAT TTTTGACGAA
GTAAACACCG GCAAAAATAC CGGTAGCGGC GTACCGTGGG TCACCTGGGA TATCGTCCCC
GACGGTGACG ATGCGGAAAT CGAAGTTTAC ATGGCAGGCG GCGGCTGCAC GCTACCAGGC
CGCTCGAAAG TGTTAATGCC GTCAGAAGGC TACGAAGGCG TGGTGAAATT CGTCTTCGAA
AATATCTCCA CCCTCGCAGT AAACGCCTGT CCACCGGTAC TGGTGGGCGT GGGCATCGCC
ACCTCGGTGG AAACCGCCGC CGTACTCTCG CGTAAAGCCA TTTTGCGCCC GATTGGCAGC
CGCCACCCCA ATCCAAAAGC GGCAGAGCTG GAGCTACGCC TGGAAGAAGG ACTCAACCGT
CTGGGGATTG GTCCACAAGG GCTAACTGGC AACAGTTCAG TGATGGGCGT GCATATCGAA
TCTGCCGCCC GCCATCCGTC AACCATCGGC GTTGCTGTTT CTACCGGTTG CTGGGCGCAT
CGTCGCGGCA CACTGCTGGT TCATGCCGAT CTCACCTTTG AAAATCTGTC TCACACCCGG
AGCGCGTTAT GA
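
The gene sequence above is displayed in blocks of 10 bases, so it needs cleaning before any length or GC calculation. The following is a minimal Python sketch of that step; the helper names `clean_sequence` and `gc_fraction` are illustrative and not part of the source database. It also assumes the Start bp and End bp fields are inclusive coordinates, which is consistent with the listed gene length of 912 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGATGAGCG AAAGTAATAA GCAACAGGCA GTGAATAAGT TGACGGAGAT TGTCGCTAAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases per displayed line

# Assumption: coordinates are inclusive, so length = end - start + 1.
start_bp, end_bp = 4049076, 4049987
print(end_bp - start_bp + 1)  # 912, matching the Gene Length field
```

Passing the full sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared against the GC content field of the record.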
 
Protein sequence
MMSESNKQQA VNKLTEIVAN FTAMISTRMP DDVVDKLKQL KDAETSSMGK IIYHTMFDNM 
QKAIDLNRPA CQDTGEIMFF VKVGSRFPLL GELQSILKQA VEEATIKAPL RHNAVEIFDE
VNTGKNTGSG VPWVTWDIVP DGDDAEIEVY MAGGGCTLPG RSKVLMPSEG YEGVVKFVFE
NISTLAVNAC PPVLVGVGIA TSVETAAVLS RKAILRPIGS RHPNPKAAEL ELRLEEGLNR
LGIGPQGLTG NSSVMGVHIE SAARHPSTIG VAVSTGCWAH RRGTLLVHAD LTFENLSHTR
SAL