Gene EcSMS35_3354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3354 
SymbolddtA 
ID6146886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3431981 
End bp3432892 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content54% 
IMG OID641618183 
Producttartrate dehydratase subunit alpha 
Protein accessionYP_001745333 
Protein GI170683099 
COG category[C] Energy production and conversion 
COG ID[COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain 
TIGRFAM ID[TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.374954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGCG AAAGTAATAA GCAACAGGCA GTGAATAAGT TGACGGAGAT TGTCGCTAAC 
TTTACCGCCA TGATTTCTAC CCGAATGCCC GATGACGTGG TGGACAAACT AAAACAGCTA
AAGGATGCCG AAACATCGTC GATGGGGAAA ATCATCTACC ACACGATGTT CGATAACATG
CAAAAAGCGA TCGACCTGAA TCGTCCTGCC TGTCAGGACA CCGGCGAAAT CATGTTTTTT
GTTAAGGTCG GTTCCCGTTT CCCACTGCTT GGCGAGCTGC AAAGCATACT CAAACAAGCC
GTGGAAGAGG CAACCGTCAA AGCGCCACTG CGTCACAATG CGGTAGAAAT TTTTGACGAA
GTAAACACCG GCAAAAATAC CGGCAGCGGT GTACCGTGGG TCACCTGGGA CATCATCCCC
GACAATGACG ATGCGGAAAT CGAAGTTTAC ATGGCAGGCG GCGGCTGCAC GCTACCAGGC
CGCTCGAAAG TGTTAATGCC GTCAGAAGGC TACGAAGGCG TAGTGAAATT CGTCTTCGAA
AATATCTCCA CCCTCGCCGT AAACGCCTGT CCGCCGGTAC TTGTGGGCGT TGGCATCGCT
ACCTCGGTGG AAACCGCCGC CGTGCTCTCG CGTAAAGCCA TTTTGCGCCC GATTGGCAGC
CGCCACCCCA ATCCGAAAGC GGCAGAGCTG GAGCTACGCC TGGAAGAAGG ACTCAACCGT
CTGGGGATTG GTCCACAAGG GCTGACTGGC AACAGTTCAG TGATGGGCGT ACATATCGAA
TCTGCCGCCC GCCATCCGTC AACCATCGGC GTTGCTGTTT CTACAGGCTG CTGGGCGCAT
CGTCGCGGCA CACTGCTGGT TCATGCCGAT CTCACCTTTG AAAATCTGTC TCACACCCGG
AGCGCGTTAT GA
 
Protein sequence
MMSESNKQQA VNKLTEIVAN FTAMISTRMP DDVVDKLKQL KDAETSSMGK IIYHTMFDNM 
QKAIDLNRPA CQDTGEIMFF VKVGSRFPLL GELQSILKQA VEEATVKAPL RHNAVEIFDE
VNTGKNTGSG VPWVTWDIIP DNDDAEIEVY MAGGGCTLPG RSKVLMPSEG YEGVVKFVFE
NISTLAVNAC PPVLVGVGIA TSVETAAVLS RKAILRPIGS RHPNPKAAEL ELRLEEGLNR
LGIGPQGLTG NSSVMGVHIE SAARHPSTIG VAVSTGCWAH RRGTLLVHAD LTFENLSHTR
SAL