Gene EcolC_0638 details


Gene Information

Locus tag: EcolC_0638
Symbol:
ID: 6066378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 688534
End bp: 689445
Gene Length: 912 bp
Protein Length: 303 aa
Translation table: 11
GC content: 54%
IMG OID: 641600045
Product: tartrate dehydratase subunit alpha
Protein accession: YP_001723641
Protein GI: 170018687
COG category: [C] Energy production and conversion
COG ID: [COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain
TIGRFAM ID: [TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region
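
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a short calculation. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 688534
end_bp = 689445

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the listed 912 bp and 303 aa.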


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000169351
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAGCG AAAGTAATAA GCAACAGGCA GTGAATAAGT TGACAGAGAT TGTCGCTAAC 
TTTACCGCCA TGATTTCTAC CCGAATGCCT GATGACGTGG TGGATAAACT AAAACAGCTA
AAGGATGCCG AAACGTCGTC GATGGGGAAA ATTATCTACC ATACGATGTT CGACAACATG
CAAAAAGCGA TTGACCTGAA TCGTCCTGCC TGTCAGGACA CCGGGGAGAT TATGTTCTTC
GTTAAAGTCG GTTCCCGCTT CCCACTGCTT GGCGAGCTGC AAAGCATACT CAAACAAGCC
GTGGAAGAGG CAACCGTCAA AGCGCCACTA CGTCACAATG CGGTAGAAAT TTTTGACGAA
GTAAACACCG GCAAAAATAC CGGTAGCGGC GTACCGTGGG TCACCTGGGA CATCATCCCC
GACAATGACG ATGCGGAAAT CGAAGTTTAC ATGGCAGGCG GCGGCTGCAC GCTACCTGGC
CGCTCGAAAG TGTTAATGCC GTCAGAAGGC TACGAAGGCG TGGTGAAATT CGTCTTCGAA
AATATCTCCA CCCTCGCCGT AAACGCCTGT CCACCGGTAC TGGTGGGCGT GGGCATCGCC
ACCTCGGTGG AAACCGCCGC CGTACTCTCG CGTAAAGCCA TTTTGCGCCC GATTGGCAGC
CGCCATCCCA ATCCAAAAGC GGCAGAACTG GAGCTACGCC TGGAAGAAGG ACTCAACCGT
CTGGGGATTG GTCCACAAGG GCTGACCGGC AACAGTTCAG TGATGGGCGT ACATATCGAA
TCTGCCGCCC GCCATCCGTC AACCATCGGC GTTGCTGTCT CTACCGGCTG CTGGGCGCAT
CGTCGCGGCA CGCTGCTGGT TCATGCCGAT CTCACCTTTG AAAATCTGTC TCACACCCGG
AGCGCGTTAT GA
 
Protein sequence
MMSESNKQQA VNKLTEIVAN FTAMISTRMP DDVVDKLKQL KDAETSSMGK IIYHTMFDNM 
QKAIDLNRPA CQDTGEIMFF VKVGSRFPLL GELQSILKQA VEEATVKAPL RHNAVEIFDE
VNTGKNTGSG VPWVTWDIIP DNDDAEIEVY MAGGGCTLPG RSKVLMPSEG YEGVVKFVFE
NISTLAVNAC PPVLVGVGIA TSVETAAVLS RKAILRPIGS RHPNPKAAEL ELRLEEGLNR
LGIGPQGLTG NSSVMGVHIE SAARHPSTIG VAVSTGCWAH RRGTLLVHAD LTFENLSHTR
SAL
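
The relationship between the gene sequence and the protein sequence above can be illustrated by translating codons with the amino acid assignments of translation table 11. The following is a minimal Python sketch, not the database's own method. The names `CODON_TABLE` and `translate` are hypothetical helpers, and the sketch assumes the sequence is given in the coding orientation starting at the first codon. Table 11 differs from the standard code only in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG order.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the 10-base block layout.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGATGAGCG AAAGTAATAA GCAACAGGCA"))  # MMSESNKQQA
print(translate("AGCGCGTTAT GA"))                     # SAL
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `translate` would allow the whole protein to be compared in the same way.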