Gene EcolC_2018 details


Gene Information

Locus tag: EcolC_2018
Symbol:
ID: 6068000
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2228115
End bp: 2229761
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 51%
IMG OID: 641601430
Product: tartrate/fumarate subfamily Fe-S type hydro-lyase beta subunit
Protein accession: YP_001724989
Protein GI: 170020035
COG category: [C] Energy production and conversion
COG ID: [COG1838] Tartrate dehydratase beta subunit/Fumarate hydratase class I, C-terminal domain
        [COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain
TIGRFAM ID: [TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region
            [TIGR00723] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, beta region
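
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive (an assumption, although it agrees with the listed gene length), and that the protein length excludes the stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2228115
end_bp = 2229761
gene_length_bp = 1647
protein_length_aa = 548

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS has one codon per residue plus one stop codon.
codons = gene_length_bp // 3

print(span == gene_length_bp)           # coordinates agree with the gene length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop codon
```

All three checks hold for this record: 2229761 - 2228115 + 1 = 1647, and 1647 / 3 = 549 codons, that is, 548 residues plus the stop codon.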


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAACA AACCCTTTCA TTATCAGGCT CCTTTTCCAC TCAAAAAAGA TGATACTGAG 
TATTACCTGC TAACCAGCGA ACACGTTAGC GTATCTGAAT TTGAAGGGCA GGAGATTTTG
AAAGTCGCAC CCGAAGCGTT AACTCTGTTG GCGCGTCAGG CGTTTCATGA TGCGTCGTTT
ATGCTGCGTC CGGCTCACCA ACAACAGGTG GCCGACATTC TGCGTGACCC GGAGGCCAGC
GAAAATGATA AATATGTGGC GCTGCAATTC CTGCGTAACT CCGACATCGC GGCGAAAGGC
GTTCTGCCAA CCTGTCAGGA TACCGGCACT GCGATTATTG TTGGTAAAAA AGGGCAGCGT
GTATGGACCG GTGGCTGTGA TGAAGCGGCG CTGGCGCGCG GTGTCTATAA CACTTATATC
GAAGATAATT TGCGCTACTC GCAAAACGCG CCGCTGGATA TGTATAAAGA GGTGAATACC
GGCACCAATC TGCCAGCGCA GATCGATCTT TATGCCGTTG ATGGCGACGA GTACAAATTC
CTCTGTATCG CCAAAGGTGG TGGTTCGGCA AACAAGACGT ATCTCTATCA GGAAACCAAA
GCGTTACTAA CGCCGGGGAA ACTGAAAAAT TACCTGGTTG AGAAGATGCG CACGCTGGGT
ACGGCGGCCT GTCCTCCGTA TCATATTGCG TTCGTAATTG GTGGAACTTC TGCAGAAACT
AACCTTAAAA CAGTGAAACT GGCTTCCGCT AAATACTATG ACGAACTGCC AACGGAAGGG
AATGAGCACG GTCAGGCATT CCGCGATGTG GAACTGGAAA AAGAATTGCT GATCGAAGCG
CAAAATCTCG GTCTGGGTGC GCAGTTTGGT GGTAAATACT TCGCTCACGA CATACGCGTG
ATTCGCCTGC CACGTCACGG CGCATCCTGT CCGGTTGGTA TGGGTGTTTC CTGCTCTGCT
GACCGTAATA TCAAAGCGAA GATCAACCGT CAGGGGATCT GGATCGAAAA ACTGGAACAT
AATCCAGGTA AATATATCCC TGAAGAGCTG CGTAAAGCGG GAGAAGGCGA AGCGGTGCGC
GTTGACCTTA ACCGTCCGAT GAAAGAGATC CTCGCACAGT TGTCGCAGTA TCCCGTTTCT
ACACGCTTAT CGCTTAACGG CACGATTATC GTCGGTCGTG ATATTGCTCA CGCCAAACTG
AAAGAGCGGA TGGATAACGG TGAAGGGCTG CCGCAGTACA TTAAAGATCA TCCGATTTAC
TACGCGGGTC CGGCCAAAAC GCCGGAAGGT TATGCCTCCG GTTCTCTTGG CCCAACGACC
GCCGGACGGA TGGATTCTTA TGTCGATCAA CTGCAAGCGC AGGGCGGAAG TATGATCATG
CTGGCGAAAG GCAACCGCAG CCAGCAGGTG ACGGATGCCT GTAAAAAACA CGGCGGCTTC
TACCTTGGCA GTATCGGTGG TCCGGCCGCT GTATTGGCGC AGGGAAGTAT TAAGAGCCTG
GAATGTGTTG AATATCCGGA ACTGGGAATG GAAGCCATCT GGAAAATTGA AGTGGAAGAT
TTCCCGGCGT TTATCCTTGT GGATGATAAA GGAAATGACT TCTTCCAGCA GATACAACTC
ACACAGTGCA CTCGCTGTGT GAAATAA
 
Protein sequence
MSNKPFHYQA PFPLKKDDTE YYLLTSEHVS VSEFEGQEIL KVAPEALTLL ARQAFHDASF 
MLRPAHQQQV ADILRDPEAS ENDKYVALQF LRNSDIAAKG VLPTCQDTGT AIIVGKKGQR
VWTGGCDEAA LARGVYNTYI EDNLRYSQNA PLDMYKEVNT GTNLPAQIDL YAVDGDEYKF
LCIAKGGGSA NKTYLYQETK ALLTPGKLKN YLVEKMRTLG TAACPPYHIA FVIGGTSAET
NLKTVKLASA KYYDELPTEG NEHGQAFRDV ELEKELLIEA QNLGLGAQFG GKYFAHDIRV
IRLPRHGASC PVGMGVSCSA DRNIKAKINR QGIWIEKLEH NPGKYIPEEL RKAGEGEAVR
VDLNRPMKEI LAQLSQYPVS TRLSLNGTII VGRDIAHAKL KERMDNGEGL PQYIKDHPIY
YAGPAKTPEG YASGSLGPTT AGRMDSYVDQ LQAQGGSMIM LAKGNRSQQV TDACKKHGGF
YLGSIGGPAA VLAQGSIKSL ECVEYPELGM EAIWKIEVED FPAFILVDDK GNDFFQQIQL
TQCTRCVK
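
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch of both steps, not the tool that produced this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon-to-amino-acid assignments used here are those of the standard genetic code, which table 11 shares (the two tables differ only in permitted start codons).

```python
from itertools import product

# Codon table in TCAG order; same amino acid assignments as translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGTCAAACA AACCCTTTCA TTATCAGGCT CCTTTTCCAC TCAAAAAAGA TGATACTGAG"
last_line = "ACACAGTGCA CTCGCTGTGT GAAATAA"

print(translate(first_line))  # MSNKPFHYQAPFPLKKDDTE
print(translate(last_line))   # TQCTRCVK
```

The two printed fragments match the start and end of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a figure that can be compared with the GC content field.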