Gene ECH74115_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0003 
SymbolthrA 
ID6967964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp360 
End bp2816 
Gene Length2457 bp 
Protein Length818 aa 
Translation table11 
GC content53% 
IMG OID643384087 
Productbifunctional aspartokinase I/homeserine dehydrogenase I 
Protein accessionYP_002268610 
Protein GI209398223 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase
[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.726282 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGAAGT TCGGCGGTAC ATCAGTGGCA AATGCAGAAC GTTTTCTGCG GGTTGCCGAT 
ATTCTGGAAA GCAATGCCAG GCAGGGGCAG GTGGCCACCG TCCTCTCTGC CCCCGCCAAA
ATCACCAACC ACCTGGTGGC GATGATTGAA AAAACCATTA GCGGCCAGGA TGCTTTACCC
AATATCAGCG ATGCCGAACG TATTTTTGCC GAACTTCTGA CGGGACTCGC CGCCGCCCAG
CCGGGATTCC CGCTGGCGCA ATTGAAAACT TTCGTCGACC AGGAATTTGC CCAAATAAAA
CATGTCCTGC ATGGCATTAG TTTGTTAGGG CAGTGCCCGG ATAGCATTAA CGCTGCGCTG
ATTTGCCGTG GCGAGAAAAT GTCGATCGCC ATTATGGCCG GCGTATTAGA AGCGCGCGGT
CACAACGTTA CCGTTATCGA TCCGGTCGAA AAACTGCTGG CAGTGGGGCA TTACCTCGAA
TCTACTGTCG ATATTGCAGA GTCCACCCGC CGTATTGCGG CAAGTCGTAT TCCGGCTGAT
CACATGGTGC TGATGGCAGG TTTCACCGCC GGTAATGAAA AAGGCGAACT GGTGGTACTT
GGACGCAACG GTTCCGACTA CTCCGCGGCG GTGCTGGCTG CCTGTTTACG CGCCGATTGT
TGCGAGATTT GGACGGACGT TGACGGGGTA TATACCTGCG ACCCGCGTCA GGTGCCCGAT
GCGAGGTTGT TGAAATCGAT GTCCTACCAG GAAGCGATGG AGCTTTCCTA CTTCGGCGCT
AAAGTTCTTC ACCCCCGCAC CATTACCCCC ATCGCCCAGT TCCAGATCCC TTGCCTGATT
AAAAATACCG GAAATCCTCA AGCTCCAGGT ACGCTCATTG GTGCCAGTCG TGATGAAGAC
GAATTACCGG TCAAGGGCAT TTCCAATCTG AATAATATGG CAATGTTCAG CGTTTCCGGC
CCGGGGATGA AAGGAATGGT CGGCATGGCG GCGCGCGTCT TTGCTGCAAT GTCACGCGCC
CGTATTTCCG TGGTGCTGAT TACGCAATCA TCTTCCGAAT ACAGTATCAG TTTCTGCGTT
CCGCAAAGCG ACTGTGTGCG AGCTGAACGG GCAATGCAGG AAGAGTTCTA CCTGGAACTG
AAAGAAGGCT TACTGGAGCC GCTGGCGGTG ACGGAACGGC TGGCCATTAT CTCGGTGGTA
GGTGATGGTA TGCGCACCTT GCGTGGGATC TCGGCGAAAT TCTTTGCCGC GCTGGCCCGC
GCCAATATCA ACATTGTCGC TATTGCTCAG GGATCTTCTG AACGCTCAAT CTCTGTCGTG
GTAAATAACG ATGATGCGAC CACTGGCGTG CGCGTTACTC ATCAGATGCT GTTCAATACC
GATCAGGTTA TCGAAGTGTT TGTGATTGGC GTCGGTGGCG TTGGCGGTGC GCTGCTGGAG
CAACTGAAGC GTCAGCAAAG CTGGTTGAAG AATAAACATA TCGACTTACG TGTCTGCGGT
GTTGCTAACT CGAAGGCTCT GCTCACCAAT GTGCATGGCC TAAATCTGGA AAACTGGCAG
GAAGAACTGG CGCAAGCCAA AGAGCCGTTT AATCTCGGGC GCTTAATTCG CCTCGTGAAA
GAATATCATC TGCTGAACCC GGTCATTGTT GACTGCACCT CCAGCCAGGC AGTGGCGGAT
CAATATGCCG ACTTCCTGCG CGAAGGTTTC CACGTTGTCA CGCCGAACAA AAAGGCCAAC
ACCTCGTCGA TGGATTACTA CCATCTGTTG CGTCATGCGG CTGAAAAATC GCGGCGTAAA
TTCCTCTATG ACACCAACGT TGGGGCTGGA TTACCGGTTA TTGAGAACCT GCAAAATCTG
CTCAATGCTG GTGATGAATT GATGAAGTTC TCCGGCATTC TTTCAGGTTC GCTTTCTTAT
ATCTTCGGCA AGTTAGACGA AGGCATGAGT TTCTCCGAGG CGACTACGCT GGCGCGGGAA
ATGGGTTATA CCGAACCGGA TCCGCGAGAT GATCTTTCTG GTATGGATGT AGCGCGTAAA
CTATTAATTC TCGCTCGTGA AACGGGACGT GAACTGGAGC TGGCGGATAT TGAAATTGAA
CCTGTGCTGC CCGCAGAGTT TAACGCTGAG GGTGATGTTG CCGCTTTTAT GGCGAATCTG
TCACAGCTCG ACGATCTCTT TGCCGCGCGC GTGGCGAAGG CCCGTGATGA AGGAAAAGTT
TTGCGCTATG TTGGCAATAT TGATGAAGAT GGCGTCTGCC GCGTGAAGAT TGCCGAAGTG
GATGGTAATG ATCCGCTGTT CAAAGTGAAA AATGGCGAAA ACGCCCTGGC CTTTTATAGC
CACTATTATC AGCCGCTGCC GTTGGTGCTG CGCGGATATG GTGCGGGCAA TGACGTTACC
GCTGCCGGTG TCTTTGCCGA TCTGCTACGT ACCCTCTCAT GGAAGTTAGG AGTCTGA
 
Protein sequence
MLKFGGTSVA NAERFLRVAD ILESNARQGQ VATVLSAPAK ITNHLVAMIE KTISGQDALP 
NISDAERIFA ELLTGLAAAQ PGFPLAQLKT FVDQEFAQIK HVLHGISLLG QCPDSINAAL
ICRGEKMSIA IMAGVLEARG HNVTVIDPVE KLLAVGHYLE STVDIAESTR RIAASRIPAD
HMVLMAGFTA GNEKGELVVL GRNGSDYSAA VLAACLRADC CEIWTDVDGV YTCDPRQVPD
ARLLKSMSYQ EAMELSYFGA KVLHPRTITP IAQFQIPCLI KNTGNPQAPG TLIGASRDED
ELPVKGISNL NNMAMFSVSG PGMKGMVGMA ARVFAAMSRA RISVVLITQS SSEYSISFCV
PQSDCVRAER AMQEEFYLEL KEGLLEPLAV TERLAIISVV GDGMRTLRGI SAKFFAALAR
ANINIVAIAQ GSSERSISVV VNNDDATTGV RVTHQMLFNT DQVIEVFVIG VGGVGGALLE
QLKRQQSWLK NKHIDLRVCG VANSKALLTN VHGLNLENWQ EELAQAKEPF NLGRLIRLVK
EYHLLNPVIV DCTSSQAVAD QYADFLREGF HVVTPNKKAN TSSMDYYHLL RHAAEKSRRK
FLYDTNVGAG LPVIENLQNL LNAGDELMKF SGILSGSLSY IFGKLDEGMS FSEATTLARE
MGYTEPDPRD DLSGMDVARK LLILARETGR ELELADIEIE PVLPAEFNAE GDVAAFMANL
SQLDDLFAAR VAKARDEGKV LRYVGNIDED GVCRVKIAEV DGNDPLFKVK NGENALAFYS
HYYQPLPLVL RGYGAGNDVT AAGVFADLLR TLSWKLGV