Gene EcHS_A0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0003 
SymbolthrA 
ID5593169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp336 
End bp2798 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content53% 
IMG OID640919192 
Productbifunctional aspartokinase I/homeserine dehydrogenase I 
Protein accessionYP_001456787 
Protein GI157159469 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase
[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value0.761608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGTGT TGAAGTTCGG CGGTACATCA GTGGCAAATG CAGAACGTTT TCTGCGGGTT 
GCCGATATTC TGGAAAGCAA TGCCAGGCAG GGGCAGGTGG CCACCGTCCT CTCTGCCCCC
GCCAAAATCA CCAACCACCT GGTGGCGATG ATTGAAAAAA CCATTAGCGG CCAGGATGCT
TTACCCAATA TCAGCGATGC CGAACGTATT TTTGCCGAAC TTCTGACGGG ACTCACCGCC
GCCCAGCCGG GATTCCCGCT GGCGCAACTG AAAACTTTCG TCGACCAGGA ATTTGCCCAA
ATAAAACATG TCCTGCATGG CATTAGTTTG TTGGGGCAGT GCCCGGATAG CATCAACGCT
GCGCTGATTT GCCGTGGCGA GAAAATGTCG ATCGCCATTA TGGCCGGCGT GTTAGAAGCG
CGTGGTCACA ACGTTACCGT TATCGATCCG GTCGAAAAAC TACTGGCAGT GGGGCATTAC
CTCGAATCTA CCGTCGATAT TGCTGAGTCC ACCCGCCGTA TTGCGGCAAG TCGTATTCCG
GCTGATCACA TGGTGCTGAT GGCAGGCTTC ACCGCCGGTA ATGAAAAAGG CGAACTGGTG
GTACTTGGAC GCAACGGTTC CGACTACTCC GCGGCGGTGC TGGCTGCCTG TTTACGCGCC
GATTGTTGCG AGATTTGGAC GGACGTTGAC GGGGTCTATA CCTGCGACCC GCGTCAGGTG
CCCGATGCGA GGTTGTTGAA GTCGATGTCC TACCAGGAAG CGATGGAGCT TTCCTACTTC
GGCGCTAAAG TTCTTCACCC CCGCACCATT ACCCCCATCG CCCAGTTCCA GATCCCTTGC
CTGATTAAAA ATACCGGAAA TCCTCAAGCA CCAGGTACGC TCATTGGTGC CAGCCGTGAT
GAAGACGAAT TACCGGTCAA GGGCATTTCC AATCTGAATA ACATGGCAAT GTTCAGCGTT
TCTGGTCCGG GAATGAAAGG GATGGTCGGC ATGGCGGCGC GCGTCTTTGC AGCGATGTCA
CGCGCCCGTA TTTCCGTGGT GCTGATTACG CAATCATCTT CCGAATACAG TATCAGTTTC
TGCGTTCCGC AAAGCGACTG TGTGCGAGCT GAACGGGCAA TGCAGGAAGA GTTCTACCTG
GAACTGAAAG AAGGCTTACT GGAGCCGCTG GCGGTGACGG AACGGCTGGC CATTATCTCG
GTGGTAGGTG ATGGTATGCG CACCTTGCGT GGGATCTCGG CGAAATTCTT TGCCGCGCTG
GCCCGCGCCA ATATCAACAT TGTCGCCATT GCTCAGGGAT CTTCTGAACG CTCAATCTCT
GTCGTGGTAA ATAACGATGA TGCGACCACT GGCGTGCGCG TTACTCATCA GATGCTGTTC
AATACCGATC AGGTTATCGA AGTGTTTGTG ATTGGCGTCG GTGGCGTTGG CGGTGCGCTG
CTGGAGCAAC TGAAGCGTCA ACAAAGCTGG CTGAAGAATA AACATATCGA CTTACGTGTC
TGCGGTGTTG CCAACTCGAA GGCACTGCTC ACCAATGTGC ATGGCCTAAA TCTGGAAAAC
TGGCAGGAAG AACTGGCGCA AGCCAAAGAG CCGTTTAATC TCGGGCGCTT AATTCGCCTC
GTGAAAGAAT ATCATCTGCT GAACCCGGTC ATTGTTGACT GCACCTCCAG CCAGGCAGTG
GCGGATCAAT ATGCCGACTT CTTGCGCGAA GGTTTCCACG TTGTCACGCC GAACAAAAAG
GCCAACACCT CGTCGATGGA TTACTACCAT CTGTTGCGTC ATGCGGCGGA AAAATCGCGG
CGTAAATTCC TCTATGACAC CAACGTTGGG GCTGGATTAC CGGTTATTGA GAACCTGCAA
AATCTGCTCA ATGCTGGTGA TGAATTGATG AAGTTCTCCG GCATTCTTTC AGGTTCGCTT
TCTTATATCT TCGGCAAGTT AGACGAAGGC ATGAGTTTCT CCGAGGCGAC TACTCTGGCG
CGGGAAATGG GTTATACCGA ACCGGATCCG CGAGATGATC TTTCTGGTAT GGATGTAGCG
CGTAAGCTAT TGATTCTCGC TCGTGAAACG GGACGTGAAC TGGAGCTGGC GGATATTGAA
ATTGAACCTG TGCTGCCCGC AGAGTTTAAC GCTGAGGGTG ATGTTGCCGC TTTTATGGCG
AATCTGTCAC AGCTCGACGA TCTCTTTGCC GCGCGCGTGG CGAAGGCCCG TGATGAAGGA
AAAGTTTTGC GCTATGTTGG CAATATTGAT GAAGATGGTG CCTGCCGCGT GAAGATTGCC
GAAGTGGATG GTAATGATCC GCTGTTCAAA GTGAAAAATG GCGAAAACGC CCTGGCCTTT
TATAGCCACT ATTATCAGCC GCTGCCGTTG GTGCTGCGCG GATATGGTGC GGGCAATGAC
GTTACAGCTG CCGGTGTCTT TGCCGATCTG CTACGTACCC TCTCATGGAA GTTAGGAGTC
TGA
 
Protein sequence
MRVLKFGGTS VANAERFLRV ADILESNARQ GQVATVLSAP AKITNHLVAM IEKTISGQDA 
LPNISDAERI FAELLTGLTA AQPGFPLAQL KTFVDQEFAQ IKHVLHGISL LGQCPDSINA
ALICRGEKMS IAIMAGVLEA RGHNVTVIDP VEKLLAVGHY LESTVDIAES TRRIAASRIP
ADHMVLMAGF TAGNEKGELV VLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYTCDPRQV
PDARLLKSMS YQEAMELSYF GAKVLHPRTI TPIAQFQIPC LIKNTGNPQA PGTLIGASRD
EDELPVKGIS NLNNMAMFSV SGPGMKGMVG MAARVFAAMS RARISVVLIT QSSSEYSISF
CVPQSDCVRA ERAMQEEFYL ELKEGLLEPL AVTERLAIIS VVGDGMRTLR GISAKFFAAL
ARANINIVAI AQGSSERSIS VVVNNDDATT GVRVTHQMLF NTDQVIEVFV IGVGGVGGAL
LEQLKRQQSW LKNKHIDLRV CGVANSKALL TNVHGLNLEN WQEELAQAKE PFNLGRLIRL
VKEYHLLNPV IVDCTSSQAV ADQYADFLRE GFHVVTPNKK ANTSSMDYYH LLRHAAEKSR
RKFLYDTNVG AGLPVIENLQ NLLNAGDELM KFSGILSGSL SYIFGKLDEG MSFSEATTLA
REMGYTEPDP RDDLSGMDVA RKLLILARET GRELELADIE IEPVLPAEFN AEGDVAAFMA
NLSQLDDLFA ARVAKARDEG KVLRYVGNID EDGACRVKIA EVDGNDPLFK VKNGENALAF
YSHYYQPLPL VLRGYGAGND VTAAGVFADL LRTLSWKLGV