Gene EcSMS35_0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0001 
SymbolthrA 
ID6142593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp336 
End bp2798 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content53% 
IMG OID641614902 
Productbifunctional aspartokinase I/homeserine dehydrogenase I 
Protein accessionYP_001742118 
Protein GI170684018 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase
[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.880494 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGTGT TGAAGTTCGG CGGTACATCA GTGGCAAATG CAGAACGTTT TCTGCGGGTT 
GCCGATATTC TGGAAAGCAA TGCCAGGCAG GGGCAGGTGG CCACCGTCCT CTCTGCCCCC
GCCAAAATCA CCAACCACCT GGTGGCGATG ATTGAAAAAA CCATTAGCGG CCAGGATGCT
TTACCCAATA TCAGCGATGC CGAACGTATT TTTGCCGAAC TTTTGACGGG ACTCGCCGCC
GCCCAGCCGG GGTTCCCGCT GGCGCAATTG AAAACTTTCG TCGATCAGGA ATTTGCCCAA
ATAAAACATG TCCTGCATGG CATTAGTTTG TTGGGGCAGT GCCCGGATAG CATCAACGCT
GCGCTGATTT GCCGTGGCGA GAAAATGTCG ATCGCCATTA TGGCCGGCGT ATTAGAAGCG
CGCGGTCACA ACGTTACCGT TATCGATCCG GTCGAAAAAC TGCTGGCAGT GGGGCATTAC
CTCGAATCTA CCGTCGATAT TGCTGAGTCC ACCCGCCGTA TTGCGGCAAG TCGTATTCCG
GCTGATCACA TGGTGCTGAT GGCAGGTTTC ACCGCCGGTA ATGAAAAAGG CGAACTGGTG
GTGCTTGGAC GCAACGGTTC CGACTACTCT GCTGCGGTGC TGGCTGCCTG TCTACGCGCC
GATTGTTGCG AGATTTGGAC GGACGTTGAC GGGGTCTATA CCTGCGACCC GCGTCAGGTG
CCCGATGCGA GGTTGTTGAA GTCGATGTCC TACCAGGAAG CGATGGAGCT TTCCTACTTC
GGCGCTAAAG TTCTTCACCC CCGCACAATT ACCCCTATCG CCCAGTTCCA GATCCCTTGC
CTGATTAAAA ATACCGGAAA TCCTCAAGCA CCAGGTACGC TCATTGGTGC CAGCCGTGAT
GAAGACGAAT TACCGGTCAA GGGCATTTCC AATCTGAATA ACATGGCAAT GTTCAGCGTT
TCCGGCCCGG GGATGAAAGG GATGGTTGGC ATGGCGGCGC GTGTCTTTGC AGCGATGTCA
CGCGCCCGTA TTTCCGTGGT GCTGATTACG CAATCGTCTT CTGAATACAG TATCAGTTTC
TGCGTTCCAC AAAGCGACTG TGTGCGAGCT GAACGGGCGA TGCAGGAAGA GTTCTATCTT
GAACTGAAGG AAGGCTTGCT GGAGCCGCTG GCGGTGACGG AACGGCTGGC CATTATCTCG
GTGGTAGGTG ATGGTATGCG CACCTTGCGT GGGATCTCGG CGAAATTCTT TGCCGCGCTG
GCCCGCGCCA ATATCAACAT TGTCGCCATT GCTCAGGGAT CTTCTGAACG CTCAATCTCT
GTCGTGGTCA ATAACGATGA TGCGACCACT GGCGTGCGCG TTACTCATCA GATGCTGTTC
AACACCGATC AGGTTATCGA AGTGTTTGTG ATTGGTGTCG GTGGCGTTGG CGGTGCGCTG
CTGGAGCAAC TGAAGCGTCA GCAAAGCTGG TTGAAGAATA AACATATCGA CTTACGTGTC
TGCGGTGTTG CCAACTCGAA GGCACTGCTC ACCAATGTAC ATGGCCTTAA TCTGGAAAAC
TGGCAGGAAG AACTGGCGCA AGCCAAAGAG CCGTTTAATC TCGGGCGCTT AATTCGCCTC
GTGAAAGAAT ATCATCTGCT GAACCCGGTC ATTGTTGACT GCACTTCCAG CCAGGCAGTG
GCGGATCAAT ATGCCGACTT CCTGCGCGAA GGTTTCCACG TTGTCACGCC GAACAAAAAG
GCCAACACCT CGTCGATGGA TTACTACCAT CAGTTGCGTT ATGCGGCGGA AAAATCGCGG
CGTAAATTCC TCTATGACAC CAACGTTGGG GCTGGATTAC CGGTTATTGA GAACCTGCAA
AATCTGCTCA ATGCAGGTGA TGAATTGATG AAGTTCTCCG GCATTCTTTC TGGTTCGCTT
TCTTATATCT TCGGCAAGTT AGACGAAGGC ATGAGTTTCT CCGAGGCGAC CACGCTGGCG
CGGGAAATGG GTTATACCGA ACCGGACCCG CGAGATGATC TTTCTGGTAT GGATGTGGCG
CGTAAGCTAT TGATTCTCGC CCGTGAAACG GGACGTGAAC TGGAACTGGC GGATATTGAA
ATTGAACCTG TGCTGCCCGC AGAGTTTAAC GCAGAGGGTG ATGTTGCCGC TTTTATGGCG
AATCTGTCAC AGCTCGACGA TCTCTTTGCC GCACGCGTGG CGAAGGCTCG TGATGAGGGC
AAAGTTTTGC GCTATGTTGG CAATATTGAT GAAGATGGCA TCTGCCGCGT GAAGATTGCC
GAAGTGGATG GCAATGATCC GCTGTTCAAA GTGAAAAATG GCGAAAACGC CCTGGCCTTC
TATAGCCACT ATTATCAGCC GCTGCCGTTG GTTCTGCGCG GATATGGCGC GGGCAATGAC
GTTACAGCTG CTGGTGTCTT TGCCGATCTG CTACGTACCC TCTCATGGAA GTTAGGAGTC
TGA
 
Protein sequence
MRVLKFGGTS VANAERFLRV ADILESNARQ GQVATVLSAP AKITNHLVAM IEKTISGQDA 
LPNISDAERI FAELLTGLAA AQPGFPLAQL KTFVDQEFAQ IKHVLHGISL LGQCPDSINA
ALICRGEKMS IAIMAGVLEA RGHNVTVIDP VEKLLAVGHY LESTVDIAES TRRIAASRIP
ADHMVLMAGF TAGNEKGELV VLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYTCDPRQV
PDARLLKSMS YQEAMELSYF GAKVLHPRTI TPIAQFQIPC LIKNTGNPQA PGTLIGASRD
EDELPVKGIS NLNNMAMFSV SGPGMKGMVG MAARVFAAMS RARISVVLIT QSSSEYSISF
CVPQSDCVRA ERAMQEEFYL ELKEGLLEPL AVTERLAIIS VVGDGMRTLR GISAKFFAAL
ARANINIVAI AQGSSERSIS VVVNNDDATT GVRVTHQMLF NTDQVIEVFV IGVGGVGGAL
LEQLKRQQSW LKNKHIDLRV CGVANSKALL TNVHGLNLEN WQEELAQAKE PFNLGRLIRL
VKEYHLLNPV IVDCTSSQAV ADQYADFLRE GFHVVTPNKK ANTSSMDYYH QLRYAAEKSR
RKFLYDTNVG AGLPVIENLQ NLLNAGDELM KFSGILSGSL SYIFGKLDEG MSFSEATTLA
REMGYTEPDP RDDLSGMDVA RKLLILARET GRELELADIE IEPVLPAEFN AEGDVAAFMA
NLSQLDDLFA ARVAKARDEG KVLRYVGNID EDGICRVKIA EVDGNDPLFK VKNGENALAF
YSHYYQPLPL VLRGYGAGND VTAAGVFADL LRTLSWKLGV