Gene EcolC_3653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3653 
SymbolthrA 
ID6065835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3999502 
End bp4001964 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content53% 
IMG OID641603068 
Productbifunctional aspartokinase I/homeserine dehydrogenase I 
Protein accessionYP_001726591 
Protein GI170021637 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase
[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGTGT TGAAGTTCGG CGGTACATCA GTGGCAAATG CAGAACGTTT TCTGCGGGTT 
GCCGATATTC TGGAAAGCAA TGCCAGGCAG GGGCAGGTGG CCACCGTCCT CTCTGCCCCC
GCCAAAATCA CCAACCACCT GGTGGCGATG ATTGAAAAAA CCATTAGCGG CCAGGATGCT
TTACCCAATA TCAGCGATGC TGAACGTATT TTTGCCGAAC TTCTGACGGG ACTCGCCGCC
GCCCAGCCGG GATTCCCGCT GGCGCAATTG AAAACTTTCG TCGACCAGGA ATTTGCTCAA
ATAAAACATG TCCTGCATGG CATTAGTTTG TTAGGGCAGT GCCCGGATAG CATCAACGCT
GCGCTGATTT GCCGTGGCGA GAAAATGTCG ATCGCCATTA TGGCCGGCGT GTTAGAAGCG
CGTGGTCACA ACGTTACCGT TATCGATCCG GTCGAAAAAC TACTGGCAGT GGGGCATTAC
CTCGAATCTA CCGTCGATAT TGCTGAGTCC ACCCGCCGTA TTGCGGCAAG TCGTATTCCG
GCTGATCACA TGGTGCTGAT GGCAGGTTTC ACCGCCGGTA ATGAAAAAGG CGAACTGGTG
GTGCTTGGAC GTAACGGTTC CGACTACTCC GCTGCGGTGC TGGCTGCCTG TTTACGCGCC
GATTGTTGCG AGATTTGGAC GGACGTTGAC GGGGTTTATA CCTGCGACCC GCGTCAGGTG
CCCGATGCGA GGTTGTTGAA GTCGATGTCC TACCAGGAAG CGATGGAGCT TTCCTACTTC
GGCGCTAAAG TTCTTCACCC CCGCACCATT ACCCCCATCG CCCAGTTCCA GATCCCTTGC
CTGATTAAAA ATACCGGAAA TCCTCAAGCA CCAGGTACGC TCATTGGTGC CAGCCGTGAT
GAAGACGAAT TACCGGTCAA GGGCATTTCC AATCTGAATA ACATGGCAAT GTTCAGCGTT
TCCGGCCCGG GGATGAAAGG AATGGTCGGC ATGGCGGCGC GCGTCTTTGC TGCAATGTCA
CGCGCCCGTA TTTCCGTGGT GCTGATTACG CAATCATCTT CCGAATACAG TATCAGTTTC
TGCGTTCCGC AAAGCGACTG TGTGCGAGCT GAACGGGCAA TGCAGGAAGA GTTCTACCTG
GAACTGAAAG AAGGCTTACT GGAGCCGCTG GCGGTGACGG AACGGCTGGC CATTATCTCG
GTGGTAGGTG ATGGTATGCG CACCTTGCGT GGGATCTCGG CGAAATTCTT TGCCGCGCTG
GCCCGCGCCA ATATCAACAT TGTCGCCATT GCTCAGGGAT CTTCTGAACG CTCAATCTCT
GTCGTGGTAA ATAACGATGA TGCGACCACT GGCGTGCGCG TTACTCATCA GATGCTGTTC
AATACCGATC AGGTTATCGA AGTGTTTGTG ATTGGCGTCG GTGGCGTTGG CGGTGCGCTG
CTGGAGCAAC TGAAGCGTCA ACAAAGCTGG CTGAAGAATA AACATATCGA CTTACGTGTC
TGCGGCGTTG CCAACTCGAA GGCTCTGCTT ACCAATGTGC ATGGCCTAAA CCTGGAAAAC
TGGCAGGAAG AACTGGCGCA AGCCAAAGAG CCGTTTAATC TCGGGCGCTT AATTCGCCTC
GTGAAAGAAT ATCATCTGCT AAACCCGGTC ATTGTTGACT GCACCTCCAG CCAGGCAGTG
GCGGATCAAT ATGCCGACTT CTTGCGCGAA GGTTTCCACG TTGTCACGCC GAACAAAAAG
GCCAACACCT CGTCGATGGA TTACTACCAT CTGTTGCGTC ATGCGGCGGA AAAATCGCGG
CGTAAATTCC TCTATGACAC CAACGTTGGG GCTGGATTAC CGGTTATTGA GAACCTGCAA
AATCTGCTCA ATGCTGGTGA TGAATTGATG AAGTTCTCCG GCATTCTTTC AGGTTCGCTT
TCTTATATCT TCGGCAAGTT AGACGAAGGC ATGAGTTTCT CCGAGGCGAC TACTCTGGCG
CGGGAAATGG GTTATACCGA ACCGGATCCG CGAGATGATC TTTCTGGTAT GGATGTAGCG
CGTAAGCTAT TGATTCTCGC TCGTGAAACG GGACGTGAAC TGGAGCTGGC GGATATTGAA
ATTGAACCTG TGCTGCCCGC AGAGTTTAAC GCCGAGGGTG ATGTTGCCGC TTTTATGGCG
AATCTGTCAC AGCTCGACGA GCTCTTTGCC GCGCGCGTGG CGAAGGCCCG TGATGAAGGA
AAAGTTTTGC GCTATGTTGG CAATATTGAT GAAGATGGCG TCTGCCGCGT GAAGATTGCC
GAAGTGGATG GGAATGATCC GCTGTTCAAA GTGAAAAATG GCGAAAACGC CCTGGCCTTC
TATAGCCACT ATTATCAGCC GCTGCCGTTG GTGCTGCGCG GATATGGTGC GGGCAATGAC
GTTACCGCTG CTGGTGTCTT TGCCGATCTG CTACGTACCC TCTCATGGAA GTTAGGAGTC
TGA
 
Protein sequence
MRVLKFGGTS VANAERFLRV ADILESNARQ GQVATVLSAP AKITNHLVAM IEKTISGQDA 
LPNISDAERI FAELLTGLAA AQPGFPLAQL KTFVDQEFAQ IKHVLHGISL LGQCPDSINA
ALICRGEKMS IAIMAGVLEA RGHNVTVIDP VEKLLAVGHY LESTVDIAES TRRIAASRIP
ADHMVLMAGF TAGNEKGELV VLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYTCDPRQV
PDARLLKSMS YQEAMELSYF GAKVLHPRTI TPIAQFQIPC LIKNTGNPQA PGTLIGASRD
EDELPVKGIS NLNNMAMFSV SGPGMKGMVG MAARVFAAMS RARISVVLIT QSSSEYSISF
CVPQSDCVRA ERAMQEEFYL ELKEGLLEPL AVTERLAIIS VVGDGMRTLR GISAKFFAAL
ARANINIVAI AQGSSERSIS VVVNNDDATT GVRVTHQMLF NTDQVIEVFV IGVGGVGGAL
LEQLKRQQSW LKNKHIDLRV CGVANSKALL TNVHGLNLEN WQEELAQAKE PFNLGRLIRL
VKEYHLLNPV IVDCTSSQAV ADQYADFLRE GFHVVTPNKK ANTSSMDYYH LLRHAAEKSR
RKFLYDTNVG AGLPVIENLQ NLLNAGDELM KFSGILSGSL SYIFGKLDEG MSFSEATTLA
REMGYTEPDP RDDLSGMDVA RKLLILARET GRELELADIE IEPVLPAEFN AEGDVAAFMA
NLSQLDELFA ARVAKARDEG KVLRYVGNID EDGVCRVKIA EVDGNDPLFK VKNGENALAF
YSHYYQPLPL VLRGYGAGND VTAAGVFADL LRTLSWKLGV