Gene EcSMS35_4485 details

Gene Information

Locus tag: EcSMS35_4485
Symbol: lysC
ID: 6144933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4581431
End bp: 4582780
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 54%
IMG OID: 641619301
Product: aspartate kinase III
Protein accession: YP_001746413
Protein GI: 170684154
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00656] aspartate kinase, monofunctional class; [TIGR00657] aspartate kinase
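
The reported gene and protein lengths can be cross-checked from the coordinates. The sketch below is a minimal consistency check, not part of the original record. It assumes the start and end positions are inclusive, and it relies on the final codon being a stop codon (the gene sequence in this record ends in TAA). The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4581431
end_bp = 4582780

# Assumption: both coordinates are inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# The last codon is a stop codon and does not encode an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 1350 449
```

Both values agree with the Gene Length and Protein Length fields.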


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAA TTGTTGTCTC CAAATTTGGC GGTACCAGCG TAGCTGATTT TGACGCCATG 
AACCGCAGCG CTGATATTGT GCTTTCTGAT GCCAACGTAC GTTTAGTTGT CCTCTCGGCT
TCTGCTGGTA TCACTAATCT GCTGGTCGCT TTAGCTGAAG GACTGGAACC TGGCGAGCGA
TTCGAAAAAC TCGACGCAAT CCGCAATATC CAGTTTGCCA TTCTGGAACG TCTGCGTTAC
CCGAACGTTA TCCGTGAAGA GATTGAACGT CTGCTGGAGA ATATTACTGT TCTGGCAGAA
GCGGCGGCGC TGGCAACGTC TCCGGCCCTG ACAGATGAAC TGGTCAGCCA CGGCGAGCTG
ATGTCGACCC TGCTGTTTGT CGAAATCCTG CGCGAACGCG ATGTTCAGGC ACAGTGGTTT
GATGTACGTA AAGTGATGCG TACCAACGAC CGATTTGGTC GTGCAGAGCC AGATGTAGCC
GCGCTGGCGG AACTGGCCGC GCTGCAGCTG CTCCCACGCC TCAATGAAGG CTTAGTGATC
ACCCAGGGAT TTATCGGTAG CGAAAATAAA GGTCGGACAA CGACGCTTGG CCGTGGAGGC
AGCGATTATA CGGCAGCCTT GCTGGCGGAG GCTTTACACG CATCTCGTGT TGATATCTGG
ACCGACGTCC CGGGCATCTA CACCACCGAT CCACGCGTGG TTTCCGCAGC AAAACGCATT
GATGAAATCG CGTTTGCCGA AGCGGCAGAG ATGGCAACTT TTGGTGCAAA AGTACTGCAT
CCGGCAACGT TGCTACCCGC AGTACGCAGC GATATCCCGG TCTTTGTCGG CTCCAGCAAA
GACCCACGCG CAGGTGGTAC GCTGGTGTGC AATAAAACTG AAAATCCGCC GCTGTTCCGC
GCGCTGGCGC TTCGTCGCAA TCAGACTCTG CTCACTTTGC ACAGCCTGAA TATGCTGCAT
TCTCGCGGTT TCCTCGCGGA AGTTTTCGGC ATCCTCGCGC GGCATAATAT TTCGGTAGAC
TTAATCACCA CGTCAGAAGT GAGCGTGGCA TTAACCCTTG ATACCACCGG TTCAACCTCC
ACTGGCGATA CGTTGCTGAC GCAATCCCTG CTGATGGAGC TTTCCGCACT GTGCCGGGTG
GAAGTGGAAG AGGGGCTGGC GCTGGTCGCG TTGATTGGCA ATGACCTGTC AAAAGCCTGC
GGCGTTGGCA AAGAGGTATT CGGCGTACTG GAACCGTTCA ACATTCGCAT GATTTGTTAC
GGCGCATCCA GCCATAACCT GTGCTTCCTG GTGCCCGGCG AAGATGCCGA GCAGGTGGTG
CAGAAGCTGC ATTTTAATTT ATTTGAGTAA
 
Protein sequence
MSEIVVSKFG GTSVADFDAM NRSADIVLSD ANVRLVVLSA SAGITNLLVA LAEGLEPGER 
FEKLDAIRNI QFAILERLRY PNVIREEIER LLENITVLAE AAALATSPAL TDELVSHGEL
MSTLLFVEIL RERDVQAQWF DVRKVMRTND RFGRAEPDVA ALAELAALQL LPRLNEGLVI
TQGFIGSENK GRTTTLGRGG SDYTAALLAE ALHASRVDIW TDVPGIYTTD PRVVSAAKRI
DEIAFAEAAE MATFGAKVLH PATLLPAVRS DIPVFVGSSK DPRAGGTLVC NKTENPPLFR
ALALRRNQTL LTLHSLNMLH SRGFLAEVFG ILARHNISVD LITTSEVSVA LTLDTTGSTS
TGDTLLTQSL LMELSALCRV EVEEGLALVA LIGNDLSKAC GVGKEVFGVL EPFNIRMICY
GASSHNLCFL VPGEDAEQVV QKLHFNLFE
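
The sequences above are printed in blocks of ten residues separated by spaces, so they need to be cleaned before any calculation. The following is a minimal sketch of how the blocked gene sequence could be normalised and its GC percentage computed for comparison with the GC content field. The helper names clean_sequence and gc_percent are hypothetical and not part of the original record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, as printed in the record.
first_blocks = "ATGTCTGAAA TTGTTGTCTC"
print(len(clean_sequence(first_blocks)))  # 20

# Toy input with an obvious answer, to show the calculation.
print(gc_percent("GGCC AATT"))  # 50.0
```

To check the record, paste the full gene sequence into gc_percent and round the result to the nearest whole percent.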