Gene EcolC_4006 details


Gene Information

Locus tag: EcolC_4006
Symbol:
ID: 6064559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4400690
End bp: 4402039
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 54%
IMG OID: 641603417
Product: aspartate kinase III
Protein accession: YP_001726932
Protein GI: 170021978
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00656] aspartate kinase, monofunctional class
            [TIGR00657] aspartate kinase
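
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. Both of those are assumptions made for this sketch, not something the record states; the function names are hypothetical.

```python
# Consistency check for the coordinate and length fields of this record.
# Assumption: Start bp and End bp are 1-based, inclusive positions.
# Assumption: the last codon is a stop codon and is not part of the protein.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

length = gene_length(4400690, 4402039)
print(length)                  # 1350
print(protein_length(length))  # 449
```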


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.193511
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAA TTGTTGTCTC CAAATTTGGC GGTACCAGCG TAGCTGATTT TGACGCCATG 
AACCGCAGCG CTGATATTGT GCTTTCTGAT GCCAACGTGC GTTTAGTAGT CCTCTCGGCT
TCTGCTGGTA TCACTAATCT GCTGGTCGCT TTAGCGGAAG GGCTGGAACC TGGCGAGCGA
TTCGAAAAAC TCGACGCTAT TCGCAACATC CAGTTTGCCA TTCTGGAACG TCTGCGTTAC
CCGAACGTTA TCCGTGAAGA GATTGAACGT CTGCTGGAGA ACATTACTGT TCTGGCAGAA
GCGGCGGCGC TGGCAACGTC TCCGGCGCTG ACAGATGAGC TGGTCAGCCA CGGCGAGCTG
ATGTCGACCC TGCTGTTTGT TGAGATCCTG CGCGAACGCG ATGTTCAGGC ACAGTGGTTT
GATGTACGTA AAGTGATGCG TACCAACGAC CGATTTGGTC GTGCAGAGCC AGATGTAGCC
GCGCTGGCGG AACTGGCCGC GCTGCAGCTG CTCCCACGCC TCAATGACGG CTTAGTGATC
ACCCAGGGAT TTATCGGTAG CGAAAATAAA GGTCGTACAA CGACGCTTGG CCGTGGAGGC
AGCGATTATA CGGCAGCCTT GCTGGCGGAG GCTTTACACG CATCTCGTGT TGATATCTGG
ACCGACGTCC CGGGCATCTA CACCACCGAT CCACGCGTGG TTTCCGCAGC AAAACGCATT
GATGAAATCG CGTTTGCCGA AGCGGCAGAG ATGGCAACTT TTGGTGCAAA AGTACTGCAT
CCAGCAACGT TGCTCCCCGC AGTACGCAGC GATATCCCGG TCTTTGTCGG CTCCAGCAAA
GACTCACGCG CAGGTGGTAC GCTGGTGTGC AATAAAACTG AAAATCCGCC GCTGTTCCGC
GCGCTGGCGC TTCGTCGCAA TCAGACTCTA CTCACTTTGC ACAGCCTGAA TATGCTGCAT
TCTCGCGGTT TCCTCGCGGA AGTTTTCGGC ATCCTCGCGC GGCATAATAT TTCGGTAGAC
TTAATCACCA CGTCAGAAGT GAGCGTGGCA TTAACCCTTG ATACCACCGG TTCAACCTCC
ACTGGCGATA CGTTGCTGAC GCAATCTCTG CTGATGGAGC TTTCCGCACT GTGTCGGGTG
GAGGTGGAAG AAGGTCTGGC GCTGGTCGCG TTGATTGGCA ATGACCTGTC AAAAGCCTGC
GGCGTTGGCA AAGAGGTATT CGGCGTACTG GAACCGTTCA ACATTCGCAT GATTTGTTAT
GGCGCATCCA GCCATAACCT GTGCTTCCTG GTGCCCGGCG AAGATGCCGA GCAGGTGGTG
CAAAAACTGC ATAGTAATTT GTTTGAGTAA
 
Protein sequence
MSEIVVSKFG GTSVADFDAM NRSADIVLSD ANVRLVVLSA SAGITNLLVA LAEGLEPGER 
FEKLDAIRNI QFAILERLRY PNVIREEIER LLENITVLAE AAALATSPAL TDELVSHGEL
MSTLLFVEIL RERDVQAQWF DVRKVMRTND RFGRAEPDVA ALAELAALQL LPRLNDGLVI
TQGFIGSENK GRTTTLGRGG SDYTAALLAE ALHASRVDIW TDVPGIYTTD PRVVSAAKRI
DEIAFAEAAE MATFGAKVLH PATLLPAVRS DIPVFVGSSK DSRAGGTLVC NKTENPPLFR
ALALRRNQTL LTLHSLNMLH SRGFLAEVFG ILARHNISVD LITTSEVSVA LTLDTTGSTS
TGDTLLTQSL LMELSALCRV EVEEGLALVA LIGNDLSKAC GVGKEVFGVL EPFNIRMICY
GASSHNLCFL VPGEDAEQVV QKLHSNLFE
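
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the sketch relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code. It does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases ordered TCAG).
# Table 11 uses the same assignments as the standard code; only the set of
# permitted start codons differs, and that is not modelled in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, dropping one trailing stop."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First display line of the gene sequence, copied from the record.
first_line = "ATGTCTGAAA TTGTTGTCTC CAAATTTGGC GGTACCAGCG TAGCTGATTT TGACGCCATG"
print(translate(first_line))  # MSEIVVSKFGGTSVADFDAM
```

Passing the full gene sequence to `translate` and `gc_fraction` is one way to cross-check the Protein sequence and GC content fields of the record.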