Gene EcolC_1356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1356 
Symbol 
ID6068166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1486251 
End bp1487453 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content52% 
IMG OID641600778 
Productacetate kinase 
Protein accessionYP_001724349 
Protein GI170019395 
COG category[C] Energy production and conversion 
COG ID[COG0282] Acetate kinase 
TIGRFAM ID[TIGR00016] acetate kinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.446941 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.180313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGTA AGTTAGTACT GGTTCTGAAC TGCGGTAGTT CTTCACTGAA ATTTGCCATC 
ATCGATGCAG TAAATGGTGA AGAGTACCTT TCTGGTTTAG CCGAATGTTT CCACCTGCCT
GAAGCACGTA TCAAATGGAA AATGGACGGC AATAAACAGG AAGCGGCTTT AGGTGCAGGC
GCCGCTCACA GCGAAGCGCT CAACTTTATC GTTAATACTA TTCTGGCACA AAAACCAGAA
CTGTCTGCGC AGCTGACTGC TATCGGTCAC CGTATCGTAC ACGGCGGCGA AAAGTATACC
AGCTCCGTAG TGATCGATGA GTCTGTTATT CAGGGTATCA AAGATGCAGC TTCTTTTGCA
CCGCTGCACA ACCCGGCTCA CCTGATCGGT ATCGAAGAAG CTCTGAAATC TTTCCCACAG
CTGAAAGACA AAAACGTTGC TGTATTTGAC ACCGCGTTCC ACCAGACTAT GCCGGAAGAG
TCTTACCTCT ACGCCCTGCC TTACAACCTG TACAAAGAGC ACGGCATCCG TCGTTACGGC
GCGCACGGCA CCAGCCACTT CTATGTAACC CAGGAAGCGG CAAAAATGCT GAACAAACCG
GTAGAAGAAC TGAACATCAT CACCTGCCAC CTGGGCAACG GTGGTTCCGT TTCTGCTATC
CGCAACGGTA AATGCGTTGA CACCTCTATG GGCCTGACCC CGCTGGAAGG TCTGGTCATG
GGTACCCGTT CTGGTGATAT CGATCCGGCG ATCATCTTCC ACCTGCACGA CACCCTGGGC
ATGAGCGTTG ACGCAATCAA CAAACTGCTG ACCAAAGAGT CTGGCCTGCT GGGTCTGACC
GAAGTGACCA GCGACTGCCG CTATGTTGAA GACAACTACG CGACGAAAGA AGACGCGAAG
CGCGCAATGG ACGTTTACTG CCACCGCCTG GCGAAATACA TCGGTGCCTA CACTGCGCTG
ATGGATGGTC GTCTGGACGC TGTTGTATTC ACTGGTGGTA TCGGTGAAAA TGCCGCGATG
GTTCGTGAAC TGTCTCTGGG CAAACTGGGC GTGCTGGGCT TTGAAGTTGA TCATGAACGC
AACCTGGCTG CACGTTTCGG CAAATCTGGT TTCATCAACA AAGAAGGTAC CCGTCCTGCG
GTGGTTATCC CAACCAACGA AGAACTGGTT ATCGCGCAAG ACGCGAGCCG CCTGACTGCC
TGA
 
Protein sequence
MSSKLVLVLN CGSSSLKFAI IDAVNGEEYL SGLAECFHLP EARIKWKMDG NKQEAALGAG 
AAHSEALNFI VNTILAQKPE LSAQLTAIGH RIVHGGEKYT SSVVIDESVI QGIKDAASFA
PLHNPAHLIG IEEALKSFPQ LKDKNVAVFD TAFHQTMPEE SYLYALPYNL YKEHGIRRYG
AHGTSHFYVT QEAAKMLNKP VEELNIITCH LGNGGSVSAI RNGKCVDTSM GLTPLEGLVM
GTRSGDIDPA IIFHLHDTLG MSVDAINKLL TKESGLLGLT EVTSDCRYVE DNYATKEDAK
RAMDVYCHRL AKYIGAYTAL MDGRLDAVVF TGGIGENAAM VRELSLGKLG VLGFEVDHER
NLAARFGKSG FINKEGTRPA VVIPTNEELV IAQDASRLTA