Gene ECD_02221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02221 
SymbolackA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2300723 
End bp2301925 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content52% 
IMG OID 
Productacetate kinase 
Protein accessionACT44043 
Protein GI253978373 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.177023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAGTA AGTTAGTACT GGTTCTGAAC TGCGGTAGTT CTTCACTGAA ATTTGCCATC 
ATCGATGCAG TAAATGGTGA AGAGTACCTT TCTGGTTTAG CCGAATGTTT CCACCTGCCT
GAAGCACGTA TCAAATGGAA AATGGACGGC AATAAACAGG AAGCGGCTTT AGGTGCAGGC
GCCGCTCACA GCGAAGCGCT CAACTTTATC GTTAATACTA TTCTGGCACA AAAACCAGAA
CTGTCTGCGC AGCTGACTGC TATCGGTCAC CGTATCGTAC ACGGCGGCGA AAAGTATACC
AGCTCCGTAG TGATCGATGA GTCTGTTATT CAGGGTATCA AAGATGCAGC TTCTTTTGCA
CCGCTGCACA ACCCGGCTCA CCTGATCGGT ATCGAAGAAG CTCTGAAATC TTTCCCACAG
CTGAAAGACA AAAACGTTGC TGTATTTGAC ACCGCGTTCC ACCAGACTAT GCCGGAAGAG
TCTTACCTCT ACGCCCTGCC TTACAACCTG TACAAAGAGC ACGGCATCCG TCGTTACGGC
GCGCACGGCA CCAGCCACTT CTATGTAACC CAGGAAGCGG CAAAAATGCT GAACAAACCG
GTAGAAGAAC TGAACATCAT CACCTGCCAC CTGGGCAACG GTGGTTCCGT TTCTGCTATC
CGCAACGGTA AATGCGTTGA CACCTCTATG GGCCTGACCC CGCTGGAAGG TCTGGTCATG
GGTACCCGTT CTGGTGATAT CGATCCGGCG ATCATCTTCC ACCTGCACGA CACCCTGGGC
ATGAGCGTTG ACGCAATCAA CAAACTGCTG ACCAAAGAGT CTGGCCTGCT GGGTCTGACC
GAAGTGACCA GCGACTGCCG CTATGTTGAA GACAACTACG CGACGAAAGA AGACGCGAAG
CGCGCAATGG ACGTTTACTG CCACCGCCTG GCGAAATACA TCGGTGCCTA CACTGCGCTG
ATGGATGGTC GTCTGGACGC TGTTGTATTC ACTGGTGGTA TCGGTGAAAA TGCCGCGATG
GTTCGTGAAC TGTCTCTGGG CAAACTGGGC GTGCTGGGCT TTGAAGTTGA TCATGAACGC
AACCTGGCTG CACGTTTCGG CAAATCTGGT TTCATCAACA AAGAAGGTAC CCGTCCTGCG
GTGGTTATCC CAACCAACGA AGAACTGGTT ATCGCGCAAG ACGCGAGCCG CCTGACTGCC
TGA
 
Protein sequence
MSSKLVLVLN CGSSSLKFAI IDAVNGEEYL SGLAECFHLP EARIKWKMDG NKQEAALGAG 
AAHSEALNFI VNTILAQKPE LSAQLTAIGH RIVHGGEKYT SSVVIDESVI QGIKDAASFA
PLHNPAHLIG IEEALKSFPQ LKDKNVAVFD TAFHQTMPEE SYLYALPYNL YKEHGIRRYG
AHGTSHFYVT QEAAKMLNKP VEELNIITCH LGNGGSVSAI RNGKCVDTSM GLTPLEGLVM
GTRSGDIDPA IIFHLHDTLG MSVDAINKLL TKESGLLGLT EVTSDCRYVE DNYATKEDAK
RAMDVYCHRL AKYIGAYTAL MDGRLDAVVF TGGIGENAAM VRELSLGKLG VLGFEVDHER
NLAARFGKSG FINKEGTRPA VVIPTNEELV IAQDASRLTA