Gene EcSMS35_2604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2604 
SymboleutD 
ID6144063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2656790 
End bp2657806 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content58% 
IMG OID641617475 
Productphosphotransacetylase 
Protein accessionYP_001744640 
Protein GI170681570 
COG category[C] Energy production and conversion 
COG ID[COG0280] Phosphotransacetylase 
TIGRFAM ID[TIGR00651] phosphate acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATTG AACGTTGTCG TGAACTGGCG TTGCGAGCGC CCGCCAGAGT GGTTTTTCCG 
GATGCGTTAG ATCAGCGTGT GCTGAAAGCT GCGCAATATT TACATCAACA AGGTCTGGCA
ACGCCCATTC TGGTCGCTAA TCCGTTTGAA CTTCGTCAGT TTGCGCTCAG TCATGGCGTA
GCGATGGACG GGCTACAGGT GATTGATCCG CATGGCAACC TCGCAATGCG GGAAGAATTT
GCTCATCGCT GGCTGGCCCG CGCGGGCGAA AAAACGCCGC CGGATGCGCT GGAAAAACTC
ACCGAACCGC TGATGTTCGC CGCCGCAATG GTCAGCGCCG GTAAAGCGGA TGTCTGTATC
GCGGGCAATC TCTCTTCCAC GGCGAATGTG CTGCGTGCCG GATTACGCAT TATTGGCTTG
CAGCCAGGCT GTAAAACGCT CTCATCCATT TTCCTGATGC TGCCACAGTA CAGCGGTCCG
GCGTTGGGTT TTGCCGATTG CAGCGTGGTG CCACAGCCGA CGGCGGCGCA GTTGGCGGAT
ATCGCGCTTG CCAGCGCCGA AACCTGGCGC GCCATCACCG GAGAAGAACC GCGCGTGGCG
ATGCTGTCGT TTTCCAGCAA CGGTAGTGCC CGTCACCCCT GCGTTGCCAA CGTGCAGCAG
GCGACAGAAA TCGTCCGTGA GCGCGCACCA AAGCTGGTTG TCGATGGCGA GTTGCAGTTT
GACGCCGCCT TCGTGCCGGA AGTGGCGGCG CAAAAAGCGC CTGCCAGCCC GCTACAAGGC
AAGGCCAATG TGATGGTTTT TCCGTCGCTG GAAGCCGGAA ATATTGGCTA CAAAATCGCA
CAACGACTTG GCGGATATCG TGCCGTCGGG CCATTGATAC AAGGACTTGC CGCGCCGATG
CACGATCTCT CTCGTGGTTG TAGCGTGCAG GAAATTATCG AACTGGCGCT GGTGGCAGCT
GTGCCGCGTC AGACAGAAGT GAACCGCGAA AGCAGTTTAC AAACACTGGT TGAATGA
 
Protein sequence
MIIERCRELA LRAPARVVFP DALDQRVLKA AQYLHQQGLA TPILVANPFE LRQFALSHGV 
AMDGLQVIDP HGNLAMREEF AHRWLARAGE KTPPDALEKL TEPLMFAAAM VSAGKADVCI
AGNLSSTANV LRAGLRIIGL QPGCKTLSSI FLMLPQYSGP ALGFADCSVV PQPTAAQLAD
IALASAETWR AITGEEPRVA MLSFSSNGSA RHPCVANVQQ ATEIVRERAP KLVVDGELQF
DAAFVPEVAA QKAPASPLQG KANVMVFPSL EAGNIGYKIA QRLGGYRAVG PLIQGLAAPM
HDLSRGCSVQ EIIELALVAA VPRQTEVNRE SSLQTLVE