Gene EcSMS35_2524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2524 
Symbolfrc 
ID6146854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2582823 
End bp2584073 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content47% 
IMG OID641617396 
Productformyl-coenzyme A transferase 
Protein accessionYP_001744567 
Protein GI170682002 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID[TIGR03253] formyl-CoA transferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.494058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACTC CACTTCAAGG AATTAAAGTT CTCGATTTCA CCGGTGTGCA ATCTGGCCCA 
TCTTGTACTC AAATGCTGGC CTGGTTTGGC GCTGACGTCA TTAAAATTGA ACGTCCCGGC
GTTGGTGACG TAACGCGTCA CCAGCTGCGA GATATTCCTG ATATCGATGC GCTTTACTTC
ACCATGCTTA ACAGTAACAA ACGTTCTATT GAATTAAATA CCAAAACAGC GGAAGGCAAA
GAGGTAATGG AAAAGCTGAT CCGCGAAGCT GATATCTTAG TCGAGAACTT TCATCCTGGG
GCCATTGATC ACATGGGCTT CACCTGGGAG CATATTCAAG AAATCAATCC ACGTCTGATT
TTTGGTTCGA TCAAAGGGTT TGATGAGTGT TCGCCTTATG TGAATGTAAA AGCCTATGAA
AACGTTGCTC AGGCAGCGGG TGGCGCGGCA TCCACTACGG GTTTTTGGGA CGGTCCGCCG
CTGGTAAGCG CTGCAGCGTT GGGTGACAGC AACACCGGAA TGCATTTACT GATCGGTTTA
CTTGCTGCTT TGCTGCATCG CGAAAAAACG GGGCGTGGGC AACGAGTCAC CATGTCAATG
CAGGATGCCG TATTGAACCT TTGCCGCGTG AAATTACGCG ACCAGCAGCG TCTCGATAAA
TTGGGTTATC TGGAAGAATA CCCGCAGTAT CCGAATGGCA CATTTGGTGA TGCAGTTCCC
CGCGGAGGTA ATGCGGGTGG TGGCGGTCAG CCTGGCTGGA TCCTGAAATG TAAAGGCTGG
GAAACCGATC CTAATGCCTA TATTTATTTC ACTATTCAGG AGCAAAACTG GGAAAACACC
TGTAAAGCTA TCGGCAAACC AGAATGGATT ACCGATCCGG CATACAGTAC AGCCCATGCC
CGACAGCCAC ATATTTTCGA TATTTTTGCT GAAATCGAAA AATACACTGT CACTATTGAT
AAACATGAAG CTGTGGCCTA TTTGACTCAG TTTGATATTC CTTGTGCACC GGTTTTAAGT
ATGAAAGAAA TTTCACTTGA TCCCTCTTTG CGCCAAAGTG GCAGTGTTGT CGAAGTGGAA
CAACCGTTGC GTGGAAAATA TCTGACAGTT GGTTGTCCAA TGAAATTCTC TGCCTTTACG
CCGGATATTA AAGCTGCGCC GCTATTAGGT GAACATACCG CTGCTGTATT GCAGGAGCTG
GGTTATAGCG ACGATGAAAT TGCTGCAATG AAGCAAAACC ACGCCATCTG A
 
Protein sequence
MSTPLQGIKV LDFTGVQSGP SCTQMLAWFG ADVIKIERPG VGDVTRHQLR DIPDIDALYF 
TMLNSNKRSI ELNTKTAEGK EVMEKLIREA DILVENFHPG AIDHMGFTWE HIQEINPRLI
FGSIKGFDEC SPYVNVKAYE NVAQAAGGAA STTGFWDGPP LVSAAALGDS NTGMHLLIGL
LAALLHREKT GRGQRVTMSM QDAVLNLCRV KLRDQQRLDK LGYLEEYPQY PNGTFGDAVP
RGGNAGGGGQ PGWILKCKGW ETDPNAYIYF TIQEQNWENT CKAIGKPEWI TDPAYSTAHA
RQPHIFDIFA EIEKYTVTID KHEAVAYLTQ FDIPCAPVLS MKEISLDPSL RQSGSVVEVE
QPLRGKYLTV GCPMKFSAFT PDIKAAPLLG EHTAAVLQEL GYSDDEIAAM KQNHAI