Gene ECH74115_3606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3606 
Symbolfrc 
ID6970697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3324253 
End bp3325503 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content47% 
IMG OID643387401 
Productformyl-coenzyme A transferase 
Protein accessionYP_002271860 
Protein GI209398990 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID[TIGR03253] formyl-CoA transferase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.480404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACTC CACTTCAAGG AATTAAAGTT CTCGATTTCA CCGGTGTGCA ATCTGGCCCA 
TCTTGTACTC AAATGCTGGC CTGGTTTGGC GCTGACGTCA TTAAAATTGA ACGCCCCGGC
GTTGGTGACG TAACGCGTCA CCAGCTGCGA GATATTCCTG ATATCGATGC GCTTTACTTC
ACCATGCTTA ACAGTAACAA ACGTTCTATT GAGTTAAATA CCAAAACAGC GGAAGGCAAA
GAGGTAATGG AAAAGCTGAT CCGCGAAGCT GATATCTTAG TCGAGAACTT TCATCCAGGG
GCCATTGATC ACATGGGCTT CACCTGGGAG CATATTCAAG AAATCAATCC ACGTCTGATT
TTTGGTTCGA TCAAAGGGTT TGATGAGTGT TCGCCTTATG TGAATGTAAA AGCCTATGAA
AACGTTGCTC AGGCAGCGGG TGGCGCGGCA TCCACTACGG GTTTTTGGGA CGGTCCGCCG
CTGGTAAGCG CTGCAGCGTT GGGTGACAGC AACACCGGAA TGCATTTACT GATCGGTTTA
CTTGCTGCTT TGCTGCATCG CGAAAAAACG GGGCGTGGGC AACGAGTCAC CATGTCAATG
CAGGATGCCG TATTGAACCT TTGCCGCGTG AAATTACGCG ACCAGCAGCG TCTCGATAAA
TTGGGTTATC TGGAAGAATA CCCGCAGTAT CCGAATGGCA CATTTGGTGA TGCAGTTCCC
CGCGGAGGTA ATGCGGGTGG TGGCGGTCAG CCTGGCTGGA TCCTGAAATG TAAAGGCTGG
GAAACTGATC CTAACGCCTA TATTTATTTC ACTATTCAGG AGCAAAACTG GGAAAACACC
TGTAAAGCCA TCGGCAAACC AGATTGGATT ACCGATCCGG CATACAGTAC AGCCCATGCC
CGACAGCCAC ATATTTTCGA TATTTTTGCT GAAATCGAAA AATACACTGT CACTATTGAT
AAACATGAAG CAGTGGCTTA TTTGACTCAG TTTGATATTC CTTGTGCACC GGTTTTAAGT
ATGAAAGAAA TTTCACTTGA TCCCTCTTTA CGCCAAAGCG GCAGTGTTGT CGAAGTGGAA
CAACCATTGC GTGGAAAATA TCTGACAGTT GGTTGTCCAA TGAAATTCTC TGCCTTTACG
CCGGATATTA AAGCTGCGCC TCTATTAGGT GAACATACCG CTGCGGTATT GCAGGAGCTG
GGTTATAGCG ACGATGAAAT TGCTGCAATG AAGCAAAACC ACGCCATCTG A
 
Protein sequence
MSTPLQGIKV LDFTGVQSGP SCTQMLAWFG ADVIKIERPG VGDVTRHQLR DIPDIDALYF 
TMLNSNKRSI ELNTKTAEGK EVMEKLIREA DILVENFHPG AIDHMGFTWE HIQEINPRLI
FGSIKGFDEC SPYVNVKAYE NVAQAAGGAA STTGFWDGPP LVSAAALGDS NTGMHLLIGL
LAALLHREKT GRGQRVTMSM QDAVLNLCRV KLRDQQRLDK LGYLEEYPQY PNGTFGDAVP
RGGNAGGGGQ PGWILKCKGW ETDPNAYIYF TIQEQNWENT CKAIGKPDWI TDPAYSTAHA
RQPHIFDIFA EIEKYTVTID KHEAVAYLTQ FDIPCAPVLS MKEISLDPSL RQSGSVVEVE
QPLRGKYLTV GCPMKFSAFT PDIKAAPLLG EHTAAVLQEL GYSDDEIAAM KQNHAI