Gene ECH74115_3333 details


Gene Information

Locus tag: ECH74115_3333
Symbol: ccmF
ID: 6967800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3067876
End bp: 3069819
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 57%
IMG OID: 643387145
Product: cytochrome c-type biogenesis protein CcmF
Protein accession: YP_002271608
Protein GI: 209399270
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1138] Cytochrome c biogenesis factor
TIGRFAM ID: [TIGR00353] c-type cytochrome biogenesis protein CcmF
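
The start, end, gene length and protein length listed above are related by simple arithmetic. The sketch below shows one way to check them, assuming the coordinates are 1-based and inclusive (an assumption that fits the reported values but is not stated in the record) and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3067876, 3069819)
print(length)                     # 1944
print(protein_length_aa(length))  # 647
```

Both results agree with the "Gene Length" and "Protein Length" fields of this record.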


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCCGG AAATTGGTAA CGGGCTGCTG TGCCTGGCGC TGGGAATTGC GCTGCTGCTG 
TCCGTGTATC CGCTATGGGG CGTGGCGCGC GGAGATGCGC GCATGATGGC GTCTTCCCGC
TTGTTTGCCT GGCTGCTGTT TATGTCTGTG GCTGGCGCAT TTCTGGTGCT GGTCAATGCT
TTCGTGGTCA ACGACTTTAC CGTCACCTAT GTTGCCAGCA ACTCCAATAC CCAGCTTCCG
GTGTGGTATC GCGTGGCGGC TACCTGGGGC GCACATGAAG GCTCACTGCT GCTGTGGGTG
CTGCTGATGA GCGGCTGGAC CTTTGCGGTG GCGATTTTTA GTCAGCGTAT TCCGCTGGAT
ATTGTGGCCC GCGTACTGGC GATAATGGGG ATGGTCAGTG TCGGCTTTTT GCTGTTCATT
CTCTTTACCT CTAACCCATT CTCACGCACG CTGCCGAACT TCCCGATTGA AGGGCGCGAT
CTTAACCCGC TGTTACAGGA TCCGGGGCTG ATCTTCCATC CGCCTCTGCT TTATATGGGG
TACGTGGGTT TCTCGGTGGC GTTTGCTTTT GCCATTGCTT CTTTGCTGAG CGGGCGTCTG
GACAGCACTT ATGCGCGTTT TACTCGTCCG TGGACGCTGG CAGCGTGGAT CTTCCTGACG
CTCGGCATCG TGCTCGGTTC CGCATGGGCC TATTACGAAC TCGGCTGGGG TGGCTGGTGG
TTCTGGGATC CGGTAGAAAA CGCCTCGTTT ATGCCGTGGC TGGTGGGGAC TGCGCTGATG
CACTCACTGG CGGTCACTGA ACAACGCGCC AGCTTCAAAG CGTGGACATT ACTGCTGGCA
ATCAGTGCCT TCTCGTTGTG TCTGTTGGGG ACTTTCCTCG TGCGTTCCGG CGTGCTGGTA
TCGGTACACG CGTTTGCGTC TGATCCGGCG CGCGGTATGT TTATCCTCGC CTTTATGGTG
CTGGTGATTG GCGGTTCGCT GCTGCTGTTT GCCGCGCGTG GACACAAAGT TCGCTCACGC
GTAAACAATG CGCTGTGGTC GCGGGAATCT TTGCTGTTAG CGAACAACGT TTTGCTGGTC
GCCGCGATGC TGGTGGTGTT GCTGGGGACG CTGCTGCCGC TGGTGCACAA GCAACTGGGA
CTGGGCAGTA TTTCGATTGG CGAACCGTTC TTCAACACCA TGTTTACCTG GCTGATGGTG
CCGTTTGCGC TACTGCTGGG TGTCGGTCCT CTGGTGCGCT GGGGGCGCGA TCGCCCACGT
AAAATCCGCA ATTTATTGAT TATCGCCTTC ATCTCTACGC TGGTGCTGTC TCTGCTGTTG
CCGTGGCTGT TCGAAAGCAA AGTTGTGGCG ATGACGGTGC TCGGCCTGGC AATGGCCTGC
TGGATTGCGG TGCTGGCAAT TGCGGAAGCT GCGCTACGTA TTTCACGCGG CACGAAAACC
ACCTTCAGTT ATTGGGGGAT GGTGGCGGCT CACCTGGGGC TGGCAGTGAC AATTGTTGGC
ATTGCCTTTA GCCAGAACTA TAGCGTTGAG CGTGATGTGC GCATGAAGTC CGGCGATAGC
GTCGATATTC ATGAATATCG CTTCACCTTC CGTGATGTCA AAGAGGTGAC TGGCCCGAAC
TGGCGTGGCG GTGTGGCGAC TATCGGCGTA ACGCGAGAGG GTAAACCGGA AACGGTGCTG
TATGCGGAAA AACGTTATTA CAACACTGCC GGGTCGATGA TGACCGAAGC GGCAATTGAC
GGCGGCATCA CGCGTGACCT GTACGCCGCG CTCGGTGAAG AGCTGGAAAA CGGCGCGTGG
GCTGTGCGTC TTTACTACAA ACCATTTGTT CGCTGGATTT GGGCGGGCGG GCTGATGATG
GCGTTGGGCG GACTGCTGTG TCTGTTTGAT CCTCGCTATC GTAAGCGCGT GAGCCCGCAA
AAAACTGCGC CGGAGGCTGT ATGA
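
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed; the helper names are hypothetical, and the record does not say how its own GC content figure was derived, so small rounding differences are possible.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGATGCCGG AAATTGGTAA"
print(clean_sequence(first_blocks))  # ATGATGCCGGAAATTGGTAA
print(gc_fraction(first_blocks))     # 0.4
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the "GC content" field in the gene information.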
 
Protein sequence
MMPEIGNGLL CLALGIALLL SVYPLWGVAR GDARMMASSR LFAWLLFMSV AGAFLVLVNA 
FVVNDFTVTY VASNSNTQLP VWYRVAATWG AHEGSLLLWV LLMSGWTFAV AIFSQRIPLD
IVARVLAIMG MVSVGFLLFI LFTSNPFSRT LPNFPIEGRD LNPLLQDPGL IFHPPLLYMG
YVGFSVAFAF AIASLLSGRL DSTYARFTRP WTLAAWIFLT LGIVLGSAWA YYELGWGGWW
FWDPVENASF MPWLVGTALM HSLAVTEQRA SFKAWTLLLA ISAFSLCLLG TFLVRSGVLV
SVHAFASDPA RGMFILAFMV LVIGGSLLLF AARGHKVRSR VNNALWSRES LLLANNVLLV
AAMLVVLLGT LLPLVHKQLG LGSISIGEPF FNTMFTWLMV PFALLLGVGP LVRWGRDRPR
KIRNLLIIAF ISTLVLSLLL PWLFESKVVA MTVLGLAMAC WIAVLAIAEA ALRISRGTKT
TFSYWGMVAA HLGLAVTIVG IAFSQNYSVE RDVRMKSGDS VDIHEYRFTF RDVKEVTGPN
WRGGVATIGV TREGKPETVL YAEKRYYNTA GSMMTEAAID GGITRDLYAA LGEELENGAW
AVRLYYKPFV RWIWAGGLMM ALGGLLCLFD PRYRKRVSPQ KTAPEAV
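
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a rough illustration, the sketch below translates a DNA string codon by codon using only the standard library. It relies on the fact that NCBI table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons); the function name and the choice to stop at the first stop codon are assumptions made for this example.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest)
# paired with the amino acids of the standard code, shared by table 11.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence shown in this record.
print(translate("ATGATGCCGG AAATTGGTAA C"))  # MMPEIGN
print(translate("CCGGAGGCTG TATGA"))         # PEAV
```

The two outputs match the first seven and last four residues of the protein sequence above.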