Gene EcSMS35_2344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2344 
SymbolccmF 
ID6143358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2377246 
End bp2379189 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content57% 
IMG OID641617217 
Productcytochrome c-type biogenesis protein CcmF 
Protein accessionYP_001744389 
Protein GI170679920 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1138] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR00353] c-type cytochrome biogenesis protein CcmF 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.756913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCGG AAATTGGTAA CGGGCTGCTG TGTCTGGCGC TAGGAATTGC GCTGCTGCTG 
TCCGTGTATC CGCTATGGGG CGTAGCGCGC GGAGATGCGC GCATGATGGC ATCTTCCCGC
TTGTTTGCCT GGCTGCTGTT TATGTCTGTG GCTGGCGCAT TTCTGGTACT GGTCAATGCC
TTCGTGGTCA ACGACTTCAC CGTCACCTAT GTTGCCAGCA ACTCCAATAC CCAGCTTCCG
GTGTGGTATC GCGTGGCGGC CACCTGGGGC GCGCATGAAG GCTCGCTCCT GCTGTGGGTG
CTGCTGATGA GCGGCTGGAC CTTTGCGGTA GCGATTTTTA GTCAGCGTAT TCCGCTGGAT
ATTGTGGCCC GCGTACTGGC GATAATGGGG ATGGTCAGCG TCGGCTTTTT GCTGTTCATT
CTCTTTACCT CTAACCCGTT CTCACGCACG TTGCCGAACT TCCCGATTGA AGGTCGCGAT
CTTAACCCGC TGTTACAGGA TCCGGGGCTG ATCTTCCATC CGCCTCTGCT TTATATGGGG
TACGTCGGCT TCTCGGTGGC GTTTGCTTTT GCCATTGCTT CTTTGTTGAG CGGGCGTCTG
GACAGCACTT ATGCGCGTTT TACTCGTCCG TGGACGCTGG CGGCGTGGAT CTTCTTGACG
CTCGGCATTG TGCTCGGTTC CGCATGGGCC TATTACGAAC TCGGCTGGGG CGGCTGGTGG
TTCTGGGATC CGGTAGAAAA CGCCTCGTTT ATGCCGTGGC TGGTGGGGAC TGCGCTGATG
CACTCACTGG CGGTCACTGA ACAACGCGCC AGCTTCAAAG CGTGGACATT ACTGCTGGCA
ATCAGTGCCT TCTCGTTGTG TCTGCTGGGG ACTTTCCTGG TGCGTTCCGG CGTGCTGGTA
TCGGTACACG CGTTTGCCTC TGATCCGGCA CGCGGTATGT TTATCCTCGC CTTTATGGTA
CTGGTGATTG GTGGTTCGCT GCTGCTGTTT GCCGCGCGTG GACACAAAGT TCGCTCACGC
GTAAACAATG CGCTGTGGTC GCGGGAATCT CTGCTGTTAG CGAACAACGT TTTGCTGGTC
GCCGCGATGC TGGTGGTGTT GCTGGGGACG CTGCTGCCGC TGGTGCACAA GCAACTGGGA
CTGGGCAGTA TTTCGATTGG CGAACCGTTC TTCAACACCA TGTTTACCTG GCTGATGGTG
CCGTTTGCGC TGCTGCTTGG TGTCGGTCCT CTGGTGCGCT GGGGGCGCGA TCGCCCACGT
AAAATCCGCA ATTTGTTGAT TATCGCCTTC ATCTCTACGC TGGTGCTGTC GCTGCTGTTG
CCGTGGCTGT TCGAAAGCAA AGTGGTGGCG ATGACGGTGC TCGGCCTGGC AATGGCCTGC
TGGATTGCGG TGCTGGCAAT TGCGGAAGCT GCGCTGCGTA TTTCACGCGG CACGAAAACC
ACCTTCAGTT ATTGGGGAAT GGTGGCGGCT CACCTGGGGC TGGCAGTGAC AATTGTTGGT
ATTGCCTTTA GCCAGAACTA TAGCGTTGAG CGTGATGTGC GCATGAAGTC CGGCGATAGT
GTCGATATCC ATGAATATCG CTTCACCTTC CGTGATGTCA AAGAGGTGAC TGGCCCGAAC
TGGCGTGGCG GTGTGGCGAC TATCGGCGTA ACGCGCGATG GCAAGCCGGA AACGGTGCTG
TATGCGGAAA AACGTTATTA CAACACTGCC GGGTCGATGA TGACCGAAGC GGCGATTGAC
GGCGGCATCA CGCGTGACCT GTACGCCGCG CTCGGTGAAG AGCTGGAAAA CGGCGCGTGG
GCCGTGCGTC TTTACTACAA ACCGTTTGTT CGCTGGATTT GGGCGGGCGG GCTGATGATG
GCGTTGGGCG GACTGCTGTG TCTGTTTGAT CCTCGCTATC GTAAGCGCGT GAGTCCGCAA
AAAACTGCGC CGGAGGCTGT ATGA
 
Protein sequence
MMPEIGNGLL CLALGIALLL SVYPLWGVAR GDARMMASSR LFAWLLFMSV AGAFLVLVNA 
FVVNDFTVTY VASNSNTQLP VWYRVAATWG AHEGSLLLWV LLMSGWTFAV AIFSQRIPLD
IVARVLAIMG MVSVGFLLFI LFTSNPFSRT LPNFPIEGRD LNPLLQDPGL IFHPPLLYMG
YVGFSVAFAF AIASLLSGRL DSTYARFTRP WTLAAWIFLT LGIVLGSAWA YYELGWGGWW
FWDPVENASF MPWLVGTALM HSLAVTEQRA SFKAWTLLLA ISAFSLCLLG TFLVRSGVLV
SVHAFASDPA RGMFILAFMV LVIGGSLLLF AARGHKVRSR VNNALWSRES LLLANNVLLV
AAMLVVLLGT LLPLVHKQLG LGSISIGEPF FNTMFTWLMV PFALLLGVGP LVRWGRDRPR
KIRNLLIIAF ISTLVLSLLL PWLFESKVVA MTVLGLAMAC WIAVLAIAEA ALRISRGTKT
TFSYWGMVAA HLGLAVTIVG IAFSQNYSVE RDVRMKSGDS VDIHEYRFTF RDVKEVTGPN
WRGGVATIGV TRDGKPETVL YAEKRYYNTA GSMMTEAAID GGITRDLYAA LGEELENGAW
AVRLYYKPFV RWIWAGGLMM ALGGLLCLFD PRYRKRVSPQ KTAPEAV