Gene EcolC_1454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1454 
Symbol 
ID6067355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1601645 
End bp1603588 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content57% 
IMG OID641600873 
Productcytochrome c-type biogenesis protein CcmF 
Protein accessionYP_001724444 
Protein GI170019490 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1138] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR00353] c-type cytochrome biogenesis protein CcmF 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.494014 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCGG AAATTGGTAA CGGGCTGCTG TGTCTGGCGC TAGGAATTGC GCTGCTGCTG 
TCCGTGTATC CGCTATGGGG CGTAGCGCGC GGAGATGCGC GCATGATGGC GTCTTCCCGC
TTGTTTGCCT GGCTGCTGTT TATGTCTGTG GCTGGCGCAT TTCTGGTACT GGTCAATGCC
TTCGTGGTCA ACGACTTCAC CGTCACCTAT GTTGCCAGCA ACTCCAATAC CCAGCTTCCG
GTGTGGTATC GCGTGGCGGC TACCTGGGGC GCGCATGAAG GCTCGCTCCT GCTGTGGGTG
CTGCTGATGA GCGGCTGGAC CTTTGCGGTA GCGATTTTTA GTCAGCGTAT TCCGCTGGAT
ATTGTGGCCC GCGTACTGGC GATAATGGGG ATGGTCAGCG TCGGCTTTTT GCTGTTCATT
CTCTTTACCT CTAACCCGTT CTCACGCACG TTGCCGAACT TCCCGATTGA AGGGCGCGAT
CTTAACCCGC TGTTACAGGA TCCGGGGCTG ATCTTCCATC CGCCTCTGCT CTATATGGGG
TACGTGGGGT TCTCGGTGGC GTTTGCTTTT GCCATTGCTT CTTTGCTGAG CGGGCGTCTG
GACAGCACTT ATGCGCGTTT TACTCGCCCG TGGACGCTGG CGGCGTGGAT TTTCCTGACG
CTCGGCATTG TGCTCGGTTC CGCATGGGCC TATTACGAAC TCGGCTGGGG CGGCTGGTGG
TTCTGGGATC CGGTAGAAAA CGCCTCGTTT ATGCCGTGGC TGGTGGGGAC TGCGCTGATG
CACTCACTGG CGGTCACTGA ACAACGCGCC AGTTTCAAAG CGTGGACATT ACTGCTGGCA
ATCAGTGCCT TCTCGTTGTG TCTGCTGGGG ACCTTCCTCG TGCGTTCCGG CGTGCTGGTA
TCGGTACACG CGTTTGCGTC TGATCCGGCG CGCGGTATGT TTATCCTCGC CTTTATGGTG
CTGGTGATTG GCGGTTCGCT GCTGCTGTTT GCCGCGCGTG GACACAAAGT TCGCTCACGC
GTAAACAATG CGCTGTGGTC GCGGGAATCT TTGCTGTTAG CGAACAATGT TTTGCTGGTC
GCTGCGATGC TGGTGGTGTT GCTGGGGACG CTGCTGCCGT TGGTGCATAA GCAACTGGGA
CTGGGCAGTA TTTCGATTGG CGAACCGTTC TTCAACACCA TGTTTACCTG GCTGATGGTG
CCGTTTGCGC TACTGCTTGG TGTCGGTCCT CTGGTGCGCT GGGGGCGGGA TCGCCCGCGT
AAGATCCGCA ATTTATTGAT TATCGCCTTC ATCTCTACGC TGGTGCTGTC GCTGCTGTTG
CCGTGGCTGT TCGAAAGCAA AGTTGTGGCG ATGACGGTGC TCGGCCTGGC AATGGCCTGC
TCGATTGCGG TGCTGGCAAT TGCGGAAGCT GCGCTACGTA TTTCACGCGG CACGAAAACC
ACCTTCAGTT ATTGGGGGAT GGTGGCGGCT CACCTTGGGC TGGCAGTGAC AATTGTTGGC
ATTGCCTTTA GCCAGAACTA TAGCGTTGAG CGTGATGTGC GCATGAAGTC CGGCGATAGC
GTCGATATTC ATGAATATCG CTTCACCTTC CGTGATGTCA AAGAGGTGAC TGGCCCGAAC
TGGCGTGGCG GTGTGGCGAC TATCGGCGTA ACGCGCGATG GCAAGCCGGA AACGGTGCTG
TATGCGGAAA AACGTTATTA CAACACTGCC GGGTCGATGA TGACCGAAGC GGCAATTGAC
GGCGGCATCA CGCGTGACCT GTACGCCGCG CTCGGTGAAG AGCTGGAAAA CGGCGCGTGG
GCCGTGCGTC TTTACTACAA ACCATTTGTT CGCTGGATTT GGGCGGGCGG GCTGATGATG
GCGTTGGGCG GACTGCTGTG TCTGTTTGAT CCTCGCTATC GTAAGCGCGT GAGTCCGCAA
AAAACTGCGC CGGAGGCCGT ATGA
 
Protein sequence
MMPEIGNGLL CLALGIALLL SVYPLWGVAR GDARMMASSR LFAWLLFMSV AGAFLVLVNA 
FVVNDFTVTY VASNSNTQLP VWYRVAATWG AHEGSLLLWV LLMSGWTFAV AIFSQRIPLD
IVARVLAIMG MVSVGFLLFI LFTSNPFSRT LPNFPIEGRD LNPLLQDPGL IFHPPLLYMG
YVGFSVAFAF AIASLLSGRL DSTYARFTRP WTLAAWIFLT LGIVLGSAWA YYELGWGGWW
FWDPVENASF MPWLVGTALM HSLAVTEQRA SFKAWTLLLA ISAFSLCLLG TFLVRSGVLV
SVHAFASDPA RGMFILAFMV LVIGGSLLLF AARGHKVRSR VNNALWSRES LLLANNVLLV
AAMLVVLLGT LLPLVHKQLG LGSISIGEPF FNTMFTWLMV PFALLLGVGP LVRWGRDRPR
KIRNLLIIAF ISTLVLSLLL PWLFESKVVA MTVLGLAMAC SIAVLAIAEA ALRISRGTKT
TFSYWGMVAA HLGLAVTIVG IAFSQNYSVE RDVRMKSGDS VDIHEYRFTF RDVKEVTGPN
WRGGVATIGV TRDGKPETVL YAEKRYYNTA GSMMTEAAID GGITRDLYAA LGEELENGAW
AVRLYYKPFV RWIWAGGLMM ALGGLLCLFD PRYRKRVSPQ KTAPEAV