Gene EcHS_A2334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2334 
SymbolccmF 
ID5591291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2333940 
End bp2335883 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content57% 
IMG OID640921460 
Productcytochrome c-type biogenesis protein CcmF 
Protein accessionYP_001458995 
Protein GI157161677 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1138] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR00353] c-type cytochrome biogenesis protein CcmF 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCCGG AAATTGGTAA CGGGCTGCTG TGTCTGGCGC TAGGAATTGC GCTGCTGCTG 
TCCGTGTATC CGCTATGGGG CGTAGCGCGC GGAGATGCGC GCATGATGGC GTCTTCCCGC
TTGTTTGCCT GGCTGCTGTT TATGTCTGTG GCTGGCGCAT TTCTGGTACT GGTCAATGCC
TTCGTGGTCA ACGACTTCAC CGTCACCTAT GTTGCCAGCA ACTCCAATAC CCAGCTTCCG
GTGTGGTATC GCGTGGCGGC TACCTGGGGC GCGCATGAAG GCTCGCTCCT GCTGTGGGTG
CTGCTGATGA GCGGCTGGAC CTTTGCGGTA GCGATTTTTA GTCAGCGTAT TCCGCTGGAT
ATTGTGGCCC GCGTACTGGC GATAATGGGG ATGGTCAGCG TCGGCTTTTT GCTGTTCATT
CTCTTTACCT CTAACCCGTT CTCACGCACG TTGCCGAACT TCCCGATTGA AGGGCGCGAT
CTTAACCCGC TGTTACAGGA TCCGGGGCTG ATCTTCCATC CGCCTCTGCT CTATATGGGG
TACGTGGGGT TCTCGGTGGC GTTTGCTTTT GCCATTGCTT CTTTGCTGAG CGGGCGTCTG
GACAGCACTT ATGCGCGTTT TACTCGCCCG TGGACGCTGG CGGCGTGGAT TTTCCTGACG
CTCGGCATTG TGCTCGGTTC CGCATGGGCC TATTACGAAC TCGGCTGGGG CGGCTGGTGG
TTCTGGGATC CGGTAGAAAA CGCCTCGTTT ATGCCGTGGC TGGTGGGGAC TGCGCTGATG
CACTCACTGG CGGTCACTGA ACAACGCGCC AGTTTCAAAG CGTGGACATT ACTGCTGGCA
ATCAGTGCCT TCTCGTTGTG TCTGCTGGGG ACCTTCCTCG TGCGTTCCGG CGTGCTGGTA
TCGGTACACG CGTTTGCGTC TGATCCGGCG CGCGGTATGT TTATCCTCGC CTTTATGGTG
CTGGTGATTG GCGGTTCGCT GCTGCTGTTT GCCGCGCGTG GACACAAAGT TCGCTCACGC
GTAAACAATG CGCTGTGGTC GCGGGAATCT TTGCTGTTAG CGAACAATGT TTTGCTGGTC
GCTGCGATGC TGGTGGTGTT GCTGGGGACG CTGCTGCCGT TGGTGCATAA GCAACTGGGA
CTGGGCAGTA TTTCGATTGG CGAACCGTTC TTCAACACCA TTTTTACCTG GCTGATGGTG
CCGTTTGCGC TACTGCTTGG TGTCGGTCCT CTGGTGCGCT GGGGGCGGGA TCGCCCGCGT
AAGATCCGCA ATTTATTGAT TATCGCCTTC ATCTCTACGC TGGTGCTGTC GCTGCTGTTG
CCGTGGCTGT TCGAAAGCAA AGTTGTGGCG ATGACGGTGC TCGGCCTGGC AATGGCCTGC
TGGATTGCGG TGCTGGCAAT TGCGGAAGCT GCGCTACGTA TTTCACGCGG CACGAAAACC
ACCTTCAGTT ATTGGGGGAT GGTGGCGGCT CACCTTGGGC TGGCAGTGAC AATTGTTGGC
ATTGCCTTTA GCCAGAACTA TAGCGTTGAG CGTGATGTGC GCATGAAGTC CGGCGATAGC
GTCGATATTC ATGAATATCG CTTCACCTTC CGTGATGTCA AAGAGGTGAC TGGCCCGAAC
TGGCGTGGCG GTGTGGCGAC TATCGGCGTA ACGCGCGATG GCAAGCCGGA AACGGTGCTG
TATGCGGAAA AACGTTATTA CAACACTGCC GGGTCGATGA TGACCGAAGC GGCAATTGAC
GGCGGCATCA CGCGTGACCT GTACGCCGCG CTCGGTGAAG AGCTGGAAAA CGGCGCGTGG
GCCGTGCGTC TTTACTACAA ACCATTTGTT CGCTGGATTT GGGCGGGCGG GCTGATGATG
GCGTTGGGCG GACTGCTGTG TCTGTTTGAT CCTCGCTATC GTAAGCGCGT GAGTCCGCAA
AAAACTGCGC CGGAGGCCGT ATGA
 
Protein sequence
MMPEIGNGLL CLALGIALLL SVYPLWGVAR GDARMMASSR LFAWLLFMSV AGAFLVLVNA 
FVVNDFTVTY VASNSNTQLP VWYRVAATWG AHEGSLLLWV LLMSGWTFAV AIFSQRIPLD
IVARVLAIMG MVSVGFLLFI LFTSNPFSRT LPNFPIEGRD LNPLLQDPGL IFHPPLLYMG
YVGFSVAFAF AIASLLSGRL DSTYARFTRP WTLAAWIFLT LGIVLGSAWA YYELGWGGWW
FWDPVENASF MPWLVGTALM HSLAVTEQRA SFKAWTLLLA ISAFSLCLLG TFLVRSGVLV
SVHAFASDPA RGMFILAFMV LVIGGSLLLF AARGHKVRSR VNNALWSRES LLLANNVLLV
AAMLVVLLGT LLPLVHKQLG LGSISIGEPF FNTIFTWLMV PFALLLGVGP LVRWGRDRPR
KIRNLLIIAF ISTLVLSLLL PWLFESKVVA MTVLGLAMAC WIAVLAIAEA ALRISRGTKT
TFSYWGMVAA HLGLAVTIVG IAFSQNYSVE RDVRMKSGDS VDIHEYRFTF RDVKEVTGPN
WRGGVATIGV TRDGKPETVL YAEKRYYNTA GSMMTEAAID GGITRDLYAA LGEELENGAW
AVRLYYKPFV RWIWAGGLMM ALGGLLCLFD PRYRKRVSPQ KTAPEAV