Gene CPF_1540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1540 
SymbolyhbH 
ID4202573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1756478 
End bp1757656 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content31% 
IMG OID638082418 
Producthypothetical protein 
Protein accessionYP_695983 
Protein GI110799649 
COG category[S] Function unknown 
COG ID[COG2718] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02877] sporulation protein YhbH 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.212388 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCATAT TTAGAGATCA AGCAGAAAAC CATGTAGAAC ATGATAGATC TATAGAGGAT 
AGAAGACGTC ACAGGCAGTT AGTAGAAAAA TCTATTAAAG AAAATTTAGG AGATATACTT
TCAGAGGAGA GCATAATTGG AGAGACTAAA AATAAAAAAT ATAAAATTCC TATTAGAGGA
ATAAAGGAAT ATCAATTTAT TTATGGTGCA AATAATAAAG GGGTCACAAC AGGTACTGGA
GAAGAAAGAC GAGGGGATAG AATTTCTAGT GATAAGAGAA AAGCTATTTC TAATAATAAA
GCAGGAAATC AGGAGGGAAA GGATATATAT GAAACTGAGA TAACCTTAGA GGAACTTATG
GATTATATAG TTGAGGATCT TGATTTACCT AACTTAGATA GGAAGAAGTA CTCTGAAATA
ATAGTTGAAA GTGCAGCTAA AAAAAGAGGA TATCAAAAAT ATGGTGTAAG GCCAAGGCTT
GCAAAGAAAA AAACTGTTAT GTGTAAAATA GCTAGAAAAC AAGGAAAAAA AAGAGCATTG
CGTGAAATAG GAGAAGAAGC GGAAATAGGA AGATTTCCTT TTAGAGAAGA TGACTTAAGA
TATTATAAAG TGAAAAAACA TCCTAAAAAA GAAAGCAATG CTGTAATGAT TTTTATAATG
GACGTTTCAG GTTCTATGGA TAACACTAAA AAATATTTAG CTAGATCATT TTTCTTTGTT
TTATCTAGGT TTATAAGGAG AAAATATAAT AATGTAGCCT TTGAATTCAT ATCTCATACT
ACTACAGCTA AGAATGTTAA TGAATATGAG TTTTTCCACA AAGGGGAATC TGGAGGAACG
TATATATCTT CGGGAATAAA TGCTGCCATA GATTTAATAA AAGAAAAGTA TAACCCAGGC
GTTTGGAATA TATATCCTTT CTATGCTTCA GACGGCGATA ACTGGAGTGA GGATAATGAA
AAGGCTATGG AAGCTGTAAA TGAAATTTCA GATTTAAGTA ATATGTTTGG ATATATAGAG
CTTTTACCAT CCACTTATTC CACTACAATG TTCTACAGAT TTAAAAAAGA AATAAGTAAG
AAAAATTTTG TCTCTGTAAC TGTAAAGGAA AAGAAGGATC TGTGGAATGC TATAAAATAT
ATGCTATCTG AAGAACTACA GGAAAAGAAT AAGGAATGA
 
Protein sequence
MAIFRDQAEN HVEHDRSIED RRRHRQLVEK SIKENLGDIL SEESIIGETK NKKYKIPIRG 
IKEYQFIYGA NNKGVTTGTG EERRGDRISS DKRKAISNNK AGNQEGKDIY ETEITLEELM
DYIVEDLDLP NLDRKKYSEI IVESAAKKRG YQKYGVRPRL AKKKTVMCKI ARKQGKKRAL
REIGEEAEIG RFPFREDDLR YYKVKKHPKK ESNAVMIFIM DVSGSMDNTK KYLARSFFFV
LSRFIRRKYN NVAFEFISHT TTAKNVNEYE FFHKGESGGT YISSGINAAI DLIKEKYNPG
VWNIYPFYAS DGDNWSEDNE KAMEAVNEIS DLSNMFGYIE LLPSTYSTTM FYRFKKEISK
KNFVSVTVKE KKDLWNAIKY MLSEELQEKN KE