Gene CPR_1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1302 
Symbol 
ID4205928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1472713 
End bp1473747 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content31% 
IMG OID642565858 
Producttranscription regulator 
Protein accessionYP_698624 
Protein GI110803988 
COG category[K] Transcription 
COG ID[COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.368614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTAGAGA TTTTAAAATT GCAAAAGATA ATAGTTCCTG AAATGGTAGA ACTATTAGTT 
AAAAGATACA ATATAATCAG AACAATTTAT TATAATCAGC CAATAGGTAG AAGAGCTTTG
GCAAACAATT TAAATTTAGG TGAAAGAATT GTAAGAACAG AAATAGGTTT TTTAAAATCT
CAAAACCTTA TACAAATAAA CACTCCAGGT ATGTCTGTAA CCCAAGAAGG AGAGTTCATG
TTAGAATCTT TAAAAGGATT TATACATGAA ATAAAAGGGC TATCTAATCT TGAGAACAAA
TTATGTGATA TGCTTAATAT AAGAAATGTA ATTGTTGTTC CAGGAGATTG CAGTGAGGAT
GAAAACACTA TAAAAGAGCT TGGAAAAGCC GCTGCAAATT ATTCTAAGAA TATCATTAAA
GATGGCTATA CTATTGCTGT TACTGGTGGA AGTACTGTAA AAGAAGTTAT AGACGAATTA
CCAGAAATGG CAAACCTAAA AAATGTGTTA GTGGTGCCTG CAAGAGGTGG TATGGGTAAG
AAGGTTGAGA CGCAGTCAAA TACATTAGCT GCAAATTTAG CTAAAAAGTT AAATGGAACC
TATAAAATGC TTCATGTTCC TGATAATATA AGCAAAGAAG TAATGGATGC TTTAATGAAG
CAGGATGATA TGAAGGAACT TATACAAAAT ATTAAAAATG CAGATATGCT AATATATGGT
ATAGGGCAGG CTAAAAAGAT GGCTAATAAA AGAGGAGTGC CTAAGGATCA AATAATAAAG
TTATTAGATT TAGGTGCTAT AGGAGAAGCT TTTGGATGCT ACTTTAATGA TAAGTCAGAA
GTTGTTTATG TAATGCCAAC TTTAGGGGTT ACAATAAATG ATCTTAGAAA AATAAAAAAT
CACATAGCTG TAGCTGGTGG TAAGAATAAA GCAGAAGCTA TATTAACTAC TGTTTATGGC
AATGAAAATG CCGTTCTTAT TACTGATGAA GGAGCAGCTA ATGAAATTTT AAATTTTTTA
AATAGAGATG AATAA
 
Protein sequence
MLEILKLQKI IVPEMVELLV KRYNIIRTIY YNQPIGRRAL ANNLNLGERI VRTEIGFLKS 
QNLIQINTPG MSVTQEGEFM LESLKGFIHE IKGLSNLENK LCDMLNIRNV IVVPGDCSED
ENTIKELGKA AANYSKNIIK DGYTIAVTGG STVKEVIDEL PEMANLKNVL VVPARGGMGK
KVETQSNTLA ANLAKKLNGT YKMLHVPDNI SKEVMDALMK QDDMKELIQN IKNADMLIYG
IGQAKKMANK RGVPKDQIIK LLDLGAIGEA FGCYFNDKSE VVYVMPTLGV TINDLRKIKN
HIAVAGGKNK AEAILTTVYG NENAVLITDE GAANEILNFL NRDE