Gene CPR_1881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1881 
Symbol 
ID4205039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2075279 
End bp2076661 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content27% 
IMG OID642566431 
Productcell envelope-related function transcriptional attenuator 
Protein accessionYP_699191 
Protein GI110803230 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.723477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAA AGAAATATTT TAATTTTAGT AAAAAACAAA AAATAATTAT ATTTTCTTTC 
TTAACTATAA TTATTTTAAT CGCATCTATA GTAAGCTTTT TTGTTTTTAG ATTTTACACC
CACTCCTACG AAGGAAGTAA TGATCCTGCT AAAGTTAATA CTGTAGATGA AGATATTAAA
TTTAAAGAAG TTCCAGGAAT AACAAATATT CTTTTGCTTA GTAGCGATGC TAGACCTGGA
GAAAATGTCT CTAGATCAGA TTCCATAATG ATCTTAACAA TAGATAATAT CAATAAAAAA
TTAAAAGTAA CTTCTTTAAT GAGAGATATG TTAGTAAAAA TAGATGGTCA TGGAGAAGAA
AAATTAAATC ATGCTTTTGC TTATGGTGGT CCTACTAAAA CAATTGAAAC TATTCAAAAT
AATTTCGGAA TAAAATTAAA TAATTATGTA ATTGTAGATT TTAATGCTTT TGTTAAAGTT
ATAGATGCTA TAAACGGAAT TGAAGTAACC GTTAAAGATT ATGAACTTGA TGAACTTAAT
AAATATATTT TAGATGGTGG CGGTTCTGAA AAAGATTTAC TTCCTTCATC TGGAACTTAT
AATCTTAATG GTTATCAAGC CTTATCTTAT GCTAGAATTA GAAAAGTTGG TAATGGTGAA
TATGAAAGAA CTGAAAGACA GAGAGCTGTT CTTCAAATTG CTTTAGAAAA AATTAAAGAT
ATGTCAAAAG TTAAAATTGT TTCTCTGCTT AATGAATTAT TCCCATATGT TAAAACTAAT
ATTTCTTTAG GAAATGCTAT GGATTATGGA TTTACAGCTT TAAATGTAGG TAAAAAGTGT
AACTTTAAAA TAGAGCAATT CAGAGTTCCT CTAGATAGTA TTTCAAAAGG TGGAATCATT
AATAATAAAG GTTGGGTCTT TGTTATTGAT AAGGTAGAAA CATCAAAGGC TCTTAAAGAG
TTCATCTTTA ATGACAATAA AAATTATGAG CCTGATACTA GTAATTTTGA CTCTATAATA
GAACAATACT TTAATGACTA TGATATTAAA GATGATACTA CTCACCCAGA TTATGAATAT
GTACCTATTA TAAACACTAA TGGCAACTTT GAAAATTCTA AAAACAATTT AAATACTAGC
AGTAATAATA ATTCTAATGA TAAAGTAGAA AAATCATCTA ACCAAGGAAG CAATGAAAAT
TCTTCTAATA AACCAAGTAG TGGAAGCACT TCTACTAGTA ATAATTTTAC AGAAAAACCA
AACACTTCTC AAGAAACTTC AAAACCATCT GAAAATAATG GAGATAGTGG AAATATAGAT
TCCTCCAACA CTCCCTCTAT TGATGGAAAT GCTTCTTCTG AGAATAATAC GGAAGTAAAT
TAA
 
Protein sequence
MDKKKYFNFS KKQKIIIFSF LTIIILIASI VSFFVFRFYT HSYEGSNDPA KVNTVDEDIK 
FKEVPGITNI LLLSSDARPG ENVSRSDSIM ILTIDNINKK LKVTSLMRDM LVKIDGHGEE
KLNHAFAYGG PTKTIETIQN NFGIKLNNYV IVDFNAFVKV IDAINGIEVT VKDYELDELN
KYILDGGGSE KDLLPSSGTY NLNGYQALSY ARIRKVGNGE YERTERQRAV LQIALEKIKD
MSKVKIVSLL NELFPYVKTN ISLGNAMDYG FTALNVGKKC NFKIEQFRVP LDSISKGGII
NNKGWVFVID KVETSKALKE FIFNDNKNYE PDTSNFDSII EQYFNDYDIK DDTTHPDYEY
VPIINTNGNF ENSKNNLNTS SNNNSNDKVE KSSNQGSNEN SSNKPSSGST STSNNFTEKP
NTSQETSKPS ENNGDSGNID SSNTPSIDGN ASSENNTEVN