Gene CPR_0271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0271 
Symbol 
ID4206599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp329558 
End bp330580 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content29% 
IMG OID642564829 
Productcell envelope-related function transcriptional attenuator 
Protein accessionYP_697601 
Protein GI110801829 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.968761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCAAG GCAAACATGC AAAAAAGAAT GATTCTAATA GTCAAGGTAA TACACCTAAG 
AAAAAAAGTA AAGTGAAAAT AATAGTATTA ACATTATTCT TCTTATTACT TATAGGTATA
GGACTTGGAG CTACATATGT TTACTCAACA TTAAATAAAA TGGATATAAA GAAAATAGCT
CAAGATGATA AATCTTTAGG TATTGATGAA TCAAATAAAG ACCTATTCCA AGATGGTATT
TTAAATATAG CTCTTTTTGG AGTAGATAGC AGAGATCATA ATAATGTTGG ACGCTCTGAT
TCAATAATAA TAGCTACAAT AGATACTAAA CATGATAAAA TAAAACTTAC ATCTCTTATG
AGAGATAGTT ATGTTGAAGT TGATGGACAT GGTAAAACTA AATTAACTCA TGCTTATGCT
TATGGTGGTC CTACTTTAGC ATTAAAAACA ATAAATGAAA ACTTTGGATT AGATATAAAA
GACTATGTAA CTGTTAACTT TGATAACTTA GCTGAAATAA TAGATGATTT AGGTGGAGTA
CCAATAAATA TAAAACCTTA TGAAGTTAAG GAAGTTAATA ATTACGCTAA AAATGTTGCA
GAAATTGCTG GAAGAGAATA TACGCCAGTT AGTGAAGGTG AGCAAGTATT AAATGGTGCC
CAAGCTGTAG GTTACTCTAG AATTCGTTAT GTTGGTGATG GAGATTATGA AAGAACTGAA
AGACAAAGAA ATGTTCTTGA TGCAATCATA AAGAAACTTT CAACATTAAA ACCTTCTGAA
TATCCTGAAA CAATAAAAAA ATTTTTACCT TATGTTGAAA CAAACTTAAC TCCATCTAAA
ATACTTAGTA TTGCTAAATC AGTTGCTTCA ACTGGTATCC CACCTGTTGA AAATATGCGT
TTCCCTCTAA ATGGATATTG CAAAGGGGAA ATGATTGATG GTGTTTGGTA TTTAACATTT
GATGAAGCTA AAACAAAGGA ACAAATACAA AACTATATAT ATAAGGATGT TAATCCAAAA
TAA
 
Protein sequence
MSQGKHAKKN DSNSQGNTPK KKSKVKIIVL TLFFLLLIGI GLGATYVYST LNKMDIKKIA 
QDDKSLGIDE SNKDLFQDGI LNIALFGVDS RDHNNVGRSD SIIIATIDTK HDKIKLTSLM
RDSYVEVDGH GKTKLTHAYA YGGPTLALKT INENFGLDIK DYVTVNFDNL AEIIDDLGGV
PINIKPYEVK EVNNYAKNVA EIAGREYTPV SEGEQVLNGA QAVGYSRIRY VGDGDYERTE
RQRNVLDAII KKLSTLKPSE YPETIKKFLP YVETNLTPSK ILSIAKSVAS TGIPPVENMR
FPLNGYCKGE MIDGVWYLTF DEAKTKEQIQ NYIYKDVNPK