Gene CPR_1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1303 
SymbolrpoN 
ID4205088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1473975 
End bp1475360 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content28% 
IMG OID642565859 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_698625 
Protein GI110803124 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.357078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAATGG ATTTTAATTT GAATTTAACT CAAGAGCAAA AGCTTATTAT GACCCAGCAA 
ATGCAACTTT CAATAAAACT TTTACAAATG TCAACATATG ATTTAAGAGA ATATATTGAA
AAAGAATTTT CAGAAAATCC TGTATTAGAA GCTCAGTATG AGGGTACTAA AGAGGTGTCA
AAAGAGCAAG ATAGATTAGA ATATAAAGAG TTATTAAAAT ACTTAGAGTC AGATAATTAT
GGTTCTCAAA GTTATGGTGA ATATGATGAT GAAGAAATAT CACCTTTTAC TTTCATAAGT
AAGCCAGAAT CTTTAACAGA TTATTTAGAA GGGCAAATAT TAGAACTACC CATAGACGAA
TATATGAGAA GTGTATGTAG TTATATGGTT GAGTGTTTAG ATCAAAAGGG ATATTTAGAT
ATAAAAAAAG AAGAATTAAT TAATGAGCTA GATTGTTCTG AAGAGACTTT TAATAGGGCT
TTAATAGTTA TTCAAAACTT AGAACCTGCT GGTATAGGAG CAAGAGATTT AAAGGAATGC
TTAGAAATTC AGTTAGAAAG AAAAGGTGAA AATGACCCTA TAGTTAAAGA GATTATATAT
AATCATTTAG ATGATTTAGC AGATAATAAA TATCAAGTTA TTGCAAAGGA TTTAGGAATT
ACTCCTAAAA AAGCACAAGA TTATGGAGAT TTGATAAAAA CTTTAGAACC AAAACCATCA
AGAGGCTTTT ACACTGGTGA CGAAGTAGGG TTTATAATTC CTGATGCAGA AATACGAAAG
ATAGATGGAG AATTCCTCAT ATTAATGAAT GATGGAGTTT TACCTATGCT TTCAGTTAAT
CCTTTATATA AAGCTATATT AAAAGATAGT ACTAATGATA AAGAGGCTAC AGAGTATGTA
AAGGAAAAAA TAGAAAAAGC TATGTTTTTA ATTAAAAGTA TAGAGCAAAG AAAAAGTACT
TTATACAAAG TTCTGCAAAA AATACTTGAA AAGCAAAAGG ATTATTTTGA AAAGGGAGAG
AAATATTTAA AGCCTATGAC TTTAAAAGAA ATAGCTGAGA AACTAGAAAT GCATGAATCA
ACTATTTCAA GAGCTATAAG AGATAAGTAT ATTTTAACTT CTATGGGAAC AATAAAAATA
AAGAATCTCT TTGTAAACTC AATAAGTAAT AAAGAAAAAA GTCATGGAGA AGAAGATGTT
ACAGTTATAA ATATAAAAAA AGCTTTAGAA GAAGTAATTA AGAAAGAGGA TAAAAGGAAG
CCCTTATCAG ATCAAGCCAT AAGCGAAATT TTAAAAGAAA AAGGAATGGT TATTTCAAGA
AGAACTGTGG CAAAATACAG AGAAGAGTTA GGCATAAAGT CATCTAGCAA GAGAAAAAGA
TTTTAA
 
Protein sequence
MLMDFNLNLT QEQKLIMTQQ MQLSIKLLQM STYDLREYIE KEFSENPVLE AQYEGTKEVS 
KEQDRLEYKE LLKYLESDNY GSQSYGEYDD EEISPFTFIS KPESLTDYLE GQILELPIDE
YMRSVCSYMV ECLDQKGYLD IKKEELINEL DCSEETFNRA LIVIQNLEPA GIGARDLKEC
LEIQLERKGE NDPIVKEIIY NHLDDLADNK YQVIAKDLGI TPKKAQDYGD LIKTLEPKPS
RGFYTGDEVG FIIPDAEIRK IDGEFLILMN DGVLPMLSVN PLYKAILKDS TNDKEATEYV
KEKIEKAMFL IKSIEQRKST LYKVLQKILE KQKDYFEKGE KYLKPMTLKE IAEKLEMHES
TISRAIRDKY ILTSMGTIKI KNLFVNSISN KEKSHGEEDV TVINIKKALE EVIKKEDKRK
PLSDQAISEI LKEKGMVISR RTVAKYREEL GIKSSSKRKR F