Gene CPR_0026 details

Gene Information

Locus tag: CPR_0026
Symbol:
ID: 4205554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 33505
End bp: 34773
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 26%
IMG OID: 642564569
Product: polyA polymerase family protein
Protein accession: YP_697371
Protein GI: 110802983
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase
TIGRFAM ID:
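
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(33505, 34773)
print(length_bp)                  # 1269
print(protein_length(length_bp))  # 422
```

Both results match the Gene Length (1269 bp) and Protein Length (422 aa) values listed in the record.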


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAC CTAACAATGT ACAATATATC TTAGAGAAAT TTGACTCTAA TGGTTTTGAA 
GCCTTTATAG TTGGTGGTTG TGTAAGAGAT TCTTTATTAA ATAAAAAACC TCAAGATTAT
GATATTACAA CCAATGCATT CCCTGAAAAA ATAGAAGAGC TTTTTGATAA AACTATCCCT
ACTGGTATTA AACATGGAAC AGTAACAGTT TTAATCGACA AAACTCCTTA TGAAGTAACT
ACTTATAGGG TAGATGGGGA ATATTTAAAT AATAGAAAGC CTAAAGACGT AAAGTTCGTT
TCTAATATAG AAGAAGATTT ATCAAGAAGA GATTTTACTA TAAATGCAAT GGCATATAGC
CCATATTTAG GATTTAAGGA TTGTTTTAAT GGAAAAGAGG ATCTAAAAAA CAAATTAATA
AGATGCGTTG GAGATCCTGA TAAACGCTTC TCTGAAGATG CCTTAAGAAT GCTTAGAGCA
ATTAGATTTA GTTGTCAATT AAACTTTAAA ATAGAAAAAT TAACTGCTGA ATCTATAAGA
AAGAATTTTA AATTAATAAA AAATATAAGC ATGGAAAGAA TTCAAAGTGA ATTTACTAAA
ATCATTCTAA GCAATGAGCC TGATAGAGGT CTTATGCTTC TTAGAAAGCT AGGATTTTCT
GACTTTTTAG TTAAGGAATT TAAGAATTTA AAACTAATAA ATTGCTATGA TTTATATGAT
GATATTCATG ATACTTATGG ATTAATAAAT TCACTTCCTA AAAAGCTTCA TGTAAGATTA
GCAGGATTAT TCTATAAGGT TTTTAATTCT GAAAATGCAG TTGAGAAGTG CAGAACTATA
TTAAAGAAAC TTAAATATGA TAATAATACA ATCAACGATA CTTGCAACTT AGTAGAAAAT
ATAAATAGTA TTTCATGTAA TATGACAAGA AAAAAACTAA AACTACTTAT AAATTCAGTT
GGAACCGAAA ATATCTTTGA TTTATTAGAT TTACAAAAAT CATATTTATC TTATATGGAT
GAATATGATA CTGAGTGTAT AGATATATTA AAAAATAGAG TTTCTGATAT ATTAGCTTCA
AAAGAACCCA TATTTATTAA GGACTTAGCC ATAACAGGAA ATGACTTAAT TACCGAACTT
AATTTTAAAC CTGGAAAAAA TATAGGTGTT ATATTAAATT TTCTTCTTGA AAATGTAATG
CAAACACCAG AGTTAAACAA TAAGGAAGAC TTACTAAACC TTAGTAAGCA ATTTTATTCA
TATAATTAA
 
Protein sequence
MKLPNNVQYI LEKFDSNGFE AFIVGGCVRD SLLNKKPQDY DITTNAFPEK IEELFDKTIP 
TGIKHGTVTV LIDKTPYEVT TYRVDGEYLN NRKPKDVKFV SNIEEDLSRR DFTINAMAYS
PYLGFKDCFN GKEDLKNKLI RCVGDPDKRF SEDALRMLRA IRFSCQLNFK IEKLTAESIR
KNFKLIKNIS MERIQSEFTK IILSNEPDRG LMLLRKLGFS DFLVKEFKNL KLINCYDLYD
DIHDTYGLIN SLPKKLHVRL AGLFYKVFNS ENAVEKCRTI LKKLKYDNNT INDTCNLVEN
INSISCNMTR KKLKLLINSV GTENIFDLLD LQKSYLSYMD EYDTECIDIL KNRVSDILAS
KEPIFIKDLA ITGNDLITEL NFKPGKNIGV ILNFLLENVM QTPELNNKED LLNLSKQFYS
YN