Gene CPR_2266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2266 
Symbol 
ID4204934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2490206 
End bp2491354 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content27% 
IMG OID642566818 
Producthypothetical protein 
Protein accessionYP_699542 
Protein GI110802894 
COG category 
COG ID 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.580808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTATCA AAAAAAGAAA AAGTAAAAAA GGGTTTACAG TAATAGAAGC TATAATTAGT 
TTAGTTTTGT TTACTTTAAT AATGATACCC CTAGGTAGCT TTACTTTAAC AGCTGTTAAA
ACATCAGCAA AAAGTGCCAC AAAAGAGCAG GCTATAAATG CTGGACAAGG TGTCATAGAA
CAATTAAAAA CAATTAAATT AAGCCAGTTT AAGAAAATAG ACTCAGATGC AAATTATAGT
GAGAGTAATA AAGCTACATT AAAGCTTGGA AACTTAAATG TTATAAAGAC AAAAGATGAT
CCAACTTATA AAATTGAAGG TGAATATGTA GACCCTAATA ATAAGAAGAA ATTTAATATA
GATGGAGATA TAACACAGGG GAAGCTTGAG GATAATGGAG AAAGTGGTAA GGAACATACT
TTGGATAAAA AAGAGCCGAA AAAAAATTAT GTGGTATATA TAGGAGAAAG CCTTATAACA
GTTTATAAAA TTGTTTCAAA AGAAAATATT TTAAAAAAAT ACATAGATAA CAATACAATT
AAAATAATTG ATGATAAAGA TAATGACTTG GAAGAAGCTA AAATAGTAAA TGGAATTTTT
AAAGGAAGTA GTATAAAGCT TAGACGCTGG TTACTTATAA AGAATAAAGA ACTAAAAAAT
AATGATAATG AAATTACAAC TCTTTTTATA GATGAGAATA GTAATGGTGA GATAAATATA
AAAGCAGAAA ATGAAGATAA GGGAGAAGAG ATATCAACCC CTGATAGTCC AAGTGAAGAA
AGAAAAAAGC TTGTTAAAAT ATTAAATGAA TATACTGATG ATTTTGAATT AGAGGATGGA
GAGAAAAAAA CTGATATAAT GGTGTATTTT TATAGAAGTT CTGCAAATAA AAAAATAGAT
TTAAAAGTGT CAAATAAGTG TTCAGGAAGT CGATTGAACA TTTATACATG TAAGAATGAA
GGAAGTGGAA TAAGTTATAA TATTTATACT TCAACTGAAA ATTCTCATGG CAATATAAGT
GTATTTAAAA ATTATATAGA GGGAAATAGC GAAGATTTAA GAGGGCAATT ATTTAATATT
AATTTAAAGA TTAAGGAAAA AGATGAGGTT CTTTATAACC TTAATACAAC TGAGTTTATA
GGAGGGTGA
 
Protein sequence
MGIKKRKSKK GFTVIEAIIS LVLFTLIMIP LGSFTLTAVK TSAKSATKEQ AINAGQGVIE 
QLKTIKLSQF KKIDSDANYS ESNKATLKLG NLNVIKTKDD PTYKIEGEYV DPNNKKKFNI
DGDITQGKLE DNGESGKEHT LDKKEPKKNY VVYIGESLIT VYKIVSKENI LKKYIDNNTI
KIIDDKDNDL EEAKIVNGIF KGSSIKLRRW LLIKNKELKN NDNEITTLFI DENSNGEINI
KAENEDKGEE ISTPDSPSEE RKKLVKILNE YTDDFELEDG EKKTDIMVYF YRSSANKKID
LKVSNKCSGS RLNIYTCKNE GSGISYNIYT STENSHGNIS VFKNYIEGNS EDLRGQLFNI
NLKIKEKDEV LYNLNTTEFI GG