Gene CPR_1564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1564 
Symbol 
ID4205481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1756065 
End bp1757774 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content27% 
IMG OID642566115 
Producthypothetical protein 
Protein accessionYP_698880 
Protein GI110802196 
COG category[S] Function unknown 
COG ID[COG2898] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.201819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGGATT CACTAAAAAA AAGTTATAGA CATTTAAAAA ATATTTTAGG ATTTGTTACT 
GATAAAAGAA ATTATGAAAA TATAAAGAAG CTATTAAAAA ATTACAAAAT CTTAAGTGAT
ATATCAAATA TAATAGTATC AGTTTTGGTA TTTCTAAGTG GTATTCTTTT AATAATTTCA
GGGATTTATC CTAGTATATT TTATAAGATA AAATTTTTAG ATAATATATA CAGTTTATCT
TTTTTAAGGT TTTCACATAG AGCTTCAATA TTAATTGGAT TAATGTTAAT AATGACCTCT
AAGGAAGTTT TCTTTAAGGT AAAAAGAGCT TATTATGTTA CATTAACATT GCTTATAGTA
GGAGGAGCCT TTGCCTTTAT AAAAGATTTA GATTACAAAG AAGGAATTTT TATTTTAGGA
GTAATAATAC TTCTAATATT ATCAAAAAAG AGTTTTTACA GAAAAAGTAT TCCTATTAAG
GTTACTAAAT TAAGTGGGAT ATTAATAGTT CTTTCAATTG TAATGATTAT CTTTGCGAGT
TTTATACATA AATTTAACAT ACATTTTAGC AAGAACTATA AATACTATAT AGACTTTTTC
CATAGCACAA AGGGGTATTT AAGAATAGCA TTATTCACAT ATATATCCTT TATAATATTT
GTGATAATAT GGTATTTAAC AATGCCTAAA ATAGAAGATG ACGAAAGGTA TATGGATGCT
GATTTAGAAA AGGTATCAAA ATTCTTTAAA GAAATAGATT ATGGAACAAT ATTCTCCCAT
TTAGTTTATT TAAAGGATAA AAAGGTCTTT TGGGCTAATG AAGGAGAGTC CTTAATAATG
TATAGCAAGT ACAAAGATAA GATAATAGTT TTAGGAGATC CTATAGCTAC TAAGGAAAAC
CTATATAGTT GTATAGAAGA GTTTCAAGCT TTTACAAATT TATATGGATA TGATGTTGTC
TTTTATGAAA TAGAAGAAAA AAACTTTTCT ACCTATCATG ATGCAGGGTA TTATTTCTTT
AAGTTAGGAG AAGAGGCAAG GATAGATTTA GAAGAATTTA ATTTGATTGG TTCTAAAAAG
AGTGCCTTTA GAAACACCTT AAGAAGAGTT GAAAGGGAAG GATATAATTT TAGCATTATA
GAGCCTCCTT TTAATAATGA GGTAGTAAGT CAATTGAAGG AAATATCTGA TAAATGGTTA
GGGGACAGAA AAGAAAAGGG ATTTTCTTTA GGATGGTTTA GTGAGGATTA TATACAAAGA
TCACCTATAG CTATTTTAAA GAATGAAGAA GAAAATAAGA TTATGGGCTT TGTAACAATA
ATGGATGCTA ATGATGGAGG GGAGACAGTA GCAATAGATT TAATGAGAAT AGATAAAGAT
GCTCCAAATG CCTCTATGGA TTACCTAATG CTTAATTTAT TCTTAACCTT TAAAGAAAAA
GGATATAAGT ATTTTAGCTT AGGAGAAGCA CCATTATCTA ATGTAGGATT TAACACTCAT
TCACATTTAC AAGAAAAGCT TGCAAGGTTA GTTTATAATA GTGGTAATAT ATTCTATAGT
TTTGATGGAC TAAGAAGATA TAAGTCAAAG TTTTCTCCAA TTTGGCAACC TAGATATTTA
GCATATCCTA AGTTTATGTC CTTACCAGAG GTGTTTATTA ACTTATGTTT ATTAATAGCT
AATTCAAAGG AAAGAGTAGA GAAAAAATAA
 
Protein sequence
MWDSLKKSYR HLKNILGFVT DKRNYENIKK LLKNYKILSD ISNIIVSVLV FLSGILLIIS 
GIYPSIFYKI KFLDNIYSLS FLRFSHRASI LIGLMLIMTS KEVFFKVKRA YYVTLTLLIV
GGAFAFIKDL DYKEGIFILG VIILLILSKK SFYRKSIPIK VTKLSGILIV LSIVMIIFAS
FIHKFNIHFS KNYKYYIDFF HSTKGYLRIA LFTYISFIIF VIIWYLTMPK IEDDERYMDA
DLEKVSKFFK EIDYGTIFSH LVYLKDKKVF WANEGESLIM YSKYKDKIIV LGDPIATKEN
LYSCIEEFQA FTNLYGYDVV FYEIEEKNFS TYHDAGYYFF KLGEEARIDL EEFNLIGSKK
SAFRNTLRRV EREGYNFSII EPPFNNEVVS QLKEISDKWL GDRKEKGFSL GWFSEDYIQR
SPIAILKNEE ENKIMGFVTI MDANDGGETV AIDLMRIDKD APNASMDYLM LNLFLTFKEK
GYKYFSLGEA PLSNVGFNTH SHLQEKLARL VYNSGNIFYS FDGLRRYKSK FSPIWQPRYL
AYPKFMSLPE VFINLCLLIA NSKERVEKK