Gene CPR_1516 details

Gene Information

Locus tag: CPR_1516
Symbol:
ID: 4206423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1696926
End bp: 1698038
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 30%
IMG OID: 642566069
Product: hypothetical protein
Protein accession: YP_698834
Protein GI: 110803549
COG category: [S] Function unknown
COG ID: [COG4086] Predicted secreted protein
TIGRFAM ID:
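
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the gene record.
start_bp = 1696926
end_bp = 1698038
reported_gene_length_bp = 1113
reported_protein_length_aa = 370


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length_bp)
print(computed_protein_length == reported_protein_length_aa)
```

Under those assumptions, the reported gene length and protein length agree with the coordinates.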


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.227418
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAAGA AAAAGAGCAT TATGAAAGTT CTTTGTGGGA CAATAGTTAG TACATTCATG 
TTTGGAATGG CTACTCCAGT ATTTGCACAA AGTGAAAAAA CTGAGGCTGT AGTAACTTTA
GGAGCTAATT TAACAAAGTC AGAAAGATTA CAAATGTTAG ATGCTTTTGG AGTAAAAGCT
AATGAGGTTA AAATAATAGA TGTAACTAAT CAAGATATAA GAGAACAATT AGGTTTAGAT
ACAAGTAAAC CAATACCTGC TAGCAGTCAA TCAATATCAA GTTCTTACGT TGTGGTTAAG
GACAAAGGTG GAATAAATGT AACTACTAAC AATTTAACAG AGGTAACAGG AAGTATGCTT
GCTAATGCCC TTCTTACTTC AGGGGTAAAT AATGCTGATG TAAAGGCTGA TGCTCCATTT
AAGGTTACGG GAACAGCTGC CTTAGCAGGT ATTTTAAAAG GATTTGAAGA TGCATCAGGA
GAGGAATTAT CTCTTCCAAA GAAAGAGGCG GCAAGAGAGG AAATTTCTTT AACTAATAAT
TTAAGTAATG CTAAAACTAA AGATGGACAA ACACTAGGAA AAGATGAAGC CGCTGTGGTA
GTAAATGATA TTAAAACTGA TGTAATTAAA GATAAACCTA AAAATGATGA GGAAATAGGT
AAGATAGTAA ACAATGTTAC AAATAACTAT AATATACTTT TAACACAAGG GCAACAAGAG
CAAACAATAA AATTTATGTC TAAAATAAAT GATTTAGACT ATAACTATGG TGCTATGAAA
GAATCTTTAA ATCAAATGAA TGACAAGCTT CAACAGATAT TAAAAGACAC AGGAAAACAA
TTAGAAGAAA GTGGTCTTTT AGAAAAAGCA TTAAATGGTA TAAAAAATGT TTTAGTTGAT
ATTAAGGATT TTTTAGTAAA TATGTTTAGC TCAGCTAGTG AAAAAGTTAA AGATGGAATA
ACCTACGATG AAAATGGTAA TATAGTTATA AAAACAGGAA ATAATTCTGA TGAATCAAAG
AATCAAGAAA ATATCCAAGA TAAGCCACAA ACTCAGTCAA ATGATAATAA TCAAAATCAA
CAAAATGAAC AAGGACAAAA TCAAACAAAA TAA
 
Protein sequence
MIKKKSIMKV LCGTIVSTFM FGMATPVFAQ SEKTEAVVTL GANLTKSERL QMLDAFGVKA 
NEVKIIDVTN QDIREQLGLD TSKPIPASSQ SISSSYVVVK DKGGINVTTN NLTEVTGSML
ANALLTSGVN NADVKADAPF KVTGTAALAG ILKGFEDASG EELSLPKKEA AREEISLTNN
LSNAKTKDGQ TLGKDEAAVV VNDIKTDVIK DKPKNDEEIG KIVNNVTNNY NILLTQGQQE
QTIKFMSKIN DLDYNYGAMK ESLNQMNDKL QQILKDTGKQ LEESGLLEKA LNGIKNVLVD
IKDFLVNMFS SASEKVKDGI TYDENGNIVI KTGNNSDESK NQENIQDKPQ TQSNDNNQNQ
QNEQGQNQTK
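
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. The example below is a hedged illustration, not the method used to produce this record. It builds a codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons), translates the first and last lines of the gene sequence, and provides a helper for GC fraction. The names `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional TCAG ordering. The amino acid assignments
# are those of the standard genetic code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGATAAAGA AAAAGAGCAT TATGAAAGTT CTTTGTGGGA CAATAGTTAG TACATTCATG"
last_line = "CAAAATGAAC AAGGACAAAA TCAAACAAAA TAA"

print(translate(first_line))  # MIKKKSIMKVLCGTIVSTFM
print(translate(last_line))   # QNEQGQNQTK
```

Both fragments match the corresponding ends of the protein sequence. Running `gc_fraction` on the full gene sequence would allow a comparison with the reported GC content.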