Gene CPR_1755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1755 
SymboliscS 
ID4204924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1953558 
End bp1954754 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content32% 
IMG OID642566305 
Productcysteine desulfurase 
Protein accessionYP_699070 
Protein GI110802884 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00778501 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAATA GAGTTGTTTA TATGGACTAC TCAGCAACTA CATATGTTAA GCCAGAAGTA 
TTAGAGGAAA TGTTACCATA TTTCACAAAT AAGTTTGGAA ATCCATCAGC ATTTTACGGA
GTTTCAAGAG AATCAAGAAT GGCTGTAGAT ACTGCTAGAG AAAGAGTAGC TAAAGTATTA
AATGCTGATA CAAATGAAAT CTACTTTACT GGTGGTGGAT CAGAAGCAGA TAACTGGGCA
ATAAAAGGAA TAGCTTTTGC TCATAAAAAT AAAGGAAATC ATATAATAAC TACAAAAATA
GAACACCATG CTGTATTACA TACTTGCCAA TGGTTAGAAA AACAAGGCTT TGAAGTAACT
TACTTAGATG TAAATGAAGA AGGTTTTGTT GATTTAGAAG AATTAAAAAA TGCTATTACT
GATAAAACTA TCTTAGTTTC TGTAATGTTT GCAAACAATG AAATAGGAAC TATAGAGCCA
GTTAAGGAAA TAGGAAAAAT TTGTAGAGAA AGAAAAGTAA TATTCCATAC AGATGCAGTT
CAAGCTGTAG GAAATGTAAA GATAGATGTT AAAGATATGA ACATCGATTT ACTTTCATTA
GCTGGACATA AAGTTTATGG ACCAAAAGGA ATCGGAGCTT TATATATAAG AAAAGGTATA
AGAATAGATA ACTTAATCCA CGGTGGTGGT CAAGAGAGAG CTAGAAGAGC TGGAACTGAG
AACATACCTG CAATAGTTGG ATTAGGAAAG GCTATGGAAA TAGCTGGAGA GAACTTAGAT
GAGCATATAG CTAAAATTTC TAAGTTAAGA GATAAGTTAA TAAAAGGATT ATTAGAAGTA
CCATTTACAA GATTAAACGG ACCAAAAGAT GGTAGCAAGA GATTACCAGG TAACGTAAAT
GTATGCTTTG AATTTATTGA AGGTGAAGGA ATTCTTCTTT CATTAGACTT TGAAGGAATT
TGTGGTTCAA GTGGAAGTGC TTGTACATCA GGATCATTAG ATCCATCACA CGTGTTATTA
GCAATAGGTT TACCTCATGA AATAGCACAT GGATCATTAA GATTAAGTTT AGGTGAAGGT
ACAACTGAAG AAGATGTTGA TTACGTATTA GAAAAAGTAC CACCAATAAT CGCAAGATTA
AGAAGTATGT CACCATTATG GAAAAATCAT TTAAGAGAAG TAGAAGGAGA GAATTAA
 
Protein sequence
MNNRVVYMDY SATTYVKPEV LEEMLPYFTN KFGNPSAFYG VSRESRMAVD TARERVAKVL 
NADTNEIYFT GGGSEADNWA IKGIAFAHKN KGNHIITTKI EHHAVLHTCQ WLEKQGFEVT
YLDVNEEGFV DLEELKNAIT DKTILVSVMF ANNEIGTIEP VKEIGKICRE RKVIFHTDAV
QAVGNVKIDV KDMNIDLLSL AGHKVYGPKG IGALYIRKGI RIDNLIHGGG QERARRAGTE
NIPAIVGLGK AMEIAGENLD EHIAKISKLR DKLIKGLLEV PFTRLNGPKD GSKRLPGNVN
VCFEFIEGEG ILLSLDFEGI CGSSGSACTS GSLDPSHVLL AIGLPHEIAH GSLRLSLGEG
TTEEDVDYVL EKVPPIIARL RSMSPLWKNH LREVEGEN