Gene CPF_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2039 
SymboliscS 
ID4202521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2279431 
End bp2280627 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content33% 
IMG OID638082906 
Productcysteine desulfurase 
Protein accessionYP_696470 
Protein GI110800678 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAATA GAGTTGTTTA TATGGACTAC TCAGCAACTA CATATGTTAA GCCAGAAGTA 
TTAGAGGAAA TGTTACCATA TTTCACAAAT AAGTTTGGAA ATCCATCAGC ATTTTACGGA
GTTTCAAGAG AATCAAGAAT GGCTGTAGAC ACTGCTAGAG AAAGAGTAGC TAAAGTATTA
AATGCTGATA CAAATGAAAT CTACTTTACT GGTGGCGGAT CAGAAGCAGA TAACTGGGCA
ATAAAAGGAA TAGCTTTTGC TCATAAAAAT AAAGGAAATC ATATAATAAC TACAAAAATA
GAGCACCATG CTGTATTACA TACTTGCCAA TGGTTAGAAA AACAAGGCTT TGAAGTAACT
TACTTAGATG TAAATGAAGA AGGTTTTGTT GATTTAGAAG AATTAAAAAA TGCTATTACT
GATAAAACTA TCTTAGTTTC TGTAATGTTT GCAAACAATG AAATAGGAAC TATAGAGCCA
GTTAAGGAAA TAGGAAAAAT TTGTAGAGAA AGAAAAGTAA TATTCCATAC AGATGCAGTT
CAAGCTGTAG GAAATGTAAA GATAGATGTT AAAGATATGA ACATCGATTT ACTTTCATTA
GCTGGACATA AAGTTTATGG ACCAAAAGGA ATCGGAGCTT TATATATAAG AAAAGGTATA
AGAATAGATA ACTTAATCCA CGGTGGTGGT CAAGAGAGAG CTAGAAGAGC TGGAACTGAG
AACATACCTG CAATAGTTGG ATTAGGAAAG GCTATGGAAA TAGCTGGAGA AAACTTAGAT
GAGCATATAG CTAAAATTTC TAAGTTAAGA GATAAGTTAA TAAAAGGATT ATTAGAAGTA
CCATTTACAA GATTAAATGG ACCAAAAGAT GGTAGCAAGA GATTACCAGG TAACGTAAAT
GTATGCTTTG AATTCATTGA AGGTGAAGGA ATTCTTCTTT CATTAGACTT TGAAGGAATT
TGTGGTTCAA GTGGAAGTGC TTGTACATCA GGATCATTAG ATCCATCACA CGTGTTATTA
GCAATAGGTT TACCTCATGA AATAGCACAC GGATCATTAA GATTAAGTTT AGGTGAAGGT
ACAACTGAAG AAGACGTTGA TTACGTATTA GAAAAAGTAC CACCAATAAT CGCAAGATTA
AGAAGTATGT CACCATTATG GAAAAATCAT TTAAGAGAAG TAGAAGGAGA GAATTAA
 
Protein sequence
MNNRVVYMDY SATTYVKPEV LEEMLPYFTN KFGNPSAFYG VSRESRMAVD TARERVAKVL 
NADTNEIYFT GGGSEADNWA IKGIAFAHKN KGNHIITTKI EHHAVLHTCQ WLEKQGFEVT
YLDVNEEGFV DLEELKNAIT DKTILVSVMF ANNEIGTIEP VKEIGKICRE RKVIFHTDAV
QAVGNVKIDV KDMNIDLLSL AGHKVYGPKG IGALYIRKGI RIDNLIHGGG QERARRAGTE
NIPAIVGLGK AMEIAGENLD EHIAKISKLR DKLIKGLLEV PFTRLNGPKD GSKRLPGNVN
VCFEFIEGEG ILLSLDFEGI CGSSGSACTS GSLDPSHVLL AIGLPHEIAH GSLRLSLGEG
TTEEDVDYVL EKVPPIIARL RSMSPLWKNH LREVEGEN