Gene CPR_2661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2661 
Symbol 
ID4206386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2884243 
End bp2885403 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content30% 
IMG OID642567209 
Productcysteine desulfurase family protein 
Protein accessionYP_699896 
Protein GI110803551 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01977] cysteine desulfurase family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.146987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA TATATTTTGA TAATGCAGCA ACTACTTTCC CTAAACCTGA CTCTGTAATA 
AAAGCTATGT TTGATTATAT GAGTTTTGAA GGCGGAAGTG CTAATAGAGG ATCCTCATCT
ACAGCTCTAC AAAGTAGTAG AGCTGTCTAT GAATGTAGAT ATGAAATAGC TAAATTCTTT
AATTTTCCTA AAAGTGAAAA TGTTATTTTC ACAAATAATA TTACAACATC ATTAAATATG
TTACTTTTAG GAATAATTAA ATCTGATTGG CATATAATTA CTACATCTAT GGAACATAAT
TCTGTCTTAA GACCTTTAGT AAAAATTAGC GAGGAGCTTC CTAATGTAGA ACTAGATATA
GTTCAATGTA ATAATGAAGG TTTAGTGTCA GTTGAAAAGA TAAAAGAAAA AATAAAAAAT
AACACAAAGC TTATAATTTT ATCTCATGCA TCAAACCTAG TTGGAACAAT TCAACCAATT
AAAGAAATAG GGAAGCTTTG TAAAGAAAAT GATATCTTTT TTATTTTAGA TTCTGCTCAA
ACAGCAGGGG TTATTCCAAT TGATATGACT GAACTTAATT TAAATGCATT AGCCTTTACA
GGTCATAAGT CTCTTTTAGG ACCTCAAGGA ATAGGTGGTT TTATTATAGA TGATAAATTA
AATTCTATAT GTAAAAATAT CTTTTCTGGC GGAACAGGAA GTAATTCATC ACTAATAGAA
CATCCTCAAG AATTGCCTGA TAAATTTGAA TATGGAACTT TAAACACTCC AGGAATAATA
GGGCTTCTAG AGGGAATAAA ATTCATAGAA AAAGAAGGCA TTGAAAATAT AAAAGCAAAA
GAAGAAGTAT TATGCCAAAA AGCTATGGAT TTATTATGTG AAATTCCAGA AGTTAAGATT
TATGGTCCTA TGGATGCCAA AAAGAAAACT TCAACAATAT CTTTCAATAT AGAAGGTATG
GATCCTGAAT TTACAGGATT CTTGTTAGAT AGTGAATTTA ACATAACATG TAGGACAGGA
ATTCATTGTA CTCCACTTGC TCATAAGACA GTTGGTTCAT ATCCAGCTGG AAGCATAAGA
ATAAGCTTAG GGTACTTTAA TACAATAGAA GAAGTCTATA GATTTGTTGA GGTTATAAAA
GAATTAATTT CAAGGAGGTA G
 
Protein sequence
MNKIYFDNAA TTFPKPDSVI KAMFDYMSFE GGSANRGSSS TALQSSRAVY ECRYEIAKFF 
NFPKSENVIF TNNITTSLNM LLLGIIKSDW HIITTSMEHN SVLRPLVKIS EELPNVELDI
VQCNNEGLVS VEKIKEKIKN NTKLIILSHA SNLVGTIQPI KEIGKLCKEN DIFFILDSAQ
TAGVIPIDMT ELNLNALAFT GHKSLLGPQG IGGFIIDDKL NSICKNIFSG GTGSNSSLIE
HPQELPDKFE YGTLNTPGII GLLEGIKFIE KEGIENIKAK EEVLCQKAMD LLCEIPEVKI
YGPMDAKKKT STISFNIEGM DPEFTGFLLD SEFNITCRTG IHCTPLAHKT VGSYPAGSIR
ISLGYFNTIE EVYRFVEVIK ELISRR