Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_2661 |
Symbol | |
ID | 4206386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 2884243 |
End bp | 2885403 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642567209 |
Product | cysteine desulfurase family protein |
Protein accession | YP_699896 |
Protein GI | 110803551 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01977] cysteine desulfurase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.146987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA TATATTTTGA TAATGCAGCA ACTACTTTCC CTAAACCTGA CTCTGTAATA AAAGCTATGT TTGATTATAT GAGTTTTGAA GGCGGAAGTG CTAATAGAGG ATCCTCATCT ACAGCTCTAC AAAGTAGTAG AGCTGTCTAT GAATGTAGAT ATGAAATAGC TAAATTCTTT AATTTTCCTA AAAGTGAAAA TGTTATTTTC ACAAATAATA TTACAACATC ATTAAATATG TTACTTTTAG GAATAATTAA ATCTGATTGG CATATAATTA CTACATCTAT GGAACATAAT TCTGTCTTAA GACCTTTAGT AAAAATTAGC GAGGAGCTTC CTAATGTAGA ACTAGATATA GTTCAATGTA ATAATGAAGG TTTAGTGTCA GTTGAAAAGA TAAAAGAAAA AATAAAAAAT AACACAAAGC TTATAATTTT ATCTCATGCA TCAAACCTAG TTGGAACAAT TCAACCAATT AAAGAAATAG GGAAGCTTTG TAAAGAAAAT GATATCTTTT TTATTTTAGA TTCTGCTCAA ACAGCAGGGG TTATTCCAAT TGATATGACT GAACTTAATT TAAATGCATT AGCCTTTACA GGTCATAAGT CTCTTTTAGG ACCTCAAGGA ATAGGTGGTT TTATTATAGA TGATAAATTA AATTCTATAT GTAAAAATAT CTTTTCTGGC GGAACAGGAA GTAATTCATC ACTAATAGAA CATCCTCAAG AATTGCCTGA TAAATTTGAA TATGGAACTT TAAACACTCC AGGAATAATA GGGCTTCTAG AGGGAATAAA ATTCATAGAA AAAGAAGGCA TTGAAAATAT AAAAGCAAAA GAAGAAGTAT TATGCCAAAA AGCTATGGAT TTATTATGTG AAATTCCAGA AGTTAAGATT TATGGTCCTA TGGATGCCAA AAAGAAAACT TCAACAATAT CTTTCAATAT AGAAGGTATG GATCCTGAAT TTACAGGATT CTTGTTAGAT AGTGAATTTA ACATAACATG TAGGACAGGA ATTCATTGTA CTCCACTTGC TCATAAGACA GTTGGTTCAT ATCCAGCTGG AAGCATAAGA ATAAGCTTAG GGTACTTTAA TACAATAGAA GAAGTCTATA GATTTGTTGA GGTTATAAAA GAATTAATTT CAAGGAGGTA G
|
Protein sequence | MNKIYFDNAA TTFPKPDSVI KAMFDYMSFE GGSANRGSSS TALQSSRAVY ECRYEIAKFF NFPKSENVIF TNNITTSLNM LLLGIIKSDW HIITTSMEHN SVLRPLVKIS EELPNVELDI VQCNNEGLVS VEKIKEKIKN NTKLIILSHA SNLVGTIQPI KEIGKLCKEN DIFFILDSAQ TAGVIPIDMT ELNLNALAFT GHKSLLGPQG IGGFIIDDKL NSICKNIFSG GTGSNSSLIE HPQELPDKFE YGTLNTPGII GLLEGIKFIE KEGIENIKAK EEVLCQKAMD LLCEIPEVKI YGPMDAKKKT STISFNIEGM DPEFTGFLLD SEFNITCRTG IHCTPLAHKT VGSYPAGSIR ISLGYFNTIE EVYRFVEVIK ELISRR
|
| |