Gene CPF_2983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2983 
Symbol 
ID4201704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp3243918 
End bp3245078 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content30% 
IMG OID638083850 
Productcysteine desulfurase family protein 
Protein accessionYP_697337 
Protein GI110800660 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01977] cysteine desulfurase family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA TATATTTTGA TAATGCAGCA ACTACTTTCC CTAAACCTGA CTCTGTAATA 
AAAGCTATGT TTGATTATAT GAGTTTTGAA GGCGGAAGTG CTAATAGAGG ATCCTCATCT
ACAGCTCTAC AAAGTAGTAG AGCTGTCTAT GAATGTAGAT ATGAAATAGC TAAATTCTTT
AATTTTCCTA AAAGTGAAAA TGTTATTTTC ACAAATAATA TTACAACATC ATTAAATATG
TTACTTTTAG GAATAATTAA ATCTGATTGG CATATAATTA CTACATCTAT GGAACATAAT
TCTGTCTTAA GACCTTTAGT AAAAATTAGC GAGGAGCTTC CTAATGTAGA ACTAGATATA
GTTCAATGTA ATAATGAAGG TTTAGTGTCA GTTGAAAAGA TAAAAGAAAA AATAAAAAAT
AACACAAAGC TTATAATTTT ATCTCATGCA TCAAACCTAG TTGGAACAAT TCAACCAATT
AAAGAAATAG GGAAGCTTTG TAAAGAAAAT GATATCTTTT TTATTTTAGA TTCTGCTCAA
ACAGCAGGGG TTATTCCAAT TGATATGACT GAACTTAATT TAAATGCATT AGCCTTTACA
GGTCATAAGT CTCTTTTAGG ACCTCAAGGA ATAGGTGGTT TTATTATAGA TGATAAATTA
AATTCTATAT GTAAAAATAT CTTTTCTGGC GGAACAGGAA GTAATTCCTC ACTAATAGAA
CATCCTCAAG AATTGCCTGA TAAATTTGAA TATGGAACTT TAAACACTCC AGGAATAATA
GGGCTTCTAG AGGGAATAAA ATTCATAGAA AAAGAAGGCA TTGAAAATAT AAAAGCAAAA
GAAGAAGTAT TATGCCAAAA AGCTATGGAT TTATTATGTG AAATTCCAGA AGTTAAGATT
TATGGTTCTA TGGATGCCAA AAAGAAAACT TCAACAATAT CTTTCAATAT AGAAGGTATA
GATCCTGAAT TTGCGGGATT CTTGTTAGAT AGTGAATTTA ACATAACATG TAGGACAGGA
ATTCATTGTA CTCCACTTGC TCATAAGACA GTTGGTTCAT ATCCAGCTGG AAGCATAAGA
ATAAGCTTAG GGTACTTTAA TACAATAGAA GAAGTCTATA GATTTGTTGA GGTTATAAAA
GAATTAATTT CAAGGAGGTA G
 
Protein sequence
MNKIYFDNAA TTFPKPDSVI KAMFDYMSFE GGSANRGSSS TALQSSRAVY ECRYEIAKFF 
NFPKSENVIF TNNITTSLNM LLLGIIKSDW HIITTSMEHN SVLRPLVKIS EELPNVELDI
VQCNNEGLVS VEKIKEKIKN NTKLIILSHA SNLVGTIQPI KEIGKLCKEN DIFFILDSAQ
TAGVIPIDMT ELNLNALAFT GHKSLLGPQG IGGFIIDDKL NSICKNIFSG GTGSNSSLIE
HPQELPDKFE YGTLNTPGII GLLEGIKFIE KEGIENIKAK EEVLCQKAMD LLCEIPEVKI
YGSMDAKKKT STISFNIEGI DPEFAGFLLD SEFNITCRTG IHCTPLAHKT VGSYPAGSIR
ISLGYFNTIE EVYRFVEVIK ELISRR