Gene CPR_2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2201 
Symbol 
ID4205951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2430446 
End bp2431429 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content27% 
IMG OID642566751 
Productalpha/beta fold family hydrolase 
Protein accessionYP_699501 
Protein GI110803972 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA ATATAAAAAT AAAGCCTATA AACGTAATTA GGAATATTTT TATATTTATA 
TTAGCTCTAG TATTTATAGG CTTTGCTTAT CAAATGATAT TAAATAAAAT TGATAGTAAA
AAAATAGAAC CTGATACCAA GTATGTAAGA ATTGATAGCA AGAAAAATTA TTATAATTTT
CAAGGAGAAA GCAAACCAAC TATAATAATG AGTTCTGATA TAGGCTTAGG GTTAAGTGAA
TGGAGTAAGG TTCAAGAGCT TATAGAAAAG GAATATGGTT ATAGAACTTT TTCTTATGAT
AGACCTGGAT ATGGTTTTTC AGAATCAGTA AAAGATGATG AAGTTAAAGA ACAAGCTCAG
CATCTTAGAA TGATTCTGAA AAAATCAGGG ATTGGTGGAC CATATATACT TGTTGGAGAA
GGATATGGTG GATTAGTAAT GTGTAACTTT GCAGAACTTT ATCCTGATTT AGTTCAAGGA
GTTATTCTTG TAGATCCAAT AAGTGAAGAA GCTTTAAGTG AAAATAAAGA TTATATGAAA
CAGTATTCAA GTCAAAAAAC TAGTAGATTT ATACAAAAGT GTGGTTCATA TTTTGGATTA
ACATCAATAA TGAATAAATT TGGTATGTTG AAAAATACAA ATGGCTTAAG AGAAAATTTA
AGTAATGAAA ATTTTAAGGT ATATAATATT TTAAGAACAA AAAGTGATTT TAATAGTGGA
TATTATAGTG AGCTTACAAA TATTTTAGAG CAAAATAGTA GTTCACAAAA ATCTGGTTTA
TTAAATGGTA AACCTTTGAG CATAATAGTT AATGATAATG CTTTTACTAA GGAGCAAGAG
AGTTTAAAGA AACTCACTTT AGACAATAAA GTTCAAATAA TAAATGCTAA GAATAAGACA
GATGTTATAC CTTTAGAAAA GCCAGAATTA TTTTTAGATA GCATAAGATT TATTCAAGAT
AATAGCTTGG AAGAGCAGAA TTAG
 
Protein sequence
MNKNIKIKPI NVIRNIFIFI LALVFIGFAY QMILNKIDSK KIEPDTKYVR IDSKKNYYNF 
QGESKPTIIM SSDIGLGLSE WSKVQELIEK EYGYRTFSYD RPGYGFSESV KDDEVKEQAQ
HLRMILKKSG IGGPYILVGE GYGGLVMCNF AELYPDLVQG VILVDPISEE ALSENKDYMK
QYSSQKTSRF IQKCGSYFGL TSIMNKFGML KNTNGLRENL SNENFKVYNI LRTKSDFNSG
YYSELTNILE QNSSSQKSGL LNGKPLSIIV NDNAFTKEQE SLKKLTLDNK VQIINAKNKT
DVIPLEKPEL FLDSIRFIQD NSLEEQN