Gene CPR_0877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0877 
Symbol 
ID4204180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1012783 
End bp1013931 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content32% 
IMG OID642565436 
Productsialidase 
Protein accessionYP_698202 
Protein GI110802436 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4409] Neuraminidase (sialidase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0420038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAACA AAAACAACAC CTTTGAAAAA AATCTAGATA TAAGCCATAA ACCAGAACCA 
CTAATACTAT TTAACAAGGA TAGTAACATA TGGAATTCAA AGTATTTTAG AATTCCCAAT
ATACAATTAT TAAATGATGG CACAATTTTA ACCTTTTCAG ATATTCGTTA TAATGGCCCT
GATGACCATG CTTATATAGA CATAGCTTCT GCACGCAGTA CTGATTTTGG AAAGACATGG
AGCTATGATG TAGCAATGAA AAATAATCGT ATTGACTCTA CTTATTCTCG TGTAATGGAC
TCCACAACAG TTATTACAAA TACAGGTAGA ATAATATTAA TTGCAGGCTC ATGGAATACA
AATGGAAACT GGGCAATGAC TACTTCTGCA AGAAGAAGTG ATTGGTCTGT TCAAATTATT
TATTCTGATG ATAATGGATT AACTTGGTCT AACAAAATAG ATTTAACCAA GGACTCATCA
AAAGTAAAAA ATCAACCAAG TAATACAATT GGATGGCTAG GAGGAGTTGG CTCAGGTATT
GTAATGGATG ATGGAACAAT AGTTATGCCA GCACAAATTT CCTTAAGAGA AAATAATGAA
AATAACTATT ATTCATTAAT TATCTATTCA AAGGATAATG GTGAAACATG GACAATGGGA
AACAAGGTTC CTAATTCAAA TACTTCTGAA AATATGGTCA TAGAATTAGA TGGAGCTTTA
ATTATGAGTA CAAGATATGA TTACTCTGGT TATAGGGCAG CATACATCTC TCACGATTTA
GGAAGCACCT GGGAAATATA TGAACCTTTA AACGGTAAAG TTTTAACTGG TAAGGGCTCT
GGATGCCAAG GTTCATTTAT TAAAGCTACT ACTTCAAATG GACATAGAAT AGGATTAATT
TCAGCACCTA AAAACACTAA AGGTGAATAT ATAAGAGACA ATATTGCTGT TTATATGATT
GACTTTGATG ATTTATCTAA AGGTGTTCAG GAAATATGTA TTCCTTACCC TAAAGATGGT
AACAAATTAG GCGGTGGCTA TTCTTGTCTA TCCTTTAAAA ATGACCATTT AGCCATTGTT
TATGAAGCCA ACGGAAATAT AGAATATCAA GACTTAACAC CTTATTACTT ACTAATTGAT
AAAGAATAA
 
Protein sequence
MRNKNNTFEK NLDISHKPEP LILFNKDSNI WNSKYFRIPN IQLLNDGTIL TFSDIRYNGP 
DDHAYIDIAS ARSTDFGKTW SYDVAMKNNR IDSTYSRVMD STTVITNTGR IILIAGSWNT
NGNWAMTTSA RRSDWSVQII YSDDNGLTWS NKIDLTKDSS KVKNQPSNTI GWLGGVGSGI
VMDDGTIVMP AQISLRENNE NNYYSLIIYS KDNGETWTMG NKVPNSNTSE NMVIELDGAL
IMSTRYDYSG YRAAYISHDL GSTWEIYEPL NGKVLTGKGS GCQGSFIKAT TSNGHRIGLI
SAPKNTKGEY IRDNIAVYMI DFDDLSKGVQ EICIPYPKDG NKLGGGYSCL SFKNDHLAIV
YEANGNIEYQ DLTPYYLLID KE