Gene CPR_1521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1521 
Symbol 
ID4206462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1706686 
End bp1707684 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content28% 
IMG OID642566074 
Productpolysaccharide deacetylase family protein 
Protein accessionYP_698839 
Protein GI110801790 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0654867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTG TGAGTAATCA ATTAAGAAGT GGAAAAAATA ATAAGTATTT AAAAAGAAGA 
ATAGCTGTAG CTATAGCTGG TATTGTAGTA ATTGCAGGAG TTGGTTTTGG AATTAAATCA
ATAGTTAGCA ATAAAGCTAA TAAAGAGGCT TTAGCATCAG CTAAAAAAGA AGAAGCTAAA
AAAGAAGAGG AAAAACCAAA ACCAACAGAC ATAATGCCAA ATGGGAATAT AATTTATGCT
GCAGATTCAT ATGCAGTTAG TGCAGATGAA GTTGAAAAAA TGCTTGAAGG AAAAGCTTCA
AATAATGATA AAGAAATATT TTTAACTTTT GATGATGGTC CAAGTGAAAA TACAAGAGAA
ATATTAAAGA TTTTAAAGGA AGAAGATGTA CATGCTACAT TTTTTGATAT AGGATCTGCC
TTAAAGGACA ATAAAGAAAA TCAAGAGTTA TTAAAACAAG AAATTGATCA AGGAAATGCA
GTAGCTGGTC ATAGTTTTTC ACATAATTAT AAAACATTAT ATCCAGGAAA TTCAGTTGAT
GTAAATAAGT TTATGAACGA ATTAAATGAA ACTAATGAAA TAATGAAAAG TGTATTAGGA
AAAAACTTTA ATGCTAGAGT AATTAGAATG CCAGGTGGAT ATATGTCTAG AAGATATTAT
AGAGATCCTA ACTTAAAAGC TTTAGATGAG GCTTTTGCAA AGGATAATAT AGTTAGTATA
GATTGGGATG CAGAAACTGG TGATGCAACT GGAAGACATT ATACAGTAGA GCAATATGTT
CAAAACTCAG CTAAAAATAT TAATACTTTA AATCATGTAA TTTTATTAAT GCATGATGCA
GCAGCTAAAA AAGAAACTGT ACAAGCATTA CCTGCAATAA TTAAATTCTA TAAAGAACAT
GGGTATGCAT TTAAAGTAAT AAAAAATTCA CCTGTAGGTG AAAAAAATTC TTCAGATTCA
ACTAAAGGTT CACAAAATAC TGATAATAAA ACAAAGTAA
 
Protein sequence
MNFVSNQLRS GKNNKYLKRR IAVAIAGIVV IAGVGFGIKS IVSNKANKEA LASAKKEEAK 
KEEEKPKPTD IMPNGNIIYA ADSYAVSADE VEKMLEGKAS NNDKEIFLTF DDGPSENTRE
ILKILKEEDV HATFFDIGSA LKDNKENQEL LKQEIDQGNA VAGHSFSHNY KTLYPGNSVD
VNKFMNELNE TNEIMKSVLG KNFNARVIRM PGGYMSRRYY RDPNLKALDE AFAKDNIVSI
DWDAETGDAT GRHYTVEQYV QNSAKNINTL NHVILLMHDA AAKKETVQAL PAIIKFYKEH
GYAFKVIKNS PVGEKNSSDS TKGSQNTDNK TK