Gene CPF_1802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1802 
Symbol 
ID4201961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2033922 
End bp2034920 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content29% 
IMG OID638082672 
Productpolysaccharide deacetylase family protein 
Protein accessionYP_696236 
Protein GI110800487 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.12203 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTG TGAGTAACAA AGTAAGAAAT GGAAAACATA ATAAGTATTT AAAAAGAAGA 
ATAGCAGTTG CTGTAGCTGG TATTGTAGTA ATTGCAGGAG TAGGTTTTGG AATTAAATCA
CTAGTTAGCA ATAAAGCTAA TAAGGAGGCT TTAGCATCAG CTAAGGAAGA GGAAACTAAA
AAAGAAGAGG AAAAACCAAA ACCAACAGAC ATAATGCCAA ATGGTAATAT AATTTATGCT
GCAGATTCCT ATGCAGTTAG TGCAGATGAA GTTGAAAAAA TGCTTGAAGG AAAAGCTTCA
AATAATGATA AAGAAATATT TTTAACTTTT GATGATGGTC CAAGTGAAAA TACAAGGGAA
ATATTAAAGA TTTTAAAGGA AGAAGATGTA CATGCTACAT TTTTTGATAT AGGATCTGCC
TTAAAGGATA ATAAAGAAAA TCAAGAGTTA TTAAAACAAG AAATTGATCA AGGAAATGCA
GTAGCTGGTC ATAGTTTTTC ACATAATTAT AAAACATTGT ATCCAGGAAA TTCAGTTGAT
GTAAATAAAT TTATGAGCGA ATTAAATGAA ACTAATGAAA TAATGAAAAG CGTATTAGGA
AAAAACTTTA ATGCTAGAGT AATTAGAATG CCAGGTGGAT ATATGTCTAG AAGATATTAT
AGAGATCCTA ACTTAAAAGC TTTAGATGAG GCTTTTGCAA AGGATAATAT AGTTAGTATA
GATTGGGATG CAGAAACTGG TGATGCAACT GGAAGACATT ATACAGTAGA GCAATATGTT
GAAAACTCAG CTAAAAATAT TAATACTTTA AATCATGTAA TTTTATTAAT GCATGATGCA
GCAGCTAAGA AAGAAACTGT ACAAGCATTA CCTGCAATAA TTAAATTCTA TAAAGAACAT
GGGTATGCAT TTAAAGTAAT AAAAAATACA CCTGTAGGTG AAAATAATTC TTCAGATGCA
ACTAATAGTT CACAAAATAC TGACAATAAA ACAAAGTAA
 
Protein sequence
MNFVSNKVRN GKHNKYLKRR IAVAVAGIVV IAGVGFGIKS LVSNKANKEA LASAKEEETK 
KEEEKPKPTD IMPNGNIIYA ADSYAVSADE VEKMLEGKAS NNDKEIFLTF DDGPSENTRE
ILKILKEEDV HATFFDIGSA LKDNKENQEL LKQEIDQGNA VAGHSFSHNY KTLYPGNSVD
VNKFMSELNE TNEIMKSVLG KNFNARVIRM PGGYMSRRYY RDPNLKALDE AFAKDNIVSI
DWDAETGDAT GRHYTVEQYV ENSAKNINTL NHVILLMHDA AAKKETVQAL PAIIKFYKEH
GYAFKVIKNT PVGENNSSDA TNSSQNTDNK TK