Gene CPR_1245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1245 
Symbol 
ID4206580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1397214 
End bp1400123 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content30% 
IMG OID642565801 
Productmannosyl-glycoprotein endo-beta-N-acetylglucosamidase domain-containing protein 
Protein accessionYP_698567 
Protein GI110801810 
COG category[G] Carbohydrate transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG3103] SH3 domain protein
[COG4193] Beta- N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0232944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGAA ATAGATTAAG CTGTTTTATA GTAGGTGCTG TAATTGGAGC TGGAGCAATA 
GTTTGTACAA CTAATACTAA AGTACATGCG AAACCAGTGA ATGAGGTTAA AAACATTAAT
ACTTCTAAAG GAAATTCGTT TGGAGAAATT ATTTCATCAG AAGATTTAGG ATTAAGAAAA
GGTGCAGATT CTTCTCATGA AATAATAACT TCAATACCTA GTGGCGCGAG AGTTAACATA
ATAGACAAGG TATCTGATAA CTGGTATAAA GTTGGTTATA AAGATTTTGT AGGGTATGTA
GAAGCTAAGG ATATAAGAAT ACTAGGATAT AATTTAAATC AAGATAATGT TGCTTTAATT
TCTGCTAATC AATTAAATGT TAGAACTAGC CCAAATGAAA ATGGGCAAGT GATCGGAACT
TTATATAAAA ATAATAAGGT AAATGTATTA GATAAATCAA TTGATGGTTG GTATAAAATT
GACTTTAATG GTAGAAGAGC ATATGTATCT AGTAAATATG TTAATTTGAT TTCATATAAA
AATAATGAAG TTAAGACAGA AGTGAAAAAA GACCCAATTG AAGGAACAGG TAAGGTTAAT
ATAAATACAG CTTTAAATGT TAGACAAGCT TCTACTACAA GTAGTAGAAT TATTGGTAGC
TTAAAAGGTG GGGAAAAAGT TAACATAATA AATGAAAGTA ATGGATTTTA TAAAATAGAG
TTTAATAATT CATATGGCTA TGTTTATTCT AAATACATAT CAAAAGATGG AGGCGATGAA
AAGGCTCAAA TTGTAAAACA AGAAGAAGTT AAAAAAGAGA AAGTTGATGA ATCTAAAAAA
GAAGCTAAAT CTACTACTAA AGCAGAACCT ATAGTTTTTG CTATTAGATA TCTTAACAAG
ACAGGAATAG TAAATGTAAG TAGCTCCCTA AACGTAAGAG AAGGAGCAAG TACAAGTAGT
AAAGTTATAG GAAGCTTAAG CGGAAACTCA AAAGTAACAA TAGTAGGAGA AGAAGGAGCT
TTCTATAAAA TAGAATACAA GGGTTCACGT GGATATGTAG CTAAGGAATA TGTTAAAGAT
GTTACAGAAA ATAGTAATAG TAACCAGGGT ACACAAACTC CAGAAAAACC AAGTATTCCT
GAAAATACTG AAAAAATAGG GATAGTAAAT GTAAGCAGTT CTCTAAACGT AAGAGAAAGA
GCAAGTATAA GTAGTAAAGT TATAGGAAGC TTAAGCGGAA ACTCAAAAGT AACAATAGTA
GGAGAAGAAG GAGCTTTCTA TAAAATAGAA TACAAGGGTT CACGTGGATA TGTAGCTAAG
GAATATGTTA AAGATGTTAC AGAAAATAGT AATAGTAACC AGGGTACACA AACTCCAGAA
AAACCAAGTA TTCCTGAAAA TACTGAAAAA ATAGGGATAG TAAATGTAAG CAGTTCTCTA
AACGTAAGAG AAAGAGCAAG TATAAGTAGT AAAGTTATAG GAAGCTTAAG CAGAAACACA
AAGGTAACAA TAGTAGGAGA AGAAGGAGCT TTCTATAAAA TAGAATATAA AGGTTCTCAT
GGATATGTAG CTAAGGAATA TGTTAAAGAT GTTACAGAAA ATAGTAATAG TAACCAGGGT
ACACAAACTC CAGAAAAACC AAGTATTCCT GAAAATACTG AAAAAACAGG AATAGTAAAT
GTAAGCAGTT CTCTAAACGT AAGAGAAAGA GCAAGTACAA GTAGTAAAGT TATAGGAAGC
CTAAGCGGAA ACACAAAGGT AACAATAGTA GGAGAAGAAG GAGCATTCTA TAAAATAGAA
TATAGGGGTT CACATGGATA TGTAGCTAAG GAATATGTTA AAGATGTTAC AGAAAGTAAT
AATAGTAACC AAGGTACACA AACTCCAGAA AAACCAAGTA TTCCTGAAAA TAGTAAGAAA
ACTGGCGTTG TAACTGCATC TAAAGGATTA AATGTAAGAA AAGAAGCTAA TACTTCATCT
AAAATTATTG GAATTTTAAA TAGTGGAGAA AGTGTTGAAA TAATAGGAGA AGAAAATGGT
TTCTATAAGA TAACTTATAA AGGACAAGAA GCTTATGCAT CTAAAAATTA TATAAATATT
TTTAATAGTA ACTCAAATGT TAATCCAGGA TTAGATATAG GAAATGCTTC AAAAACAAAT
TATGGAGTAT CACTTAACGA ATATATAAAA TTACAACAAA GAAATAACCC TTCAAATTAT
TCGCATTCAG AATTAGAAAA ATATATAAAT CCAGCTAAAG CTACTAATAA GCTACAGTTT
TTAAGAATAG ATAAATTTAG ATCAGTTAAT GTAAGTGGAT TAAGTAGTAG ATTAAGTAAC
AAAGGTGTTT TAACAGGACA AGGACAAGCT TTTATAAATG CTGCTAAAGC CTTTAACATA
GATCCTATTT ACTTAGTTTC TCAGTGCTTG CATGAAACTG GTAATGGAAC AAGCAAACTT
GCAAAGGGCG TAACAATTAC TGAAATTGCA GATGAAAGTA GACCTATATA TAATGGTACT
GGTCAATTAG TAGGATATCA TATGATTAAA TTATCTAAGC CAGTAACAGT TTATAATTTA
TTTGGAATAG GGGCTAAGGA TAATTCATCA GTTTTCCCAA ATAGAGCTTT AATATTAGGA
ACAACATATG CTTATAATAG AGGTTGGACA AGTATTGAAA ATGCTATAAA GGGTGCTGCA
GAATTTGTTT CATTAAATTA TGTTCATAGT TCAAGATATA GTCAAAATAC TCTTTATAAG
ATGAGATATA ATCAAAATGT ATCAAATATA TGGCATCAAT ATGCTACAAC ACCATGGTAT
GCATCAAGTA TTGCTGATAT TATGAGGAGT TATCAAGATT TATATTTAGA AAATAATTTC
ACATTTGATG TACCTGTTTT TGAAGGATAA
 
Protein sequence
MNRNRLSCFI VGAVIGAGAI VCTTNTKVHA KPVNEVKNIN TSKGNSFGEI ISSEDLGLRK 
GADSSHEIIT SIPSGARVNI IDKVSDNWYK VGYKDFVGYV EAKDIRILGY NLNQDNVALI
SANQLNVRTS PNENGQVIGT LYKNNKVNVL DKSIDGWYKI DFNGRRAYVS SKYVNLISYK
NNEVKTEVKK DPIEGTGKVN INTALNVRQA STTSSRIIGS LKGGEKVNII NESNGFYKIE
FNNSYGYVYS KYISKDGGDE KAQIVKQEEV KKEKVDESKK EAKSTTKAEP IVFAIRYLNK
TGIVNVSSSL NVREGASTSS KVIGSLSGNS KVTIVGEEGA FYKIEYKGSR GYVAKEYVKD
VTENSNSNQG TQTPEKPSIP ENTEKIGIVN VSSSLNVRER ASISSKVIGS LSGNSKVTIV
GEEGAFYKIE YKGSRGYVAK EYVKDVTENS NSNQGTQTPE KPSIPENTEK IGIVNVSSSL
NVRERASISS KVIGSLSRNT KVTIVGEEGA FYKIEYKGSH GYVAKEYVKD VTENSNSNQG
TQTPEKPSIP ENTEKTGIVN VSSSLNVRER ASTSSKVIGS LSGNTKVTIV GEEGAFYKIE
YRGSHGYVAK EYVKDVTESN NSNQGTQTPE KPSIPENSKK TGVVTASKGL NVRKEANTSS
KIIGILNSGE SVEIIGEENG FYKITYKGQE AYASKNYINI FNSNSNVNPG LDIGNASKTN
YGVSLNEYIK LQQRNNPSNY SHSELEKYIN PAKATNKLQF LRIDKFRSVN VSGLSSRLSN
KGVLTGQGQA FINAAKAFNI DPIYLVSQCL HETGNGTSKL AKGVTITEIA DESRPIYNGT
GQLVGYHMIK LSKPVTVYNL FGIGAKDNSS VFPNRALILG TTYAYNRGWT SIENAIKGAA
EFVSLNYVHS SRYSQNTLYK MRYNQNVSNI WHQYATTPWY ASSIADIMRS YQDLYLENNF
TFDVPVFEG