Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1245 |
Symbol | |
ID | 4206580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1397214 |
End bp | 1400123 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642565801 |
Product | mannosyl-glycoprotein endo-beta-N-acetylglucosamidase domain-containing protein |
Protein accession | YP_698567 |
Protein GI | 110801810 |
COG category | [G] Carbohydrate transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG3103] SH3 domain protein [COG4193] Beta- N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0232944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGAA ATAGATTAAG CTGTTTTATA GTAGGTGCTG TAATTGGAGC TGGAGCAATA GTTTGTACAA CTAATACTAA AGTACATGCG AAACCAGTGA ATGAGGTTAA AAACATTAAT ACTTCTAAAG GAAATTCGTT TGGAGAAATT ATTTCATCAG AAGATTTAGG ATTAAGAAAA GGTGCAGATT CTTCTCATGA AATAATAACT TCAATACCTA GTGGCGCGAG AGTTAACATA ATAGACAAGG TATCTGATAA CTGGTATAAA GTTGGTTATA AAGATTTTGT AGGGTATGTA GAAGCTAAGG ATATAAGAAT ACTAGGATAT AATTTAAATC AAGATAATGT TGCTTTAATT TCTGCTAATC AATTAAATGT TAGAACTAGC CCAAATGAAA ATGGGCAAGT GATCGGAACT TTATATAAAA ATAATAAGGT AAATGTATTA GATAAATCAA TTGATGGTTG GTATAAAATT GACTTTAATG GTAGAAGAGC ATATGTATCT AGTAAATATG TTAATTTGAT TTCATATAAA AATAATGAAG TTAAGACAGA AGTGAAAAAA GACCCAATTG AAGGAACAGG TAAGGTTAAT ATAAATACAG CTTTAAATGT TAGACAAGCT TCTACTACAA GTAGTAGAAT TATTGGTAGC TTAAAAGGTG GGGAAAAAGT TAACATAATA AATGAAAGTA ATGGATTTTA TAAAATAGAG TTTAATAATT CATATGGCTA TGTTTATTCT AAATACATAT CAAAAGATGG AGGCGATGAA AAGGCTCAAA TTGTAAAACA AGAAGAAGTT AAAAAAGAGA AAGTTGATGA ATCTAAAAAA GAAGCTAAAT CTACTACTAA AGCAGAACCT ATAGTTTTTG CTATTAGATA TCTTAACAAG ACAGGAATAG TAAATGTAAG TAGCTCCCTA AACGTAAGAG AAGGAGCAAG TACAAGTAGT AAAGTTATAG GAAGCTTAAG CGGAAACTCA AAAGTAACAA TAGTAGGAGA AGAAGGAGCT TTCTATAAAA TAGAATACAA GGGTTCACGT GGATATGTAG CTAAGGAATA TGTTAAAGAT GTTACAGAAA ATAGTAATAG TAACCAGGGT ACACAAACTC CAGAAAAACC AAGTATTCCT GAAAATACTG AAAAAATAGG GATAGTAAAT GTAAGCAGTT CTCTAAACGT AAGAGAAAGA GCAAGTATAA GTAGTAAAGT TATAGGAAGC TTAAGCGGAA ACTCAAAAGT AACAATAGTA GGAGAAGAAG GAGCTTTCTA TAAAATAGAA TACAAGGGTT CACGTGGATA TGTAGCTAAG GAATATGTTA AAGATGTTAC AGAAAATAGT AATAGTAACC AGGGTACACA AACTCCAGAA AAACCAAGTA TTCCTGAAAA TACTGAAAAA ATAGGGATAG TAAATGTAAG CAGTTCTCTA AACGTAAGAG AAAGAGCAAG TATAAGTAGT AAAGTTATAG GAAGCTTAAG CAGAAACACA AAGGTAACAA TAGTAGGAGA AGAAGGAGCT TTCTATAAAA TAGAATATAA AGGTTCTCAT GGATATGTAG CTAAGGAATA TGTTAAAGAT GTTACAGAAA ATAGTAATAG TAACCAGGGT ACACAAACTC CAGAAAAACC AAGTATTCCT GAAAATACTG AAAAAACAGG AATAGTAAAT GTAAGCAGTT CTCTAAACGT AAGAGAAAGA GCAAGTACAA GTAGTAAAGT TATAGGAAGC CTAAGCGGAA ACACAAAGGT AACAATAGTA GGAGAAGAAG GAGCATTCTA TAAAATAGAA TATAGGGGTT CACATGGATA TGTAGCTAAG GAATATGTTA AAGATGTTAC AGAAAGTAAT AATAGTAACC AAGGTACACA AACTCCAGAA AAACCAAGTA TTCCTGAAAA TAGTAAGAAA ACTGGCGTTG TAACTGCATC TAAAGGATTA AATGTAAGAA AAGAAGCTAA TACTTCATCT AAAATTATTG GAATTTTAAA TAGTGGAGAA AGTGTTGAAA TAATAGGAGA AGAAAATGGT TTCTATAAGA TAACTTATAA AGGACAAGAA GCTTATGCAT CTAAAAATTA TATAAATATT TTTAATAGTA ACTCAAATGT TAATCCAGGA TTAGATATAG GAAATGCTTC AAAAACAAAT TATGGAGTAT CACTTAACGA ATATATAAAA TTACAACAAA GAAATAACCC TTCAAATTAT TCGCATTCAG AATTAGAAAA ATATATAAAT CCAGCTAAAG CTACTAATAA GCTACAGTTT TTAAGAATAG ATAAATTTAG ATCAGTTAAT GTAAGTGGAT TAAGTAGTAG ATTAAGTAAC AAAGGTGTTT TAACAGGACA AGGACAAGCT TTTATAAATG CTGCTAAAGC CTTTAACATA GATCCTATTT ACTTAGTTTC TCAGTGCTTG CATGAAACTG GTAATGGAAC AAGCAAACTT GCAAAGGGCG TAACAATTAC TGAAATTGCA GATGAAAGTA GACCTATATA TAATGGTACT GGTCAATTAG TAGGATATCA TATGATTAAA TTATCTAAGC CAGTAACAGT TTATAATTTA TTTGGAATAG GGGCTAAGGA TAATTCATCA GTTTTCCCAA ATAGAGCTTT AATATTAGGA ACAACATATG CTTATAATAG AGGTTGGACA AGTATTGAAA ATGCTATAAA GGGTGCTGCA GAATTTGTTT CATTAAATTA TGTTCATAGT TCAAGATATA GTCAAAATAC TCTTTATAAG ATGAGATATA ATCAAAATGT ATCAAATATA TGGCATCAAT ATGCTACAAC ACCATGGTAT GCATCAAGTA TTGCTGATAT TATGAGGAGT TATCAAGATT TATATTTAGA AAATAATTTC ACATTTGATG TACCTGTTTT TGAAGGATAA
|
Protein sequence | MNRNRLSCFI VGAVIGAGAI VCTTNTKVHA KPVNEVKNIN TSKGNSFGEI ISSEDLGLRK GADSSHEIIT SIPSGARVNI IDKVSDNWYK VGYKDFVGYV EAKDIRILGY NLNQDNVALI SANQLNVRTS PNENGQVIGT LYKNNKVNVL DKSIDGWYKI DFNGRRAYVS SKYVNLISYK NNEVKTEVKK DPIEGTGKVN INTALNVRQA STTSSRIIGS LKGGEKVNII NESNGFYKIE FNNSYGYVYS KYISKDGGDE KAQIVKQEEV KKEKVDESKK EAKSTTKAEP IVFAIRYLNK TGIVNVSSSL NVREGASTSS KVIGSLSGNS KVTIVGEEGA FYKIEYKGSR GYVAKEYVKD VTENSNSNQG TQTPEKPSIP ENTEKIGIVN VSSSLNVRER ASISSKVIGS LSGNSKVTIV GEEGAFYKIE YKGSRGYVAK EYVKDVTENS NSNQGTQTPE KPSIPENTEK IGIVNVSSSL NVRERASISS KVIGSLSRNT KVTIVGEEGA FYKIEYKGSH GYVAKEYVKD VTENSNSNQG TQTPEKPSIP ENTEKTGIVN VSSSLNVRER ASTSSKVIGS LSGNTKVTIV GEEGAFYKIE YRGSHGYVAK EYVKDVTESN NSNQGTQTPE KPSIPENSKK TGVVTASKGL NVRKEANTSS KIIGILNSGE SVEIIGEENG FYKITYKGQE AYASKNYINI FNSNSNVNPG LDIGNASKTN YGVSLNEYIK LQQRNNPSNY SHSELEKYIN PAKATNKLQF LRIDKFRSVN VSGLSSRLSN KGVLTGQGQA FINAAKAFNI DPIYLVSQCL HETGNGTSKL AKGVTITEIA DESRPIYNGT GQLVGYHMIK LSKPVTVYNL FGIGAKDNSS VFPNRALILG TTYAYNRGWT SIENAIKGAA EFVSLNYVHS SRYSQNTLYK MRYNQNVSNI WHQYATTPWY ASSIADIMRS YQDLYLENNF TFDVPVFEG
|
| |