Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1117 |
Symbol | |
ID | 4202661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1274949 |
End bp | 1277291 |
Gene Length | 2343 bp |
Protein Length | 780 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 638081998 |
Product | glycosy hydrolase family protein |
Protein accession | YP_695563 |
Protein GI | 110799795 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.158926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGATT ATAGAGAGCC TATTTATAAA ATAGAGCCTT GGAATATAAC AGAAGAAGAA TTTCTTTTAA AAAATAATTA TAGAAATGAA ACAACATTTT CATTGGCTAA TGGATATATA GGTACTAGAG GCACCTTTGA AGAGGAATAT GATTTTGATG TTGAAACAGG ATTAGAAGGA AACTTTGTAA ATGGCTTTTA TGAAAGTGAG CATATAAGAT ATGGGGAATG GAATTTTGGT TTCCCTACAG AAAGTCAATC ACTTTTAAAT CTTCCAAATG CTAAAATAAT AAAGCTATTT ATAGAAGATG AAGAATTCAG TATGCTTACA GGTGAAATAG AAGATTATAA GAGAGTTTTG CATATGAAAG AGGGAAGGAT AACTCGAGAT CTTATATGGG TATCACCTAA GGGTAAAAAA ATAAAAATTA GCATAAGTAG ATTTGTTAGT TTTAATAATA AAAATTTAAT GGAAATACGT TATAAAGTTA CTCCTTTAAA CTTTAGTGGA AATTTAAAAT TTATCTCTGC TATTGATGCA AATGTAGAAA ATCATACAAG AAAAACTAAT CCTTTAGTTG ATTATGGTCC TTTTGGTAAA AGGCTTGCAA ATGATTATAT AGATTCCATA AAAGATGAAC TTTATTATGA AGGAACAACA TTAAATAGTG AACTTTCAAT AGCTTGTGGA GCAGTAAATA AAATATCAGC AGAAAACTTT ATAAGAAAGA ATTTTAAAAA TTATGAGCTT TTTGGGGTAT CTTATGAATT TTATGCTAAG GAAAATAAAG AAATTATATT AGATAAGTTT ATTGCATATA GCACATCTTT AGATATGAAT TGTAAAAAGT TACATGGCTT TATAAAAACT ATTTTAAGTG ATGCCAAGAA AAAAGGATAT ATAGAAGCTG AGAGAGAGCA AAAGGAATAT GTTGAAGAAT TTTGGAGAAC AGCTGATGTA ATTATTGAGG GAGATGATGC ACTTCAACAG GGAATAAGAT TTAATCTTTT TCACCTTATG CAATCAGCTG GGAGAGATGG TAAAACCGGA ATGGGAGCTA AAGGGTTAAG TGGAGAAGGA TATGAGGGAC ATTATTTTTG GGATACAGAA ATGTATGTAC TTCCTGTATT TGTATATACT AAGCCAGATT TAGCTAAGAA ATTACTAGAT TATAGGTATT TTACTTTAGA TAAAGCTAGA GAGAGAGCAA GGGTATTAGG ACATGATAAA GGAGCCTTAT ATCCATGGAG AACAATTAAT GGAGAGGAAG CATCAACATA TTTTCCTTTA GGAACGGCTC AATATCATAT AAATGCAGAC ATAGCATATG CCTTTAAGCT TTACGTAGAT GTTAATGATG ATTTTCACTA TTTAAAGGAT AAAGCAGCTG AGGTTTTATG TGAAACTGCA AGGGTTTGGG CTGATGTAGG ATCATTTTCT GAGTATGTAG GAAATAAATA TTGTATTTGT GCTGTAACTG GGCCAGATGA GTACAATGCT ATAGTTGATA ATAATTTTTA CACTAATCTT ATGGCCCGTG AAAACCTTAG AGATGCAATA TGGGCATTAA ATAAGATTAA AGAAAAAGAT CAATTAGCTT ATGATAATTT GGTTAAAAAG ATTGATTTAA AAGATGAAGA AATAGAATAT TGGAAAAAAA TAATTGAAAA TATGTATTTC CCGTATGATG AGAAAAGAGG GGTGTATCCT TTAGATGATG GCTTTATGAA GAGGAAGCAT TGGGATGATT CAAAAATACC TAAAGAAAAG AGACATCTTC TTTATGAAAA TTATCATCCT CTATTTATAT TTAGACAAAG GATGTCAAAG CAAGCTGATG CAATTTTAGC AATGTACCTT CATAGTAATC TTTTTAGTAT AGAGGAATTA AGAAAAAACT ATGATTTCTA TCAAGAAGTA ACATTGCATC ATTCCTCTTT ATCAACTTGT ATATTTGGTA TTTTAGCTAG TCAAATTGGA TATGATGAGG AGGCTTACAA GTATTTTTCA CAGTCAGCAA GAATGGACTT AGATGATTAT CACAATAACT TTTATGCTGG AATTCATGCT GCAAATATGG CAGGAACCTG GCAAGGCATT GTAAATGGTT TTGCAGGACT TAGAACTAAT AAAGGAATTT TAGAATTAAA TCCTACAATA CCTAAGGAGT GGAATGCATA TAGTTTTAAA ATTTTCTATA AGAAGAATCT ATTAGAAATA AAAATTTCTA AAGATGAAAT TGAAATAAGA CTATTAGAGG GCGAGAATTT AGAATTATAT GTATATGGAG AAAAAGTTTA TCTTAAAAAT TTAAGTGAAA TAATAAAAAT ACCAGCTAAA TAA
|
Protein sequence | MEDYREPIYK IEPWNITEEE FLLKNNYRNE TTFSLANGYI GTRGTFEEEY DFDVETGLEG NFVNGFYESE HIRYGEWNFG FPTESQSLLN LPNAKIIKLF IEDEEFSMLT GEIEDYKRVL HMKEGRITRD LIWVSPKGKK IKISISRFVS FNNKNLMEIR YKVTPLNFSG NLKFISAIDA NVENHTRKTN PLVDYGPFGK RLANDYIDSI KDELYYEGTT LNSELSIACG AVNKISAENF IRKNFKNYEL FGVSYEFYAK ENKEIILDKF IAYSTSLDMN CKKLHGFIKT ILSDAKKKGY IEAEREQKEY VEEFWRTADV IIEGDDALQQ GIRFNLFHLM QSAGRDGKTG MGAKGLSGEG YEGHYFWDTE MYVLPVFVYT KPDLAKKLLD YRYFTLDKAR ERARVLGHDK GALYPWRTIN GEEASTYFPL GTAQYHINAD IAYAFKLYVD VNDDFHYLKD KAAEVLCETA RVWADVGSFS EYVGNKYCIC AVTGPDEYNA IVDNNFYTNL MARENLRDAI WALNKIKEKD QLAYDNLVKK IDLKDEEIEY WKKIIENMYF PYDEKRGVYP LDDGFMKRKH WDDSKIPKEK RHLLYENYHP LFIFRQRMSK QADAILAMYL HSNLFSIEEL RKNYDFYQEV TLHHSSLSTC IFGILASQIG YDEEAYKYFS QSARMDLDDY HNNFYAGIHA ANMAGTWQGI VNGFAGLRTN KGILELNPTI PKEWNAYSFK IFYKKNLLEI KISKDEIEIR LLEGENLELY VYGEKVYLKN LSEIIKIPAK
|
| |