Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0766 |
Symbol | |
ID | 4202276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 908589 |
End bp | 911627 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 638081650 |
Product | beta-galactosidase |
Protein accession | YP_695217 |
Protein GI | 110799026 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.156765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATTAA AAAATGAATA CCATGAAGAT ATTTCTAAAC TGCATGTAAA TATGATGCCA AGAAGAAGTT ATTATGTTCC TTTTGTGGAT ACTGATGAAG CTTTAAATAT TAAAGATAGA AGTAAGCAAC TTAACTTCTT TTCACTTAAT GGTAAATGGG AATTTAATTA CTTTGATAGC TTACAAAAAG TTAAAGAGTT TGATGATATA AATCAAATTA GTTTTTCAGA TAATATTGAT GTTCCATCTC TTTGGCAATT AAAGGGATAT GATTATAATC AATACACTAA TGTAAAATAT CCAATTCCTT TTGATCCACC TTTTGTTCCT ATAAACAATC CATGTGGAAT TTATAAGAGA GATTTTGAAA TTGAAATACT TCCTGAAAAT TATGATTATA ATATAAACTT TGAAGGCGTT GATTCCTGCT TTTATTTTTG GATAAATGAT AACTTTGTTG GGTATAGCCA AATATCTCAT AGCATATCTG AGTTTGATAT TACAGAATTT TTAGTAAAGG GTAAAAATAC AATTACAGTC TTAGTTTTAA AATGGTGTGA TGGAACTTAT TTTGAAGATC AAGACAAGTT TAGAATGTCT GGAATTTTTA GAGATGTATA TATATTAAGA AGAGCAAAGG AAAGAATAGT TGACTATAAA ATAACTCAAA GTATAGATTT TTCAGCTAAA GAAGGAAAGT TAGACCTTGA AATTTTATCT AATATAGGAA ATCCAAAAGG GAAATATTAT TTGTTAAATC CTAATAATCA CATGATAGCA TCTGGAAATA TAGATAATAA TAAAATCCAA ATCAATATTA AAAATGTAGA ACTTTGGAGT GCTGAAATAC CTAATTTATA TACCTTATTA ATTGAAACAG AGCATGAGGT TATAAAAGAA AGAATAGGTA TGAGAGAAAT TAAAATAGAG AATTCTATTT TAAAAATTAA TAATAAAAAA ATAAAGTTAA GAGGAGTTAA TCATCATGAT AGTAATCCAA CTAAGGGATA TGTTATGACA TATGATGATA TGATTCTAGA TTTAAAAATA ATGAAAGAGT GTAATGTTAA CTCTATAAGA ACAGCTCATT ATCCTAAATC TCCGATTTTT TATGAGTTAT GTGATGAATA TGGTTTTTAT GTTATGAGTG AGGCAGATAT TGAAATTCAT GGGGTTGTTG AACTATATGG ATTAGGATAT TTAGATAATT ATAATATGAT AGCTGATGAT AAAGTTTATG AAAAGGTTAT TATTGATAGA GTTGATTCAT CTATAGTTCC TTTTAAGAAT AAATCTTGTA TTTTCATGTG GTCTCTTGGA AATGAGTCAG GATTTGGATG TAACTTTGAA AGAGGATTAG AATATGCTAG GGCATTAGAT CCTACACGTC CACTTCATTA TGAAGGTGCT TATTATGCTA GCAAAGAAAG AGAAAATGAC TTTACAAATA TTGATGTTAT TAGTCGTATG TATATAAGTA TAGAGGAAAT CAAGGATTAC TTTGAAAAAG GAATAGACAA GCCATTAATA TTATGTGAGT ATGCACATGC TATGGGAAAT GGACCAGGTG GATTACAAGA CTATGATGAA ATGATACAAA AGTATGACCA GTTTGCAGGG GCATATGTTT GGGAGTGGTG TGACCATGCA ATACTTATTA ATGAAAATAT TAATGATAAA AAAGCATATG GATATGGCGG CGATTTTGAA GAGGAAAATC ATGATGGAAA CTTCTGTGTA GATGGTTTAG TTTACCCAGA TAGAACTCCT CATACTGGAT TATTAGAATA TAAAAATATC AATAGACCTA TAAGAGCCAT AGAATTTGAT GAAGTTAAGA AAAGAGTTAA GCTTAAAAAT ATGTTTGATT TTAGGAATGC TGGTGAATTC TTAGATGTTA CCTATAAAGT ATTTTTAGAT GGAGAAATAA TTTTCGGAGA TGATATAGAT TTAGAAAGCT TAAAAGCAAA AGAAGAAAAA TGGTATGATT TAAGCATATC AGAATTACCA AAGGAAATTA TAACTATATT ATTTCAGTAT AAAGTGAAAA ATCATAATCA TCTGTATGAA AAAGGTGAAG TTCTAGGCTT TGACAATTTT ATCATAAAAA ATGGGGTAGA TAATATCTCA TCTGTTGATA AAATCATAAA AGGTAATATT AATGAACAAA AATTTTATGT AGAAGAAACA GTTAATAAGA TTAAAGTTAT AAATAATGAG TTTATTTATA ATTATAATAA AAATACAGGT TCCTTTGATT TTATTCAAGC ATTAGGAGAA ACATTTATAG ATGATCCAAT GAAATTTATT ATTTGGAGAG CACCTACAGA TAATGATAGA AAGATTAAAA ATCTGTGGAT TGAAGCTGGC TTTAATCAAA TTACTACAAG AGTGTATAAT AGTAAAATTA AGGAGTTTTC AAATAGGGTG GAGATAACTA GCGATTTAAG CTTGATACCA CCATATAGAG AAAGAGTCCT TGATCTTAAG GTAACATGGA GTATCTATTC AGAGGGGTTA ATTAAATGCC ATGTTAAAGG GGATAAAAAT ATGAAAACAC CTTATTTACC AAGGTTTGGT GTAGAGCTTA AGCTGAATAA ATCCTATGAA GAGGTAAGTT ACTTTGGATT TGGACCATAT GAAAATTACG TAGATAAAAA TTCATCTTGT TATTTAGGAA GATTTAATTC TAAGGTTTCT GAAATGCATG AAGATTATAT AAGACCTCAA GAAAATGGAA GTCATCATTA TTGTAGAGAA GTAGCTATTA ATAATGAAAA AGGAAAGGTT TATGTTTTAT CAGAAAATGA CTTTGCCTTT AATGTTTCAC ACTTTTCTTT AAATCAATTA ACTAATGCAA ATCATAATTT TGATTTGAAT GAAGAAGAGG CAACTTATTT AATTGTAGAT TATAAACAAA GTGGTATAGG ATCAAATAGT TGTGGCCCTG ATTTAGATGA AGAATATAGA CTAAATGAAA AAGAATTTTC TTATGATTTC TACTTAAAAT TTGTAAAAGA TAATATAAAG GATAATTAA
|
Protein sequence | MILKNEYHED ISKLHVNMMP RRSYYVPFVD TDEALNIKDR SKQLNFFSLN GKWEFNYFDS LQKVKEFDDI NQISFSDNID VPSLWQLKGY DYNQYTNVKY PIPFDPPFVP INNPCGIYKR DFEIEILPEN YDYNINFEGV DSCFYFWIND NFVGYSQISH SISEFDITEF LVKGKNTITV LVLKWCDGTY FEDQDKFRMS GIFRDVYILR RAKERIVDYK ITQSIDFSAK EGKLDLEILS NIGNPKGKYY LLNPNNHMIA SGNIDNNKIQ INIKNVELWS AEIPNLYTLL IETEHEVIKE RIGMREIKIE NSILKINNKK IKLRGVNHHD SNPTKGYVMT YDDMILDLKI MKECNVNSIR TAHYPKSPIF YELCDEYGFY VMSEADIEIH GVVELYGLGY LDNYNMIADD KVYEKVIIDR VDSSIVPFKN KSCIFMWSLG NESGFGCNFE RGLEYARALD PTRPLHYEGA YYASKEREND FTNIDVISRM YISIEEIKDY FEKGIDKPLI LCEYAHAMGN GPGGLQDYDE MIQKYDQFAG AYVWEWCDHA ILINENINDK KAYGYGGDFE EENHDGNFCV DGLVYPDRTP HTGLLEYKNI NRPIRAIEFD EVKKRVKLKN MFDFRNAGEF LDVTYKVFLD GEIIFGDDID LESLKAKEEK WYDLSISELP KEIITILFQY KVKNHNHLYE KGEVLGFDNF IIKNGVDNIS SVDKIIKGNI NEQKFYVEET VNKIKVINNE FIYNYNKNTG SFDFIQALGE TFIDDPMKFI IWRAPTDNDR KIKNLWIEAG FNQITTRVYN SKIKEFSNRV EITSDLSLIP PYRERVLDLK VTWSIYSEGL IKCHVKGDKN MKTPYLPRFG VELKLNKSYE EVSYFGFGPY ENYVDKNSSC YLGRFNSKVS EMHEDYIRPQ ENGSHHYCRE VAINNEKGKV YVLSENDFAF NVSHFSLNQL TNANHNFDLN EEEATYLIVD YKQSGIGSNS CGPDLDEEYR LNEKEFSYDF YLKFVKDNIK DN
|
| |