Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0750 |
Symbol | |
ID | 4205160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 878279 |
End bp | 881308 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 642565310 |
Product | beta-galactosidase |
Protein accession | YP_698076 |
Protein GI | 110802032 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0186034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATTAA AAAATGAATA CCATGAAGAT ATTTCTAAAC TGCATGTAAA TATGATGCCA AGAAGAAGTT ATTATGTTCC CTTTGTGGAT ACTGATGAAG CTTTAAATAT TAAAGATAGA AGTAAGCAAG TTAACTTCTT TTCACTTAAT GGTAAATGGG AATTTAATTA CTTTGATAGC TTACAAAAAG TTAAAGAGTT TGATAATATA AATCAAATTA ATTTTTCAGA TAATATTGAT GTACCATCTC TTTGGCAATT AAAGGGATAT GATTATAATC AATACACTAA TGTAAAATAT CCAATTCCTT TTGATCCACC TTTTGTTCCT ATAAACAATC CATGTGGAAT TTATAAGAGA GATTTTGAAA TTGAAATACT TCCTGAAAAT TATGATTATA ATATAAACTT TGAAGGCGTT GATTCCTGCT TTTATTTTTG GATAAATGAT AACTTTGTTG GGTATAGCCA AATATCTCAT AGCATATCTG AGTTTGATAT TACAGAATTT TTAGTAAAGG GTAAAAATAC AATTACAGTC TTAGTTTTAA AATGGTGTGA TGGAACTTAC TTTGAAGATC AAGACAAGTT TAGAATGTCT GGAATTTTTA GAGATGTATA TATACTAAGA AGAGCAAAGG AAAGAATAGT TGACTATAAA ATAACTCAAA GTATAGATTT TTCAGCTAAA GAAGGAAAGT TAGACCTTGA AATTTTATCT AATATAGGAA ATCCAAAAGG GAAATATTAT TTATTAAATC CTAATAATCA CATGATAGCA TCTGGAAATA TAGATAATAA TAAAGTCCAA ATCAATATTA AAAATGTAGA ACTTTGGAGT GCTGAAATAC CTAATTTATA TACCTTATTA ATTGAAACAG AGCATGAAGT TATAAAAGAA AGAATAGGTA TGAGAGAAAT TAAAATAGAG AATTCTATTT TAAAAATTAA TAATAAAAAA GTAAAGTTAA GAGGAGTTAA TCATCATGAT AGTAATCCAA CTAAGGGATA TGTTATGACA TATGATGATA TGATTCTAGA TTTAAAAATA ATGAAAGAGT GTAATTTTAA CTCTATAAGA ACAGCTCATT ATCCTAAATC TCCGATTTTT TATGAGTTAT GTGATGAATA TGGTTTTTAT GTTATGAGTG AGGCTGATAT TGAAATTCAT GGGGTTGTTG AACTATATGG ACTAGGATAT TTAGATAATT ATAATATGAT AGCTGATGAT AAAGTTTATG AAAAGGTTAT TATTGATAGA GTTGATTCAT CTATAGTTCC TTTTAAGAAT AAATCTTGTA TTTTCATGTG GTCTCTTGGA AATGAGTCAG GATTTGGATG TAACTTTGAA AGAGGATTAG AATATGCTAG GGCATTAGAT CCTACACGTC CACTTCATTA TGAAGGTGCT TATTATGCTA GCAAAGAAAG AGAAAATGAC TTTACAAATA TTGATGTTAT TAGTCGTATG TATATAAGTA TAGAGGAAAT CAAGGATTAT TTTGAAAAAG GAATAAACAA GCCATTAATA TTATGTGAGT ATGCACATGC TATGGGAAAT GGACCAGGTG GATTACAAGA CTATGATGAA ATGATACAAA AGTATGACCA GTTTGCAGGG GCATATGTTT GGGAGTGGTG TGACCATGCA ATACTTATTA ATGAAGATGT TAATGGTAAA AAAGCATATG GATATGGTGG CGATTTTGAA GAGGAAAATC ATGATGGAAA CTTCTGTGTT GATGGTTTAG TTTACCCAGA TAGAACTCCT CACACTGGAT TATTAGAATA TAAAAATATC AATAGACCTA TAAGAGCAAT AGAATTTGAT GAAGTTAAGA AAAGAGTTAA GTTTAAAAAT ATGCTTGATT TTAGGGATGT TTCTGAGTTC TTAGATGTTA CCTATAAAGT ATTTTTAGAT GGAGAAACAA TTTTCGGAGG TAATATAGAT TTAGAAAGCT TAAAAGCAAA AGAAGAAAAA TGGTATGATT TAAGCATATC AGAATTACCA AAGGGAATTA TAACTATATT ATTCCAGTAT AGAGTGAAAA ATAATAATCA CCTTTATGAA AAAGGTGAAG TTCTAGGCTT TGACAATTTT ATCATAAAAA ATGGAGTAGA TAATATCTCA TCTGTTGATA AAATCTTAAA AAGTACTATT AATGAACAAA AATTTTATGT AGAAGAAACA GTTAATAAGA TTAAAGTTAA AAATAATGAG TTTATTTATA ATTATAATAA AAATACAGGT TCCTTTGATT TTATTCAGGC ATTAGGAGAA ACATTTATAG ATGATCCAAT GAAATTTATT ATTTGGAGAG CACCTACAGA TAATGATAGA AAGATTAAAA ATCTGTGGAT TGAAGCTGGC TTTAATCAAA TTACTACAAG AGTGTATAAT AGTAAAATTA AGGAGTTTTC CAATAGGGTG GAGATAACTA GCGATTTAAG CTTGATACCA CCATATAGAG AAAGAGTCCT TGATCTTAAG TTAACATGGA GTATATACTC AGAGGGATTA ATTAAATGCC ATGTTAAAGG GAATAAAAAT ATGAAAACAC CTTATTTACC AAGGTTTGGT GTAGAGCTTA AGCTTAATAA ATCCTATGAA GAGGTAAGTT ACTTTGGATT TGGACCATAT GAAAATTACG TGGACAAAAA TTCATCTTGT TATTTAGGAA GATTTAATTC TAAGGTTTCT GAAATGCATG AAGATTATAT AAGACCTCAA GAAAATGGAA GTCATCATTA TTGTAGAGAA GTAGCTATTA ATAATGAAAA AGGAAAGGTT TGTGTTTTAT CAGAAAATGA CTTTGCATTC AATGTTTCAC ACTTTTCTTT AAATCAATTA ACTAATGCAA ATCATAATTT TGATTTGAAT GAAGAAGAGG CAACTTATTT AATTGTAGAT TATAAACAAA GTGGTATAGG ATCAAATAGT TGTGGCCCCG ATTTAGATGA AGAATATAGA CTAAATGAAA AAGAATTTTC TTATGATTTT TACTTAAAAT TTGTGAAAGA AAATATATAG
|
Protein sequence | MILKNEYHED ISKLHVNMMP RRSYYVPFVD TDEALNIKDR SKQVNFFSLN GKWEFNYFDS LQKVKEFDNI NQINFSDNID VPSLWQLKGY DYNQYTNVKY PIPFDPPFVP INNPCGIYKR DFEIEILPEN YDYNINFEGV DSCFYFWIND NFVGYSQISH SISEFDITEF LVKGKNTITV LVLKWCDGTY FEDQDKFRMS GIFRDVYILR RAKERIVDYK ITQSIDFSAK EGKLDLEILS NIGNPKGKYY LLNPNNHMIA SGNIDNNKVQ INIKNVELWS AEIPNLYTLL IETEHEVIKE RIGMREIKIE NSILKINNKK VKLRGVNHHD SNPTKGYVMT YDDMILDLKI MKECNFNSIR TAHYPKSPIF YELCDEYGFY VMSEADIEIH GVVELYGLGY LDNYNMIADD KVYEKVIIDR VDSSIVPFKN KSCIFMWSLG NESGFGCNFE RGLEYARALD PTRPLHYEGA YYASKEREND FTNIDVISRM YISIEEIKDY FEKGINKPLI LCEYAHAMGN GPGGLQDYDE MIQKYDQFAG AYVWEWCDHA ILINEDVNGK KAYGYGGDFE EENHDGNFCV DGLVYPDRTP HTGLLEYKNI NRPIRAIEFD EVKKRVKFKN MLDFRDVSEF LDVTYKVFLD GETIFGGNID LESLKAKEEK WYDLSISELP KGIITILFQY RVKNNNHLYE KGEVLGFDNF IIKNGVDNIS SVDKILKSTI NEQKFYVEET VNKIKVKNNE FIYNYNKNTG SFDFIQALGE TFIDDPMKFI IWRAPTDNDR KIKNLWIEAG FNQITTRVYN SKIKEFSNRV EITSDLSLIP PYRERVLDLK LTWSIYSEGL IKCHVKGNKN MKTPYLPRFG VELKLNKSYE EVSYFGFGPY ENYVDKNSSC YLGRFNSKVS EMHEDYIRPQ ENGSHHYCRE VAINNEKGKV CVLSENDFAF NVSHFSLNQL TNANHNFDLN EEEATYLIVD YKQSGIGSNS CGPDLDEEYR LNEKEFSYDF YLKFVKENI
|
| |