Gene Information |
Locus tag | CPR_0815 |
Symbol | |
ID | 4205918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 941279 |
End bp | 943693 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642565374 |
Product | beta-galactosidase |
Protein accession | YP_698140 |
Protein GI | 110803978 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
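The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the terminal stop codon. Both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming the stop codon is part of the gene."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(941279, 943693)
print(length)                  # 2415
print(protein_length(length))  # 804
```

Under these assumptions, the Start bp and End bp values reproduce the listed Gene Length (2415 bp) and Protein Length (804 aa).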
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.57434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAAA TAATTCATAT CAATGATCAA TGGTTTTATG CAAATGATTA CAAAGGTGAG TATTTAAAAA ATGAGTTTGA TTTTAGTAAT TTTGAAAGGG TAGATTTACC TCATACAAAT ATAGAGTTAC CATATAATTA TTTTGATGAG AAATCATATC AATTTGTTTC TACTTATGTT AAAACTTTAA AATTTGATAA TAGTGTTAAA GGGAAAAAGG TATTTTTAGA TTTTGAAGGA GTTATGATAG CCGCAGAAGT ATATTTAAAT GGAATTCATG TTGGTGGACA TAAGGGTGGT TACACTAATT TTTCAATAGA TATTACTGAT GCTTTAAAAA TTAATGAAGA TAATATATTA AAGGTTGTTG TTGACTCAAC AGAAAGACCG GATATACCTC CTCATGGATA TGTTGTTGAT TATTTAACCT ATGGAGGAAT ATATAGAGAA GTTTCCTTAA GAGTAGTTGA ACCTATATTT ATAAATAATT TATATGCAAG GGCATATGAT TGTTTAAAGG AAGAAAAAAG ATTAGAACTT TATATAGAAA TAAATAATTT TGAAAAATAT AGAGATGATT TAGAAATTGT TGTAGACTTT GGTGATGATA CTTTTGAAGA AACTTTAAGT ACAAAGTTAC CAATTGAAGA AGGAACAAGT ATTAAAAATA TAGAAATAGA CCAATTAAAT ATGGTTAAGT TATGGGATAT AGAAAATCCT AAGCTTTATG AAATAAAAGT TAAGTTATTA AAAGGTTCAG AAGTTATAGA TGAATATAAA GATAATTTTG GATTTAGAGA GGCTGAATTT AGATCAGATG GTTTCTATTT AAATGGAAGA AGAGTTAAAC TTGTTGGATT AAATCGTCAT CAGGCTTATC CATATGTAGG ATATGCTATG CCTCAAAGAG TTCAAGAAAA GGATGCTGAG ATTTTAAAAT ATGAATTAGG ACTCAACATA GTTAGAACAT CTCACTATCC GCAATCAGTA CACTTCTTAA GAAAATGTGA TGAGATTGGA CTATTAGTTT TTGAAGAGAT ACCTGGTTGG CAACATATAG GTGATGAAGC TTGGCAAGCA GAATCTATTA AAAATGTAGA GGAAATGATA AAAAAAGATT ACAATAGACC TTCCATAGTT TTATGGGGCG TTAGAATAAA TGAGTCTCAA GATAGTCATG ATTTTTATGT GAAAACTAAT GCTATGGCAA AGAGTTTAGA TCCTATTAGA CAAACTGGTG GGGTTAGATA CTTAGAAAAT AGTGATTTCC TAGAAGATGT TTATACCATG AATGATTTTA TACACAGTGG GGGAGAAAAA GTATTAAGAA CTCAAAGTGA AGTAACAGGA CAAGTAGATA AAGTTCCTTA TTTAGTAACT GAGTATAATG GGCATATGTA TCCAACAAAA AGCTTTGATC AAGAATGTAA AAAAGTTGAA CATGCTTATA GACATTTGAG AGTTATTAAT GAATCCTTTG GCTTAGATGA AATAAGTGGA GCCATAGGAT GGTGTGCTTT TGATTATAAT ACACATAGTT CCTTTGGTTC AGGAGATAAA ATTTGTTACC ATGGAGTTTC TGATATGTTC AGAAATCCTA AGTATGCAGC TTATTCCTAT GCTAGCCAAA AGAAAGTAGA AGATGGTGTG GTTTTAGAAC CTATTACTTT AGGGGCTAAG GGAGAAAGGG ATGGAGGAGC AATACTTCCA TTTACAGTTC TTACAAACTG TGATTATATA AAAATATTTA AAGATGGAAT ATATATAGAT ACTTATTATC CTAATAAAGA AAAGTTCCCT 
AATTTACCAC ATCCACCAAT AGAGGTTTCA CATATTTTAT CTATGGATTC AGAAATACCT CTTACTGAAG AAGCAAAAAA AGAAATTAAA GACTTTGTAT TAAATAAATT AAAAGATTCT AATTTAACTA ATTTAGCTGA AGAAGATTTT AAATATATTG AAGAATTTAG TGAAAGAGTA AATATACCTG TATTTAAAAT AATGTCTTTA GTTTATAAAT TAGCTGGAGG TTGGGGAGAT AAGGAAAACT CCTTAATAAT AAAAGGCTTT ATAGATAATA AAGAGGTTGC TTCTAAAGAA ATAGGTGAAC TTAGAAGCAT GAATAAACTT GAAGTTACAC CAGATAATTT AGAACTTTCA TTAGATAAAA CAAGTTATGA TGCTACTAGA ATCGTGGTTA AACTTTTAGA TAATTTAGGA GAGGTTCTTT TCTTAAATAA TGATTTTATT GAAGTAGAAA TAGATGGACC TTTAAGTATA ATGGGACCAA GTAAGTTTGG AATTTCTGGT GGAGCAGTCG CTTTCTGGGT AAGAACTCAA GGAAAAACTG GGCTTTGCAA AATAAAGGTT AAGAGCATGT ACTTTGAAGA AGAAATTTCT ATAGAAGTTA AGTAG
|
Protein sequence | MRKIIHINDQ WFYANDYKGE YLKNEFDFSN FERVDLPHTN IELPYNYFDE KSYQFVSTYV KTLKFDNSVK GKKVFLDFEG VMIAAEVYLN GIHVGGHKGG YTNFSIDITD ALKINEDNIL KVVVDSTERP DIPPHGYVVD YLTYGGIYRE VSLRVVEPIF INNLYARAYD CLKEEKRLEL YIEINNFEKY RDDLEIVVDF GDDTFEETLS TKLPIEEGTS IKNIEIDQLN MVKLWDIENP KLYEIKVKLL KGSEVIDEYK DNFGFREAEF RSDGFYLNGR RVKLVGLNRH QAYPYVGYAM PQRVQEKDAE ILKYELGLNI VRTSHYPQSV HFLRKCDEIG LLVFEEIPGW QHIGDEAWQA ESIKNVEEMI KKDYNRPSIV LWGVRINESQ DSHDFYVKTN AMAKSLDPIR QTGGVRYLEN SDFLEDVYTM NDFIHSGGEK VLRTQSEVTG QVDKVPYLVT EYNGHMYPTK SFDQECKKVE HAYRHLRVIN ESFGLDEISG AIGWCAFDYN THSSFGSGDK ICYHGVSDMF RNPKYAAYSY ASQKKVEDGV VLEPITLGAK GERDGGAILP FTVLTNCDYI KIFKDGIYID TYYPNKEKFP NLPHPPIEVS HILSMDSEIP LTEEAKKEIK DFVLNKLKDS NLTNLAEEDF KYIEEFSERV NIPVFKIMSL VYKLAGGWGD KENSLIIKGF IDNKEVASKE IGELRSMNKL EVTPDNLELS LDKTSYDATR IVVKLLDNLG EVLFLNNDFI EVEIDGPLSI MGPSKFGISG GAVAFWVRTQ GKTGLCKIKV KSMYFEEEIS IEVK
|
| |
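The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation such as the GC content reported in the Gene Information table. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full 2415 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, as printed in the record.
example = clean_sequence("ATGAGGAAAA TAATTCATAT")
print(len(example))         # 20
print(gc_content(example))  # 0.2
```

Applying `gc_content` to the complete cleaned gene sequence should give a value comparable to the GC content field, subject to how the database rounds it.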