Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0662 |
Symbol | |
ID | 5743778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 862473 |
End bp | 864716 |
Gene Length | 2244 bp |
Protein Length | 747 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641291774 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001557788 |
Protein GI | 160878820 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.409181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAAAG ACCTGATTAA ACAAATGACA CTGGAAGAAA AGGCAGGATT GTGCTCCGGT GCAGATTTTT GGCATACGAA AGCAGTGGAA AGACTAGGCA TTCCGGCCGT GATGGTAACG GACGGCCCGC ATGGTCTACG TAAGCAGGAA GGGGAAGCCG ATCATCTCGG TCTGAACCAG AGTGTTGCTG CTGTATGCTT TTCTCCAGCG TGTGCGAGCG CTTCCAGCTT TGATGAGAAT TTATTATATC GGATGGGCAG TGCTCTTGGT GAAGAATGCA GATCAGAGAA TATAGCAATT TTACTGGGAC CTGCCGTTAA TATTAAGAGA AGTCCGCTTT GCGGTCGTAA TTTTGAATAT TTCTCAGAGG ACCCTTATCT GACTGGTAAG CTTTCTTCAG CACAGATCAA GGGAATTCAG GAGTGGGAGG TTGGTACCAG CCTAAAGCAT TATGCAGTTA ATAATCAGGA GACCTACCGC ATGACCTGCT CCTCAGAGGT GGATGAACGT ACACTGCGTG AAATCTACCT CTCGGGCTTT GAGATGGCTG TGAGAGAGGC AAAGCCTTGG ACCGTAATGA GCAGCTATAA CAAAATTAAT GGAGAATATG CCAGTGAAAA TAAAAAATTG CTAACGGATA TCCTGCGTGA GGAATGGGGC TTCGAGGGAT TCGTAATGTC GGACTGGGGA GCTGTTAACG ACAGAGTAAA GAGCCTTGAG GCAGGACTGG AGCTTGAGAT GCCCTCTTCT AATGGTATTC GTGATGAACA GATTGTGAAA GCAGTCAGAG AAGGCAAGCT CTCGGAGGAA TTATTGGATC TGGCTGTGGA ACGCATTCTT AAGGTTATAT TCAAATATTC CAAGGCCGAT GCTACCGATA CACATTATGA CAGAGAGGAG CATCACAAGA TAGCAACAGA TATGGCAAAG GAATGTGCTG TTCTTTTGAA GAATGAAGGT GCGCTGCCCC TAAGCCGTCA AACAAAAGTT GCCTACATAG GTGCATTTGC CAAAATACCG CGCTATCAGG GAGGAGGCTC CAGCCACATT CATGCCAGCA GGGTAACGAA TGCATTAGAT ATCGGTAAGG ATAAAAATCC TAATATAATT TATGCGGAGG GTTTTCCACA TGACAAGGAT ATGGAAGATC AATCTCTTTT TGAAGAAGCA GTCAAAGCAG CTTCCGAAGC GGATGCAGCC GTTATCTTTG CCGGATTACC CGAATCCTTT GATTCCGAGG GTATTGACAG AAAACATATG AGGCTGCCGG AATGTCAGAA CCGACTCATT GAGCAGATCG CGGGAGTACA GAAGAATACC ATTGTAGTTC TTCATAATGG CTCTGCTGTG GAAATGCCCT GGGCAGATAA GGTGAATGCT ATTCTTGAAA TGTTTCTTGC AGGTCAGGGA GTCGGTGAGG CAACAGATGC CCTGTTATAT GCAGATGCAA ATCCCTGTGG CAGACTGGCA GAAACCTTTC CGCTAAGGCT TGAGGATACC CCGTCATATT TGAATTTCCC TGGTGATGGC AAAAAGGTCG TTTATGCGGA AGGGATTTAT ATTGGGTACC GTTATTATGA GGCTAAAAAA ATACCGGTAT TATTCCCCTT TGGACATGGA CTGTCCTATA CCGAATTCTC TTATCATGAT ATACAGGTAA CTCGTGGTAA TTTTACAGAG GGTGAGAGTA TAACTGTAAC GGCACAGATC ACCAATTCAG GGAAGATGGT AGGAAAAGAA GTAGTGCAGC TGTATGTCGG CGACCGTACC GGCACTCCAG GAAGGCCTGT AAAGGAGTTA AAAGGATTTG CCAAGGTAGA GCTTAAACCG GAAGAGACGA AGCCGGTAAC CTTTGAAATC GATGCCCGTT CTCTATCTTG GTATAACGAA GAAATCGGAG ACTGGTACGC TGCCGGTGGA ACCTATGAGC TTTTACTGGC TCATTCCTCT GCAGATATAC GTCTATCGGA GAAGGTTGAG TTCACTCCTG TGAGGGAGAT TCCTTTTCGG GTAGATGAGA ATACAACGAT AGGCTCATTG TTAAAGAACC CAAAAACCGC ACCGATTATG TCGATGATGC TTTCGAAGTC TAATGGAGGC AATGCATCAG GCTTTGGCAG TTCAAATGAG ATGCTGAAAG AAATGACAGG GGGGCTGCCG CTTAGGGCAC TGTTTGGATT TTCGAAGATT ACAGAGGATC AGATGAAGGA ATTAATCTAT GTATTAAAGG AGCAGCTGAA ATAA
|
Protein sequence | MIKDLIKQMT LEEKAGLCSG ADFWHTKAVE RLGIPAVMVT DGPHGLRKQE GEADHLGLNQ SVAAVCFSPA CASASSFDEN LLYRMGSALG EECRSENIAI LLGPAVNIKR SPLCGRNFEY FSEDPYLTGK LSSAQIKGIQ EWEVGTSLKH YAVNNQETYR MTCSSEVDER TLREIYLSGF EMAVREAKPW TVMSSYNKIN GEYASENKKL LTDILREEWG FEGFVMSDWG AVNDRVKSLE AGLELEMPSS NGIRDEQIVK AVREGKLSEE LLDLAVERIL KVIFKYSKAD ATDTHYDREE HHKIATDMAK ECAVLLKNEG ALPLSRQTKV AYIGAFAKIP RYQGGGSSHI HASRVTNALD IGKDKNPNII YAEGFPHDKD MEDQSLFEEA VKAASEADAA VIFAGLPESF DSEGIDRKHM RLPECQNRLI EQIAGVQKNT IVVLHNGSAV EMPWADKVNA ILEMFLAGQG VGEATDALLY ADANPCGRLA ETFPLRLEDT PSYLNFPGDG KKVVYAEGIY IGYRYYEAKK IPVLFPFGHG LSYTEFSYHD IQVTRGNFTE GESITVTAQI TNSGKMVGKE VVQLYVGDRT GTPGRPVKEL KGFAKVELKP EETKPVTFEI DARSLSWYNE EIGDWYAAGG TYELLLAHSS ADIRLSEKVE FTPVREIPFR VDENTTIGSL LKNPKTAPIM SMMLSKSNGG NASGFGSSNE MLKEMTGGLP LRALFGFSKI TEDQMKELIY VLKEQLK
|
| |