Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_3329 |
Symbol | |
ID | 5741609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 4051097 |
End bp | 4054015 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641294430 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001560421 |
Protein GI | 160881453 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAATG CATCATATCT TGAAACACCA TTATGGGATA ATAGTTTACC TCTTAAGAAA AGGTTAGATT ATCTCGTTGA AAACCTGACA TTGGAAGAAA AATTTGAGTT TCTAGGGACA GGATGTCCGA CGATTGAACG CCTTGGAATA CAAAGTACCT TTCATGGAGG AGAGGCTGCG CATGGTATAG AAGCAAGACA CGACCAGTCA TTTAATAAAG GAGAGCCGGA ACCAACGACA ATATTTCCGC AACCGATTGG AATGAGCTCT ACTTGGGATA CGACACTTTT AACTAAGATT GGCGTAACAG TTGGTAATGA AGCTAGAGTT TTATATCAAC GTCACAAAAA TGGAGGTTTG TGCCGTTGGG CGCCAACGAT AGACATGGAA CGTGATCCAC GTTGGGGACG AACAGAAGAA GCTTATGGGG AGGACCCCTA TCTTACAGGG AAGATGGCTT CTGCATATAT TCAAGGTATG CGTGGAGTTG ATCCGTTTTA TATTCGTTGC GGTGCGACCT TAAAGCATTT CTACGCTAAT AATACAGAAA AGGATAGAAT TTTTTCATCC TCTTCTATCG ACCCTAGAAA TAAACATGAG TATTATTTAG AACCTTTCAA ACGTGCAATT ACAGAGGGGA AAGCCGAAGC TATTATGACG GCATACAACG AAATTAATGG AGTCCCTTGT ATTGTAAATA ATGAAGTAAA AAACATTGTA AAAGAACAGT GGGGGCTTCG TGGGCATGTT GTATGTGATG GTGGTGACAT GATGCAAACA GTTAGTGATC ATAAATATTT TGGCTCCCAT GCAGAAACCA TTGCCTATGG GTTAAAAGCA GGTATTGATT GTTTCACTGA TAATCGTGAA GTGGTTAAAC AAGCAGCCAA AGAAGCTTAT CAAGCCGGGT TGATTACAGA GGCAGATTTG GATACCTCGA TTTGTAATTC ATTTTCAACC AGAATCCGTC TTGGTTATTT TGATGCGATA GGACAAAACC CATATGCACA TATAACCGAA AAATATATTA ATAGTGAAGA GAATAAATCT TTAACGCTAG AAGCTGCAAC AAAAGCCATG GTATTATTAA AGAATGAAGG ACAGATATTG CCACTTACTA AAGAAAATTC ATCCTTTTGT GTGATCGGTC CATTATCCGA TGTTTGGTAT AAAGATTGGT ATAGTGGGAT ACCACCATAT TCCGTAACAC CATTACAAGG TATCAAAGAG TACAACAAAG GGACTCGTAA GAATCTTACG AATTCTAAGG TGGTGGATGG ATTACCTCGC GTTAGAATTC GATATCAAGA GAAATATCTT TGTGTAACAG AGGAAGGTTT TGTCACACTA GGGAGTAAAG AATTAGCAGA GACATTTACT ATCACAGACT GGGGTAACGG AAATCTGACC ATAGCGACTA GTCAAGGGAA ATACCTGTCA GCAGATGAGG AAGAAGGACT TATTACTGCA ACTAAGACGG AAGTATTTGA ATGGTTTGTA AAAGAAGCAT TTGATTTCCA TCTTGTGGGT AATTTAGGCG AAAAAAGTTT TCAAACTATA TTTGAAATTC TAAGTGAACA AAAAAGACAG GAAATATCCG TAACAATAGA TACTTGGAAG GATGATTATG TAGCGATTAA TAAGGAAGGA AAGCTTTCAT TTTCTAAGGA AGAAAAAGTT ACATTGGATC TTGAACTTGT ATCCGATGGT ATCGAAGAAG CGAAAGAATA TACAAAGAAT AGTGATTATG CTATTCTAGT AATGGGTTGT AATCCTGTTA TTAACAGTAA AGAAGAAATT GATCGGAATG ATTTAGATCT ACCGCCATAT CAAGAAAGAT TAATAAAGCA AGTACATAAA GTAAATCCTA AGGTAATTCT TGTACTTATA ACAAATTATC CATATGCAAT ACGCTGGGAA AAGGAACATA TTCCGGCAAT TATAACGACT ACATCTGGAA GTCAAGAACT TGGTAATGCG ATTGCTGCTG TGTTATTTGG TGATGTATCA CCATCGGGTA GGTTGCCAAT GACTTGGTAT CTCGATACGA AGGATTTACC ACCGATAGAG GATTATGATA TTATTCGCGG TAACCGAACT TACCAGTATT TTAACAAAGA GGTGTTATAC CCATTTGGAC ATGGTCTTAC CTATACAACT ATGCAATATC AAAAGCTTAC AGTGCAATTA GAGGATTTTA CTAATCTATT AATTAAAGTA ACTATTGCAA ATACAGGAAA TAGAATTAGT GATGAAGTGG TGCAGGTTTA TGTGAGGCAG GAAGTCTCAA GAACTGTTAG ACCTCGTTTG CAGCTAAAAG CTTTTGAGAG AGTAAAGAAT ATCTTACCTG GAGAGAAGAG GGAGATAGAA TTTATAATTT CTACCAGTGA TTTAACCTAT TATGATGTAG TAAATGGTGG TATGATATTA GAAGAATCTG AGTATACAAT CTTAGTTGGG GCATCTTCTG AGGACATTAG ACTACGCGAA ACAGCTTTCA TTCCTGGAGT GAAAGTGGGA TCTCGAAATC TTAGGAAAAA GATTTTAGCG GATCATTATG ACGACTGCAG GAATTCGTAC CTACACCGTG GAAGCTTAGG TGATACTGCG GTTGTTGTTA AGAATAAAAC TGAAGCAGCG ATACTTCTTT ACCGAGATGT TAGAATAGAA GAAAAGCCAA TTAAATTTCA TTCTACCGTT CAATGTGTGG GTGAAGGAAG TCTGATAGTT TCATACATTA AGCAAAGTGA TGTAGCGAGT TCCGAGGTAA AACTAGGTGA TATTACTTTA GAGAATCAAG ATAAGTTTTG TGATGTTATC ATACCAATTA ACTGGGATAG AATAACATGC AATGAAGTCA TTACATTAAA AATTACATTG TTAGAAGAGA TGAAATTATC CTCTTTCTAT GCAGAATGA
|
Protein sequence | MGNASYLETP LWDNSLPLKK RLDYLVENLT LEEKFEFLGT GCPTIERLGI QSTFHGGEAA HGIEARHDQS FNKGEPEPTT IFPQPIGMSS TWDTTLLTKI GVTVGNEARV LYQRHKNGGL CRWAPTIDME RDPRWGRTEE AYGEDPYLTG KMASAYIQGM RGVDPFYIRC GATLKHFYAN NTEKDRIFSS SSIDPRNKHE YYLEPFKRAI TEGKAEAIMT AYNEINGVPC IVNNEVKNIV KEQWGLRGHV VCDGGDMMQT VSDHKYFGSH AETIAYGLKA GIDCFTDNRE VVKQAAKEAY QAGLITEADL DTSICNSFST RIRLGYFDAI GQNPYAHITE KYINSEENKS LTLEAATKAM VLLKNEGQIL PLTKENSSFC VIGPLSDVWY KDWYSGIPPY SVTPLQGIKE YNKGTRKNLT NSKVVDGLPR VRIRYQEKYL CVTEEGFVTL GSKELAETFT ITDWGNGNLT IATSQGKYLS ADEEEGLITA TKTEVFEWFV KEAFDFHLVG NLGEKSFQTI FEILSEQKRQ EISVTIDTWK DDYVAINKEG KLSFSKEEKV TLDLELVSDG IEEAKEYTKN SDYAILVMGC NPVINSKEEI DRNDLDLPPY QERLIKQVHK VNPKVILVLI TNYPYAIRWE KEHIPAIITT TSGSQELGNA IAAVLFGDVS PSGRLPMTWY LDTKDLPPIE DYDIIRGNRT YQYFNKEVLY PFGHGLTYTT MQYQKLTVQL EDFTNLLIKV TIANTGNRIS DEVVQVYVRQ EVSRTVRPRL QLKAFERVKN ILPGEKREIE FIISTSDLTY YDVVNGGMIL EESEYTILVG ASSEDIRLRE TAFIPGVKVG SRNLRKKILA DHYDDCRNSY LHRGSLGDTA VVVKNKTEAA ILLYRDVRIE EKPIKFHSTV QCVGEGSLIV SYIKQSDVAS SEVKLGDITL ENQDKFCDVI IPINWDRITC NEVITLKITL LEEMKLSSFY AE
|
| |