Gene Information |
Locus tag | Cphy_3749 |
Symbol | |
ID | 5742948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 4599498 |
End bp | 4601201 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641294861 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001560835 |
Protein GI | 160881867 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
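The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (consistent with the stated gene length), and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cphy_3749.
length_bp = gene_length(4599498, 4601201)
print(length_bp, protein_length(length_bp))  # prints: 1704 567
```

Both results agree with the Gene Length and Protein Length fields in the record.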
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.634589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GAACAAAGCT TGCATTTATT GGAACTATCG CAGTAGTGGT TGTTGTACTT GCGATTGTTG TATCTTATAT CGTAGAGAAA AATACACCAA GTAAAGAAGT AAAAGCACTA TCGGAATTCT ATCAAGTACC GGAAGGGGAA GCGATGGTCA TCATGGATGA TATCGTTTAT GAGAGAAATG CAAAACTTCT GGATGGTGTT TTATATATGG ATCTTGAAAC CATAAAAGTT AAGTTCAATC AGAGGTTTTA TTGGGATGCG ATAGAAAATG TATTAATCTA CACAACGCCA ACAGAAATTA TTAAGGCAGA GGTAGGTACA AAAGATTATC TTGTAAATAA AAATAAAGTA TCATCAAATT ATCCAATCGT AAAGATGGTG AATACAGAAG TTTATGTAGC ACTACCCTTT GTTGCAGAAT ATTCAGATAT GCGTTATAAA GCTTATGAAA ATCCGGATTT AGTTGTAATT CAGTGTAAAT GGGGAGATTA CTTATTTGCT GATGTAGAGA ATGCAACACA GATAAGAACA GGAGCATCCA TTAAAAGTCC CATATTAAAA GAACTAAATA AAGGGGATCG TGTTTTATTA ATCAATAATG GCGGAAATCA ACAAAATGGA TTCTTAACCG TTATGACAGA AGAAGGAATT AGGGGATTCG TCCGTAAAAA GAACCTTTCA AATTCTTACT ACGATAAGGT AACAAGTAAC TTTGAAGCTC CTGTTTATGA AAGTATCACG AAGGATTATA AGATTAATTT AACTTGGCAT CAGGTTACCA ATCAAGAGGC TAATAAAAAG TTAGCAGAGG TATTAGATTC TACCAAGGGT GTTACGACGA TATCGCCAAC TTGGTATCGT ATTAATTCGG CAGAAGGAAC ATTAGCCTCT TTGGCAAGTG AAAGCTATAT AGAAAAAGCT CATAGTATGG GAATTGAAGT TTGGGCCCTA GTGGATAATT TTGATCCTAC TGTTGATACG TTTGAAGTAT TATCAAAAAC TTCAAGCAGA GAGCGTTTGA TTAATGAATT GATAGCACAA GCCATAAAAT ATAACCTCGA TGGTATTAAT ATTGATTTTG AGAGTTTATC GGTAGAGACT GGCCCACATT ACATTCAATT TTTACGTGAA TTATCTGTCA AATGCCGAAG CAACCAGATT GTATTATCTT CCGATACTTA TGTTCCTGCA TCTTACTCTA AGTTCTATGA TAGACAAGAA CAGGGGGCAG TACTTGATTA TGTTATAATT ATGGCATATG ATGAGCATCA CAGTAAATCA GAAGAAGCTG GTTCTGTTGC ATCTATCGGA TTTCTACAAA AAGCAATCGA AGATACGCTA CTCCAAGTGC CAAAGGAAAA GCTTATTATG GGAATACCAT TTTACGCAAG ACTATGGAAG GAATATACGG AACTTGGTAA TCCAGCACTT GCTTCAGAGG CAGTTAGTAT GACGAGTGCT GAAAAAACGT TAGAAGCGAA CAAAGCAACG AAGAGCTGGG ACCAAACGAC AGGACAATAC TATGCTGAGT ATGAGAAAGA TGGTGCGAAG TATAAAATCT GGCTAGAGGA AGAGGAATCC ATCGAAGCTA AGTTGAAACT TATCTCTGAA GCTGATTTGG CGGGTGTTGC AAGTTGGAGA TTAGGATTTG AAAAACCTAG CATATGGAAT GTAATTCAGA AATATGTGAA TTAG
|
Protein sequence | MKKRTKLAFI GTIAVVVVVL AIVVSYIVEK NTPSKEVKAL SEFYQVPEGE AMVIMDDIVY ERNAKLLDGV LYMDLETIKV KFNQRFYWDA IENVLIYTTP TEIIKAEVGT KDYLVNKNKV SSNYPIVKMV NTEVYVALPF VAEYSDMRYK AYENPDLVVI QCKWGDYLFA DVENATQIRT GASIKSPILK ELNKGDRVLL INNGGNQQNG FLTVMTEEGI RGFVRKKNLS NSYYDKVTSN FEAPVYESIT KDYKINLTWH QVTNQEANKK LAEVLDSTKG VTTISPTWYR INSAEGTLAS LASESYIEKA HSMGIEVWAL VDNFDPTVDT FEVLSKTSSR ERLINELIAQ AIKYNLDGIN IDFESLSVET GPHYIQFLRE LSVKCRSNQI VLSSDTYVPA SYSKFYDRQE QGAVLDYVII MAYDEHHSKS EEAGSVASIG FLQKAIEDTL LQVPKEKLIM GIPFYARLWK EYTELGNPAL ASEAVSMTSA EKTLEANKAT KSWDQTTGQY YAEYEKDGAK YKIWLEEEES IEAKLKLISE ADLAGVASWR LGFEKPSIWN VIQKYVN
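The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of how that might be done, together with a GC-content calculation and a length consistency check between a CDS and its protein. All function names are hypothetical helpers, and the consistency check assumes an unspliced CDS with one terminal stop codon.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group residues into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def is_consistent(gene: str, protein: str) -> bool:
    """True if the CDS has three bases per residue plus one stop codon (assumed)."""
    return len(gene) == 3 * (len(protein) + 1)


# First two blocks of the gene sequence shown in the record.
fragment = clean_sequence("ATGAAAAAAA GAACAAAGCT")
print(len(fragment), gc_percent(fragment))  # prints: 20 25.0
```

Applying `clean_sequence` to the full gene and protein strings and passing the results to `is_consistent` is one way to confirm that the two fields describe the same feature.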