Gene Information |
Locus tag | Cphy_3571 |
Symbol | |
ID | 5742975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 4409134 |
End bp | 4411029 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641294682 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001560659 |
Protein GI | 160881691 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
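The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4409134
END_BP = 4411029
REPORTED_GENE_LENGTH = 1896      # bp
REPORTED_PROTEIN_LENGTH = 631    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1896 631"
```

Both computed values agree with the Gene Length and Protein Length fields reported in the record.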
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.475227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCCGT TTCATGTATT GGCAGATCAG GCATGGCAAT ATGAACTTAT TGAGGAATTG AAAAAATCCT TACCATTTAC CCTTTCAGAA GATGGCTTAC CTGTTGAGGT AAAGCATCAA GGTATTGGAA TTCAAGTAAC AAAGAAGGAT AATGTAAGAC AAATTACAGT GGGAGATAAA AACCAGTTAG CAAGAGCATG GTACCTCATG CTTCGGAATG AATCTCTTAA TGAATACGAT ATCAAGGAAC AATGTAGCTT TGGAGATTTG GGTGTGATGC TAGATTGTTC TAGAAATGCA GTGATGAAAC TGCCTACCCT ATTAGAATAC ATAAGGCAGT TAGCAATATT TGGATATCAT AGTTTACAAC TATATACTGA GGATACGATT AGAATTGAGG AAGAACCGTA CTTTGGTTAT ATGCGTGGAG CTTATACAAA AGAAGAAATA AAAAAGGTTG ATGCTTACTG CGCAAAATTC GGCATAGAAT TAGTGCCATG CGTACAAACA TTAGCTCACA TTAATCAGAT TACAAGATAT GAGCGTTACC AAGACATCAT CGATGTGAAT GATATTTTCT TAGTAGGAAA TGAAAAAACA TATCAATTAA TTGAACGTAT CATATCGACT GCATCCGAGT GTTTTACTTC AAGAAGAATC AACATCGGTA TGGATGAAGC TCATATGCTT GGTTTAGGAA AATACCTAGA TCAGAACGGT TATCAGAACC GTTTCACCAT TATGGTTGAG CATTTAAAGA GAGTACAAGA AATTTTGCAT CGTTATGGAT TTACTGCAAT GATGTGGAGT GATATGTTTT TTAGATTGCT TGCAAACGGT GAATATTATT CCCTTAAGGA AGAGCAGTTA AAAAGTGATT TATTACAACA TGTACCGAAG GATATCGAGT TAGTTTACTG GGATTATTAT TCAAGAGACT ATCATCATTA CGAACAGAAC TTAGTGACAC ATTTTAGAAT TTCAGACCGC ATAGGTTTCG CTGCCGGTGC ATGGAAATGG ACTGGATTTG CACCAGAGAA TAGCTTTAGC CAGGTGGCTG GTAAAGAAGC AATGAAGGCT TGTGTAGAAA AGGGAGTAAA TACTTTCTTA GTAACCTGTT GGGGAGATAA CGGCGCAGAA GCAAGTGCCC TCTCGGTGTT ACCAACCCTG TTTTATTATG CGGAATTAGC TTATCAGTAT GAAAGCCTTG TAGATAAAGA GTTTACCAAA GAGAAAGATT ATTCCGAGTA TTTTAAAGTT GCAACAGGAA TTTCTTTTGC AGAATTTATG CTACTGGATA GTCCAAATGA AGTATTTGCA GAAACTACTT ATACACACAG TAATGCTTGT AAGTTCTTAT TATACAACGA TGTGTTAATC GGAACATTTG ATTCCATTGT GAAGAGTGAA ACAAGGATAG CTTACGAAGA CAAGAAGAAT CAATTAGCTG CAGTAGCGAA GGCGGGTACA AGATATTCCT ATCTATTTCA GACCCTATCA AGTCTATGTT CCCTATTAGA GGAGAAAGCA GACTTGGGAG TAGAGATAAA AAATGCATAT GATAAAAAGG ATTTTGATAG GTTACGTGAA ATTGCTGAAT CAAAAATTCC CGAAGTATTG AAGAGACTCG ACCAGTTTAT ACGAGATTTT CGGTATCAAT GGCATAAGGA GAACAAATCC TTTGGCTTTG AAATACAATT GATACGTCTT GGCGGATTAA AGGAAAGACT TTACTACGCA AAGGAACAAA TTTTACTATG GACAGAAGGA GATATTGAAC GGATCGACGA ATTAGAAGAA 
GAGAGACTTC CATTTGCTTA TTTTGAACAA GAGGATGGAA GTAGACTAAA TTATAATTTG TGGAACGTTA TTGTATCACC GGCTGTTATG GGATGA
|
Protein sequence | MIPFHVLADQ AWQYELIEEL KKSLPFTLSE DGLPVEVKHQ GIGIQVTKKD NVRQITVGDK NQLARAWYLM LRNESLNEYD IKEQCSFGDL GVMLDCSRNA VMKLPTLLEY IRQLAIFGYH SLQLYTEDTI RIEEEPYFGY MRGAYTKEEI KKVDAYCAKF GIELVPCVQT LAHINQITRY ERYQDIIDVN DIFLVGNEKT YQLIERIIST ASECFTSRRI NIGMDEAHML GLGKYLDQNG YQNRFTIMVE HLKRVQEILH RYGFTAMMWS DMFFRLLANG EYYSLKEEQL KSDLLQHVPK DIELVYWDYY SRDYHHYEQN LVTHFRISDR IGFAAGAWKW TGFAPENSFS QVAGKEAMKA CVEKGVNTFL VTCWGDNGAE ASALSVLPTL FYYAELAYQY ESLVDKEFTK EKDYSEYFKV ATGISFAEFM LLDSPNEVFA ETTYTHSNAC KFLLYNDVLI GTFDSIVKSE TRIAYEDKKN QLAAVAKAGT RYSYLFQTLS SLCSLLEEKA DLGVEIKNAY DKKDFDRLRE IAESKIPEVL KRLDQFIRDF RYQWHKENKS FGFEIQLIRL GGLKERLYYA KEQILLWTEG DIERIDELEE ERLPFAYFEQ EDGSRLNYNL WNVIVSPAVM G
|
| |
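The relationship between the gene sequence and the protein sequence can be spot-checked by translating excerpts of the gene. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is shown 5' to 3' on the coding strand, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code for internal codons. The `translate` helper is hypothetical, and only the first 30 and last 15 nucleotides of the record are used.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Excerpts copied from the Gene sequence field above.
first_30_nt = "ATGATTCCGT TTCATGTATT GGCAGATCAG"
last_15_nt = "GCTGTTATG GGATGA"

print(translate(first_30_nt))  # prints "MIPFHVLADQ"
print(translate(last_15_nt))   # prints "AVMG*"
```

The two results match the first ten residues and the last four residues of the Protein sequence field, with the trailing `*` corresponding to the TGA stop codon.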