Gene Information |
Locus tag | Cphy_3239 |
Symbol | |
ID | 5742017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 3943604 |
End bp | 3945511 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641294339 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001560332 |
Protein GI | 160881364 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
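The length fields in this record follow from its coordinates. A minimal Python sketch of that arithmetic, assuming Start bp and End bp are inclusive (which matches the reported 1908 bp) and that the CDS is unspliced and ends in a single stop codon; the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive coordinate interval on the replicon."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp / End bp fields of this record.
length = gene_length(3943604, 3945511)
print(length, protein_length(length))  # prints "1908 635"
```

Both values agree with the Gene Length and Protein Length fields listed above.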
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTATAA ATATGAACAT AATATTAAAG GGATTTACAG ATGAGATAAA AGAAGGACTA AATCTACTAT TACAGGTGCA TTCGTTTGTA AATCAGGACC AGAGAGAAAT CGAGATAACC GTAGAAAAGA TATTAGAGGG GAATACTTCT CTTATTGTAT CGGGAAAGGT TGGTTGGCAC ATTCAATATA AGGAAACGGT ACATTTTTTC CGCGCCCTTG GTATCCTTAT TGAAAACATT GAGAATACAT CCTTTGAAAC GACGGAAACA GCACTTTTTG AAGGTTGCTC TAATATGATT GACCTTTCAA GAAACGCTGT CTACACTGTG GATGAAATGA AAAGAATGCT ATGTTACCTT GCACTTACTG GGCATAACAA ATGCTATTTA TATATGGAAG ATACATATGA GCTACCAGAC TATCCTTATT TCGGTTATTT AAGAGGTAGA TATTCCATCG CAGAGATGAA GGAAATCGAT GATTTTGCAT ATGCTTTAGG AATCGAGGCA ATCCCATGTA TTCAAACATT AGCACATTTG AAGACAACGT TAAAATGGAA TTATGCTTTA TCAATGAAAG ATACTGCGGA TATCCTACTT GTGGGAGAAG AAAAAACTTA TCAATTTATT GAGGCGATGT TTGTCTCCCT GAAGCAGACC TTCCGTTCTA GAACCATCCA TATAGGTATG GATGAGGCTA TGGATCTTGG TAGTGGTAAA TATCTAAGAG AAAATGGTTA CCGCGTTCAA TATGATTTAA TGACAGAACA TTTAGCTAAG GTCAATGAGA TTGCGATAAA ACATGGTATC AAGCCGTTGA TTTGGGATGA TATGTTTTAC CGTTCCTTAA ATAAAGATCA TGAGTATTAT GATACAACAA TTCCTCTTAC AGACGAGCAT ATAAAAAAGG TCCCTAGCAA TATTGGATTG GTATATTGGG ATTATTATCA TAATAATAAA GAGGATTATG AGACCTTACT TACGATGAGA GACCGTTTTC CAAATGATAT TATCTTTGCA GGAGGAATCT GGCGTTGGAT GGGTTATGTT CCAGGGTATA CAAAAACATT TGCAACAACA AATGCAGCAC TCGACCGATG CAAGCATCAT AAAGTAAAAG AAATTATGTC CACTTGTTGG GGCGATGACG GAGCGGAAAC ACCAATCGAA ACCATTATAC CAGGTCTAAT TCTTTTTGGT GAGCACGGTT ATGGACAGGA TACTTCTATG GATGCGATTA GTAGTAAGTG TAAATTCTTA ACTGGTGTTT CTTTATTTGA CTTTATGAAA ATTGAAGAAA TTGATATTAT TCCAGGATGT GAAGCACAAA ATATCAAGAC TAGAAATCCA TCCAAACATA TTTTATTTCA AGACTTGCTA CTAGGTGCCT TTGATACCTA TTTTGATAGA GAAGGACTTG AGGAACATTA TGCGAAAGTA AAAGAAGAGC TATATACGAT CTCAAAAACT GCAGGTAAGT TTGAACAACT TTTTGTTATG TATGCAAAGC TTGCAACGGT TCTAGAGAAA AAGGTGAAGC TTGGAATTAA AATAAGAAAG GCATATCAAT TAAAAGATAA AGATACATTA AAGACAATCT GTGAACAGAT ATTGCCGGTT CTAAAAGAAG ATGTGGAAAA CTTTAAAAAG GAATATACAA AGGTGTGGTT CAATGAAAGT AAGGGACATG GCTTTGAAGT TATTGATGTT AGACTTGGTG GTTTAATGAG CAGAATTGAT ACGGTAAAAT ATCGATTGGA AGACTATATA ACTGGTGATA TTTTAAAAGT TGAAGAGTTA 
GAGGAAACAA TTTTACCATA TGAACTTGGC GGATATCCAG AAGGACCGTA TTTAGCTTAT AACAAATACA AAGATATTGT AACTCAGAAT CTGCTTTCTC ATCACTAA
|
Protein sequence | MVINMNIILK GFTDEIKEGL NLLLQVHSFV NQDQREIEIT VEKILEGNTS LIVSGKVGWH IQYKETVHFF RALGILIENI ENTSFETTET ALFEGCSNMI DLSRNAVYTV DEMKRMLCYL ALTGHNKCYL YMEDTYELPD YPYFGYLRGR YSIAEMKEID DFAYALGIEA IPCIQTLAHL KTTLKWNYAL SMKDTADILL VGEEKTYQFI EAMFVSLKQT FRSRTIHIGM DEAMDLGSGK YLRENGYRVQ YDLMTEHLAK VNEIAIKHGI KPLIWDDMFY RSLNKDHEYY DTTIPLTDEH IKKVPSNIGL VYWDYYHNNK EDYETLLTMR DRFPNDIIFA GGIWRWMGYV PGYTKTFATT NAALDRCKHH KVKEIMSTCW GDDGAETPIE TIIPGLILFG EHGYGQDTSM DAISSKCKFL TGVSLFDFMK IEEIDIIPGC EAQNIKTRNP SKHILFQDLL LGAFDTYFDR EGLEEHYAKV KEELYTISKT AGKFEQLFVM YAKLATVLEK KVKLGIKIRK AYQLKDKDTL KTICEQILPV LKEDVENFKK EYTKVWFNES KGHGFEVIDV RLGGLMSRID TVKYRLEDYI TGDILKVEEL EETILPYELG GYPEGPYLAY NKYKDIVTQN LLSHH
|
| |
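The sequences above are displayed in space-separated blocks of 10 residues, so they need to be cleaned before any computation. A minimal Python sketch, assuming plain A/C/G/T input with no ambiguity codes; the function names `clean_sequence` and `gc_percent` are hypothetical helpers introduced for illustration:

```python
def clean_sequence(grouped: str) -> str:
    """Drop the spaces and line breaks used to display blocks of 10 residues."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence above, used only as a short demo input.
demo = clean_sequence("ATGGTTATAA ATATGAACAT AATATTAAAG")
print(len(demo), round(gc_percent(demo), 1))
```

Running `gc_percent` on the full cleaned gene sequence gives a value that can be compared against the GC content field of this record, and `len(clean_sequence(...))` can be checked against the Gene Length field.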