Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1853 |
Symbol | |
ID | 7408966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1929397 |
End bp | 1931898 |
Gene Length | 2502 bp |
Protein Length | 833 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643716225 |
Product | Cellulose 1,4-beta-cellobiosidase |
Protein accession | YP_002573714 |
Protein GI | 222529832 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0781355 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAACTA ATAAAAAGCT TAGGGAGTAT GTATATTTTG TATCGATTAT TTTTTTCATG ACTATTATAT TGTTAAATGG ACAGATAAAT GTGAGCAAAA GTAATTTAGC AATGGCGGCA ACAGGAAGTC AAATTGAGAA GTTAAATAGA GGATTAATTG CAATAAAAGT AACCAACGGA GTATTTTTGA GTTGGAGAAT GTTTGGTTCA GATTTAGCGA ATATAGGGTT TAATATTTAC AGAAATGGAG TGAAGATAAC TAATACGCCG ATTCAGAACA GCACAAACTA TGTTGATACT GCTGGGACAG CTGCTTCGAA GTATTATGTA AAAGCAGTGA TAAATGGTGT AGAGGTAGAG CAATCTGAAG AAGTAAGTGT GCTTAGTAGC AATTATATTG AAATAAGATT AAATAAACCA GCGAATTCTC CACTCGGTGC TTCATATTCG CCAAATGATG CAAGTGTTGG CGATTTAGAT GGTGATGGGG AATATGAGAT TGTTCTGAAA TGGGATCCAA GTGATTCAAA GGATAACTCA CAATCTGGAT ACACAAGCAA TGTGTATTTA GACGCTTACA AATTAAATGG CAAGTTTTTA TGGAGAATTG ATTTGGGTAG GAATATAAGA GCAGGAGCAC ATTATACACA ATTTATAGTA TATGATTTAG ATGGTGACGG AAAAGCTGAA GTTGCATGTA AAACAGCTGA TGGAACAATA GATGGACAAG GGAATGTGAT AGGTGATCCA AATGCTGATT GGAGGAATTC CTCTGGTTAT ATTTTATCAG GGCCTGAATA TTTGACTATA TTTGAAGGTG CGACAGGACG AGCAATAAAG ACGGTTAATT ATATTCCGCC ACGAGGGAAT GTTTCATCAT GGGGTGATTC TTATGGAAAC AGGGTTGACA GGTTTTTAGC AGCAGTAGCT TATTTGGATG GGAATAGACC TAGCTTAATT ATGTGCCGAG GATATTACAC AAAAACATAT ATAGTTGCTT GGAATTGGCG AAATGGTGAG TTAACAAAGT TATGGCAATT TGACACAGGA GAGATTAGAG ATGGATATAG AGATGATTAC GAAGGACAAG GAAATCACAA TTTGAGTGTG GCTGATGTTG ACAATGATGG TAAAGATGAG ATAATATATG GTGCTATGGT AGTAGATGAT AATGGAGCAC CATTATATTC AACTAAATTA GGTCATGGTG ATGCAATGCA TGTGACAGAT ATTGATCCAG ACAGGCCAGG ATTAGAAGTT TGGCAGTGTC ATGAAGGAAG TACAGGAGCG AGTTTAAGGG ATGCACGAAC TGGACAGATA TTAGTGAGAG TTTTAACATC TGGAGATAGT GGACGTGCTT TGACGGCAGA TATTAACCCG CGATATAGAG GATTAGAAAT GTGGGCGGCA GGTGGAATAA GTGTTAGAGA TTGTAGAGGT AATGTAATCA GTAATGCGAC ACCACCAATT AATTTTGCAA TATGGTGGGA TGGAGATTTA GGTAGAGAAT TGTTGGATAA TGTATATATT TATAAATGGG ATTATAACAA TAATAGGAGT AATACTATAT TCACAGCAAG TGGATGTTCA TCAAACAATG GCACAAAAGC AACACCTTGC TTGAGTGCAG ATATACTTGG GGATTGGCGA GAAGAGGTCA TATTTAGGAC TAGTGATAAC AATGCAATCA GGATATATAC AACCACCACA TTGACAGATT ATAAGATACC TACGCTTATG CATAACAGGC AATATAGGGT GTCTATAGCA TGGCAGAACG TTGCATATAA TCAACCACCT CACGTAAGTT TTTATTTAGG GTATGAGACT AATGTGAATA ACATATATCA ATATTTTGAA GGTTATGGGC AACAACCAAT TGTTACACCG TCGCTAACCC CGACAAAAAC ACCAACGCCT ACATCAACTC CATTGCCAAC ATCAACTGCA ACATCTACGC CAACTCCAAC AGCAACAGCA ACACCAACAC CGACACCAAC AGCAACACCA ACACCGACGC CGAGCAGCAC ACCTGTAGCA GGTGGACAGA TAAAGGTATT GTATGCTAAC AAGGAGACAA ATAGCACAAC AAACACGATA AGGCCATGGT TGAAGGTAGT GAACACTGGA AGTAGCAGCA TAGATTTGAG CAGGGTAACG ATAAGGTACT GGTACACGGT AGATGGGGAC AAGGCACAGA GTGCGATATC AGACTGGGCA CAGATAGGAG CAAGCAATGT GACATTCAAG TTTGTGAAGC TGAGCAGTAG CGTAAGTGGA GCGGACTATT ATTTAGAGAT AGGATTTAAG AGTGGAGCTG GGCAGTTGCA GGCTGGTAAA GACACAGGGG AGATACAGAT AAGGTTCAAC AAGAGTGATT GGAGCAATTA CAATCAAGGG AATGACTGGT CATGGATGCA GAGCATGACG AATTATGGAG AGAATGTGAA GGTAACAGCG TATGTAGATG GGGTGCTGGT ATGGGGGCAA GAACCAAAAT AA
|
Protein sequence | MLTNKKLREY VYFVSIIFFM TIILLNGQIN VSKSNLAMAA TGSQIEKLNR GLIAIKVTNG VFLSWRMFGS DLANIGFNIY RNGVKITNTP IQNSTNYVDT AGTAASKYYV KAVINGVEVE QSEEVSVLSS NYIEIRLNKP ANSPLGASYS PNDASVGDLD GDGEYEIVLK WDPSDSKDNS QSGYTSNVYL DAYKLNGKFL WRIDLGRNIR AGAHYTQFIV YDLDGDGKAE VACKTADGTI DGQGNVIGDP NADWRNSSGY ILSGPEYLTI FEGATGRAIK TVNYIPPRGN VSSWGDSYGN RVDRFLAAVA YLDGNRPSLI MCRGYYTKTY IVAWNWRNGE LTKLWQFDTG EIRDGYRDDY EGQGNHNLSV ADVDNDGKDE IIYGAMVVDD NGAPLYSTKL GHGDAMHVTD IDPDRPGLEV WQCHEGSTGA SLRDARTGQI LVRVLTSGDS GRALTADINP RYRGLEMWAA GGISVRDCRG NVISNATPPI NFAIWWDGDL GRELLDNVYI YKWDYNNNRS NTIFTASGCS SNNGTKATPC LSADILGDWR EEVIFRTSDN NAIRIYTTTT LTDYKIPTLM HNRQYRVSIA WQNVAYNQPP HVSFYLGYET NVNNIYQYFE GYGQQPIVTP SLTPTKTPTP TSTPLPTSTA TSTPTPTATA TPTPTPTATP TPTPSSTPVA GGQIKVLYAN KETNSTTNTI RPWLKVVNTG SSSIDLSRVT IRYWYTVDGD KAQSAISDWA QIGASNVTFK FVKLSSSVSG ADYYLEIGFK SGAGQLQAGK DTGEIQIRFN KSDWSNYNQG NDWSWMQSMT NYGENVKVTA YVDGVLVWGQ EPK
|
| |