Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2812 |
Symbol | |
ID | 4809649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3319096 |
End bp | 3320931 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108232 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001039204 |
Protein GI | 125975294 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.283645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA ATCTTTTTGC AAAAAGAGCG GTGGCTTTCC TGCTCGGTAT TGTGATTACG GCTGCAGGGA TTGTCTCTTT CAACACCGTA AGCACCAGTG CCGCCGGAGA ATACAATTAT GCAAAGGCGC TGCAGTATTC CATGTTCTTC TATGATGCGA ACATGTGCGG TACAGGTGTT GACGAGAACA GCCTTTTGTC ATGGAGAGGA GACTGCCACG TATATGATGC AAGACTTCCT CTGGATTCCC AGAACACCAA CATGTCCGAT GGTTTTATAA GCAGCAACAG AAGTGTGCTT GACCCTGACG GAGACGGCAA AGTTGACGTG TCAGGCGGTT TTCATGACGC CGGCGACCAT GTGAAGTTTG GTTTGCCTGA GGCTTATGCC GCTTCAACAG TGGGTTGGGG TTACTATGAA TTTAAAGACC AGTTCCGTGC AACGGGACAG GCCGTCCATG CTGAAGTAAT TTTAAGATAC TTCAATGACT ATTTTATGAG ATGTACTTTC AGAGACGCTT CCGGAAATGT TGTGGCGTTC TGTCATCAGG TGGGCGACGG AGATATCGAC CATGCATTTT GGGGTGCTCC GGAAAATGAC ACCATGTTCA GAAGAGGTTG GTTTATTACC AAAGAAAAGC CTGGAACTGA CATTATTTCG GCAACAGCAG CTTCTTTAGC AATAAACTAC ATGAATTTTA AAGACACAGA CCCTCAATAT GCGGCAAAAA GCCTTGATTA TGCAAAAGCT TTGTTTGATT TTGCGGAGAA AAATCCAAAA GGGGTAGTTC AGGGAGAGGA CGGACCAAAA GGTTATTATG GTTCAAGCAA ATGGCAGGAT GACTACTGCT GGGCTGCCGC ATGGCTTTAT TTGGCAACGC AGAATGAGCA CTATTTGGAT GAAGCATTTA AATATTATGA TTATTATGCT CCGCCGGGAT GGATACATTG CTGGAATGAC GTGTGGTCGG GAACCGCATG TATTTTGGCG GAAATAAATG ATTTGTACGA CAAGGACAGC CAGAATTTCG AAGACAGGTA TAAAAGAGCT TCCAATAAGA ATCAGTGGGA GCAGATAGAC TTCTGGAAAC CCATACAAGA TTTGCTTGAC AAGTGGTCGG GTGGCGGTAT TACAGTTACA CCGGGCGGAT ACGTTTTCCT CAATCAGTGG GGTTCTGCAA GATACAATAC TGCCGCTCAG CTGATAGCTC TTGTTTATGA CAAGCATCAT GGTGACACAC CGTCAAAATA TGCTAACTGG GCACGGTCGC AGATGGATTA TCTGTTGGGT AAAAACCCGT TGAATCGCTG CTATGTTGTA GGCTACAGCA GCAATTCGGT CAAATACCCG CACCACAGAG CGGCTTCCGG ACTGAAAGAT GCCAATGATT CTTCTCCGCA CAAATATGTG TTGTATGGTG CCCTGGTCGG AGGGCCGGAT GCAAGTGACC AGCATGTGGA TAGAACAAAT GATTATATTT ACAATGAGGT TGCCATTGAC TATAATGCCG CTTTTGTGGG AGCATGTGCA GGTCTTTACA GATTCTTCGG GGATTCTTCA ATGCAGATAG ACCCGTCAAT GCCGTCGCAT AACGTACCTG TACCACCGAC ACCCACACCT CCTGATACGC AAATTGTATA TGGAGATTTG AACGGCGACC AGAAAGTGAC TTCCACAGAC TATACGATGC TCAAGAGGTA TTTGATGAAA AGCATTGATA GGTTTAATAC TTCCGAACAA GCTGCGGATT TGAACAGAGA CGGCAAAATC AATTCCACGG ACTTGACAAT ATTGAAAAGA TATTTGCTTT ACAGCATACC GTCTCTCCCT ATATAA
|
Protein sequence | MRKNLFAKRA VAFLLGIVIT AAGIVSFNTV STSAAGEYNY AKALQYSMFF YDANMCGTGV DENSLLSWRG DCHVYDARLP LDSQNTNMSD GFISSNRSVL DPDGDGKVDV SGGFHDAGDH VKFGLPEAYA ASTVGWGYYE FKDQFRATGQ AVHAEVILRY FNDYFMRCTF RDASGNVVAF CHQVGDGDID HAFWGAPEND TMFRRGWFIT KEKPGTDIIS ATAASLAINY MNFKDTDPQY AAKSLDYAKA LFDFAEKNPK GVVQGEDGPK GYYGSSKWQD DYCWAAAWLY LATQNEHYLD EAFKYYDYYA PPGWIHCWND VWSGTACILA EINDLYDKDS QNFEDRYKRA SNKNQWEQID FWKPIQDLLD KWSGGGITVT PGGYVFLNQW GSARYNTAAQ LIALVYDKHH GDTPSKYANW ARSQMDYLLG KNPLNRCYVV GYSSNSVKYP HHRAASGLKD ANDSSPHKYV LYGALVGGPD ASDQHVDRTN DYIYNEVAID YNAAFVGACA GLYRFFGDSS MQIDPSMPSH NVPVPPTPTP PDTQIVYGDL NGDQKVTSTD YTMLKRYLMK SIDRFNTSEQ AADLNRDGKI NSTDLTILKR YLLYSIPSLP I
|
| |