Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2167 |
Symbol | |
ID | 4810880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2577170 |
End bp | 2578312 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107570 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001038562 |
Protein GI | 125974652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAT TTTATTTGGG GCTTGTTGTC ATTGTTTCTT TTGTTCTCAT GCTCGGAGGC ATAATTTACA GCCGGAGATT TGGAAACGGA CTTTTAACTG ACGCATCCGG CATATTTTTG GACGGAGATT TCCAAAGAAA TAAGTACGGG GAAAAAAAAC ACATCCTTTT AAAATTTATA AAGAGCCGTC TGATGGATTC TGAAGGAGGA ATCACAACGA ACACCCACCC GAGTATGGGA GATTCAAACA CTTTGTCGGA GTCAACAGGC ATACTTATGG AATGCGCTCT TATAAACAAC GATAAGAATT TGTTTGACAT GGAATACCGG TATATTAAGG TAAATCTTGT AACTGAGAAC TTCTTTATAA AATGGAAAGA GGGAAATGAC GTTTTCTGCA ATGCTTCCAT AGATGATTTA AGGATTATTG GCGCGCTTCT TGAAGCATAT GAAAAGTGGG GGGAAAAAGA TTATAAAAAT TTTGCACTTT TGCTGCAGAA AAAAATATAT GATGTCCAGG TAAAAGACGG AAGTCTTTAT GAGTTTTTTG ACTGGAAGTA CAACATTCCA AAGACTTCCA CTCCTTTGTG TTATTTAAAT TTGAAAGTAA TAAAAAAATT GCAGAAATAC AATAAAGGCT GGAGAAAAGT ATATGATAGA AGTTTGAATA TCATTAAGAA AGGGAAGATT GAGACTTCAC CTTTATTTTA CAAATATTTT GACTACAACC TTGGCAGGTA TTCTTTTGAT GAAGAGTATC AGGAAAAGGG CGGAATATGC CTGGTGTATA CCCTTTACAC CGCTCTTTCC ATGGCAGAGG CCGGCATTTT CGATGATGAA CTCCTAAACT GGCTTCGAAA TGAGATGGAA AAAGGTAAAG TATATGCATG GTATAATCCG TACAGCCAAA AACCTGTGTC CGATATGGAA AGCACGGCAG TATATGCTTT GGGCTCGATT TTTGCCGGAC TTTCGGGGGA TGAAGTGCTT TCTCAAAAAC TTTTGGACAG AATGCTTGAG TTTATGGTGA CCGATGAAAA TTCCCAATAT TACGGAGGAT TTGGGAACAG TGAGACAGGA GAATTTTATT CCTTTGACAA TCTTATGGCA CTAAAAGCAC TGGCTTTGGC CGAGAAACCG TGA
|
Protein sequence | MKRFYLGLVV IVSFVLMLGG IIYSRRFGNG LLTDASGIFL DGDFQRNKYG EKKHILLKFI KSRLMDSEGG ITTNTHPSMG DSNTLSESTG ILMECALINN DKNLFDMEYR YIKVNLVTEN FFIKWKEGND VFCNASIDDL RIIGALLEAY EKWGEKDYKN FALLLQKKIY DVQVKDGSLY EFFDWKYNIP KTSTPLCYLN LKVIKKLQKY NKGWRKVYDR SLNIIKKGKI ETSPLFYKYF DYNLGRYSFD EEYQEKGGIC LVYTLYTALS MAEAGIFDDE LLNWLRNEME KGKVYAWYNP YSQKPVSDME STAVYALGSI FAGLSGDEVL SQKLLDRMLE FMVTDENSQY YGGFGNSETG EFYSFDNLMA LKALALAEKP
|
| |