Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0274 |
Symbol | |
ID | 4808557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 339019 |
End bp | 340710 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105686 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036706 |
Protein GI | 125972796 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00113767 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA TAGCTGTATT GTTAATTACT TTGACGTTCT TAGTCACGAC TCTGTTTTCA ATGTCATTTT CATCGGCAGC TGCAAGTCAT GATTATGCCA CCGCATTAAA ATACTCAATT TTATTTTACG ACGCCAACAA ATGCGGTCCG GATGCAGCAG TTGACAATGT CTTCAGTTGG AGAGGACCGT GTCACACGAC TGACGGAAGT GAAATAGGCC TGGATCTGAC CGGAGGATAT CATGATGCCG GAGACCATGT CAAATTCGGT TTGCCCCAGG CTTATGCGGC AGCGGTACTG GGTTGGTCAC TTTATGAATA CAAAGGAGTG TTTGACGCAA CAGGAAACAC CTCCAAAATG CTCAGCACTC TCAAATATTT CACCGACTAT CTTTTAAAAT GTCATCCGGA CTCCAATACC TTCTACTATC AAGTAGGAGA CGGTCAGGCA GACCATACAT ACTGGGGTGC GCCGGAAGTC CAGCCGGGTC CGAGACCCGT ACCCTATGTT GCCAATGCCT CAAATGCGGC TTCTGATGTA TGTGGTTTGA CATCCGCCGC TCTTACGATA ATGTACCTAA ACTACAAGGA TATAGACCAA AACTATGCCA ACAAATGTTT AAAAGCTGCA AAAGAACTTT ATACAATGGC AAAAACCAAT TTGGGCTACT ATGCCGAGAA TGCTTTTTAC ATCTCCCACT CCTACTGGGA CGATCTTTCC TTTGCAGCCA CCTGGCTCTA TGTAGTGGAA AAAGACCCGA CTTACCTGAA GGAAATTGAC AGCTATCTTT CAAATAAAAC CCTTTGGGGA GAGAGTCCTT TCAACAACAA ATGGACAATG TGCTGGGATG ACATGTACAT GGCTGTCTTC TGCAAACTTG CAGAGATAAC AGGTGAACAA AAGTACATTG ACGCAATGAA TTACAATCTC GATTACTGGA TGAATTCCCT TAATACAACT CCCGGAGGCC TTAAGTATCT TGACAGCTGG GGAGTATTAA GATATGCGGC CGCCGAGGCC TTTATCGCCA TGAGATACTA CGAGCTTACC AAAAATGAAG CATTAAAATC CTTTGCAAAA TCTCAAATAG ACTACATACT TGGCAGCAAT CCCATCAACA TGTCCTATGT TATAGGTTAT GGATCAAACT ACCCAAAATG TCCTCACCAC AGGGCAGCCA ACGGCTACAC TTACGCCAAC GGTGACAATG CAAAACCTGC CAAAAACCTT CTTTTAGGTG CTTTGGTGGG CGGTCCGAAT ATGTCTGACA ACTTTATCGA TGATGTCAAT CAGTTCCAGT TTACGGAAGT GGCTATTGAC TATAATGCTG CTTTCGTGGG CGCTCTGGCT GCTATTGAAA AATACTACGG CAATATCGTT ATACCCACTC CTCCAGCCAC TACACCCCCG TCTCCTACCG CAACGCCTTC CTTAATATGG TGTGATGTCG GAGACTTAAA CGTTGACGGT TCAATAAACT CAGTAGACAT TACATACATG AAAAGGTATC TTTTGCGCAG TATAAGTGTC CTTCCTTACC AGGAAAATGA AAGGATTCGC ATACCGGCGG CAGACACCAA CGGCGACGGT GCAATCAATT CCAGTGACAT GGTATTGCTA AAAAGATATG TCCTTCGCAG TATTAGCGAA TTTCCGGTTA AATATGATAT CTATGGAAAC ATCATAAATT AA
|
Protein sequence | MRKIAVLLIT LTFLVTTLFS MSFSSAAASH DYATALKYSI LFYDANKCGP DAAVDNVFSW RGPCHTTDGS EIGLDLTGGY HDAGDHVKFG LPQAYAAAVL GWSLYEYKGV FDATGNTSKM LSTLKYFTDY LLKCHPDSNT FYYQVGDGQA DHTYWGAPEV QPGPRPVPYV ANASNAASDV CGLTSAALTI MYLNYKDIDQ NYANKCLKAA KELYTMAKTN LGYYAENAFY ISHSYWDDLS FAATWLYVVE KDPTYLKEID SYLSNKTLWG ESPFNNKWTM CWDDMYMAVF CKLAEITGEQ KYIDAMNYNL DYWMNSLNTT PGGLKYLDSW GVLRYAAAEA FIAMRYYELT KNEALKSFAK SQIDYILGSN PINMSYVIGY GSNYPKCPHH RAANGYTYAN GDNAKPAKNL LLGALVGGPN MSDNFIDDVN QFQFTEVAID YNAAFVGALA AIEKYYGNIV IPTPPATTPP SPTATPSLIW CDVGDLNVDG SINSVDITYM KRYLLRSISV LPYQENERIR IPAADTNGDG AINSSDMVLL KRYVLRSISE FPVKYDIYGN IIN
|
| |