Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2089 |
Symbol | |
ID | 4810949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2482535 |
End bp | 2484760 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107496 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001038489 |
Protein GI | 125974579 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000609917 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAAAA GCAGAAAGAT TTCTATTCTG TTGGCAGTTG CAATGCTGGT ATCCATAATG ATACCCACAA CTGCATTCGC AGGTCCTACA AAGGCACCTA CAAAAGATGG GACATCTTAT AAGGATCTTT TCCTTGAACT CTACGGAAAA ATTAAAGATC CTAAGAACGG ATATTTCAGC CCAGACGAGG GAATTCCTTA TCACTCAATT GAAACATTGA TCGTTGAAGC GCCGGACTAC GGTCACGTTA CTACCAGTGA GGCTTTCAGC TATTATGTAT GGCTTGAAGC AATGTATGGA AATCTCACAG GCAACTGGTC CGGAGTAGAA ACAGCATGGA AAGTTATGGA GGATTGGATA ATTCCTGACA GCACAGAGCA GCCGGGTATG TCTTCTTACA ATCCAAACAG CCCTGCCACA TATGCTGACG AATATGAGGA TCCTTCATAC TATCCTTCAG AGTTGAAGTT TGATACCGTA AGAGTTGGAT CCGACCCTGT ACACAACGAC CTTGTATCCG CATACGGTCC TAACATGTAC CTCATGCACT GGTTGATGGA CGTTGACAAC TGGTACGGTT TTGGTACAGG AACACGGGCA ACATTCATAA ACACCTTCCA AAGAGGTGAA CAGGAATCCA CATGGGAAAC CATTCCTCAT CCGTCAATAG AAGAGTTCAA ATACGGCGGA CCGAACGGAT TCCTTGATTT GTTTACAAAG GACAGATCAT ATGCAAAACA GTGGCGTTAT ACAAACGCTC CTGACGCAGA AGGCCGTGCT ATACAGGCTG TTTACTGGGC AAACAAATGG GCAAAGGAGC AGGGTAAAGG TTCTGCCGTT GCTTCCGTTG TATCCAAGGC TGCAAAGATG GGTGACTTCT TGAGAAACGA CATGTTCGAC AAATACTTCA TGAAGATCGG TGCACAGGAC AAGACTCCTG CTACCGGTTA TGACAGTGCA CACTACCTTA TGGCCTGGTA TACTGCATGG GGTGGTGGAA TTGGTGCATC CTGGGCATGG AAGATCGGAT GCAGCCACGC ACACTTCGGA TATCAGAACC CATTCCAGGG ATGGGTAAGT GCAACACAGA GCGACTTTGC TCCTAAATCA TCCAACGGTA AGAGAGACTG GACAACAAGC TACAAGAGAC AGCTTGAATT CTATCAGTGG TTGCAGTCGG CTGAAGGTGG TATTGCCGGT GGAGCAACCA ACTCCTGGAA CGGTAGATAT GAGAAATATC CTGCTGGTAC GTCAACGTTC TATGGTATGG CATATGTTCC GCATCCTGTA TACGCTGACC CGGGTAGTAA CCAGTGGTTC GGATTCCAGG CATGGTCAAT GCAGCGTGTA ATGGAGTACT ACCTCGAAAC AGGAGATTCA TCAGTTAAGA ATTTGATTAA GAAGTGGGTC GACTGGGTAA TGAGCGAAAT TAAGCTCTAT GACGATGGAA CATTTGCAAT TCCTAGCGAC CTCGAGTGGT CAGGTCAGCC TGATACATGG ACCGGAACAT ACACAGGCAA CCCGAACCTC CATGTAAGAG TAACTTCTTA CGGTACTGAC CTTGGTGTTG CAGGTTCACT TGCAAATGCT CTTGCAACTT ATGCCGCAGC TACAGAAAGA TGGGAAGGAA AACTTGATAC AAAAGCAAGA GACATGGCTG CTGAACTGGT TAACCGTGCA TGGTACAACT TCTACTGCTC TGAAGGAAAA GGTGTTGTTA CTGAGGAAGC ACGTGCTGAC TACAAACGTT TCTTTGAGCA GGAAGTATAC GTTCCGGCAG GTTGGAGCGG TACTATGCCG AACGGTGACA AGATTCAGCC TGGTATTAAG TTCATAGACA TCCGTACAAA ATATAGACAA GATCCTTACT ACGATATAGT ATATCAGGCA TACTTGAGAG GCGAAGCTCC TGTATTGAAT TATCACCGCT TCTGGCATGA AGTTGACCTT GCAGTTGCAA TGGGTGTATT GGCTACATAC TTCCCGGATA TGACATATAA AGTACCTGGT ACTCCTTCTA CTAAATTATA CGGCGACGTC AATGATGACG GAAAAGTTAA CTCAACTGAC GCTGTAGCAT TGAAGAGATA TGTTTTGAGA TCAGGTATAA GCATCAACAC TGACAATGCC GATTTGAATG AAGACGGCAG AGTTAATTCA ACTGACTTAG GAATTTTGAA GAGATATATT CTCAAAGAAA TAGATACATT GCCGTACAAG AACTAA
|
Protein sequence | MVKSRKISIL LAVAMLVSIM IPTTAFAGPT KAPTKDGTSY KDLFLELYGK IKDPKNGYFS PDEGIPYHSI ETLIVEAPDY GHVTTSEAFS YYVWLEAMYG NLTGNWSGVE TAWKVMEDWI IPDSTEQPGM SSYNPNSPAT YADEYEDPSY YPSELKFDTV RVGSDPVHND LVSAYGPNMY LMHWLMDVDN WYGFGTGTRA TFINTFQRGE QESTWETIPH PSIEEFKYGG PNGFLDLFTK DRSYAKQWRY TNAPDAEGRA IQAVYWANKW AKEQGKGSAV ASVVSKAAKM GDFLRNDMFD KYFMKIGAQD KTPATGYDSA HYLMAWYTAW GGGIGASWAW KIGCSHAHFG YQNPFQGWVS ATQSDFAPKS SNGKRDWTTS YKRQLEFYQW LQSAEGGIAG GATNSWNGRY EKYPAGTSTF YGMAYVPHPV YADPGSNQWF GFQAWSMQRV MEYYLETGDS SVKNLIKKWV DWVMSEIKLY DDGTFAIPSD LEWSGQPDTW TGTYTGNPNL HVRVTSYGTD LGVAGSLANA LATYAAATER WEGKLDTKAR DMAAELVNRA WYNFYCSEGK GVVTEEARAD YKRFFEQEVY VPAGWSGTMP NGDKIQPGIK FIDIRTKYRQ DPYYDIVYQA YLRGEAPVLN YHRFWHEVDL AVAMGVLATY FPDMTYKVPG TPSTKLYGDV NDDGKVNSTD AVALKRYVLR SGISINTDNA DLNEDGRVNS TDLGILKRYI LKEIDTLPYK N
|
| |