Gene Information |
Locus tag | Cthe_0760 |
Symbol | |
ID | 4810378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 925091 |
End bp | 926725 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640106177 |
Product | hypothetical protein |
Protein accession | YP_001037188 |
Protein GI | 125973278 |
COG category | |
COG ID | |
TIGRFAM ID | |
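
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The stop codon occupies one codon but adds no residue to the protein.
    return codons - 1 if includes_stop else codons


length = gene_length(925091, 926725)
print(length)                  # 1635
print(protein_length(length))  # 544
```

Both results match the Gene Length (1635 bp) and Protein Length (544 aa) fields listed in the record.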
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGAGAATAG AAAGCTCGGA AAATGTATTA ATTCAACAGA CCGGAGCTTT GGACATACTT TCGCGGCTTG ATACGGGAGA TACCTTGAGA GCAAGGGTAG TTGACATTAC TGCAAATGAG CTGCTCCTAA AGCTGTTTGA CGGAACATTG ATAAATGCCG GTACTATGAC TCCCATAGAT GCCAAAAAGG GAGAGTTGCT TGATTTTATT GTAAAAAACA AAGTGAACAA TCAATTGTTT TTGGAAATCA TGAAGGATGG CGTTCAAAAT GCCGCTCAGC CCAATGTTGA GGACGAAATT AAAAACAAGC TTGCACAGCT TGGCATAAAG CCTGACAGAA GAAATATGGA GACGGCGGCT GAACTTCGTG CCAATGGAAC TCAGCTTAAT GCTGAAAACA TTACCAAAGT TGTTGATGCG GTGCTTAGGT TTAAAAACCT TGGCATTGCA AAAGCAGCCT ATCTTGTTTC AAACAATATT ATTCCCGAAG AAAAAAGTAT AACAAGTCTA AATAGATTTG TCGAGGGCAG AGTGAGATTG AGTTCTGAGC TTTTAGATCT GGCTTCAAGT CTTGCAGAGA TTCAGGATAA AGATGTGGCT TTTGCCATCT TAAAAAAGCT TAATGCCATG GATTATCCTT TTCAAAAGAG CGGAGAAACT TCTGATGTCA GCACTTCCCC TATTCATTAT GAAAACGGAA AAATTTATAA CAGCCAAAAG AATTGGCATA TTGAAGGGAA TGTGGCTCAA AAAAGTGAAA AGGAGCATTG GAGCCGTAAG CCTGAATTGT CCCGAAATGG CAATATAGGT ACAGAAAATT TGGATAAAAC AGTGAAAAAT GTTGACATAG AGGGAAATGA GATATTACAA TTAAAGAAAA AAACTGCTGA TTTTGTTAAT TACAAAGAAA ACATAGAGAA ATCTGAAAAA GAGCTTGCGG ACTTTTTAAA TGTGATTCTG GCTTCACAAG GAAGCAGAAG CGCCGGCAAA GGCCGTTCTG AAACTAATAA TGTGGCTCAA GTTATAAAAA AATCCTTTGA AAAGATGTTT GCAAAAATTA ATGAGGAAGT TGAAGGCAGA GATATTAATG TAAAGGAATT TTACAGAGAT ATTTATAAAA AGCTTGAAAT AGTCCGCAAA GTTTTAGAAG AGACCGACAT CCCAGGTAAA CAAGAGATTT TAAACAAAGT TGACAACATT AAGAGCGATA TAAATTTTTT AAATGAGTTA AACAAACATA CCGTATATTT TCAGATACCT CTGAAAATAT TTGACAAGAA TACTAACGGT GAGCTTTATA TATTGAAAAG AAACAACGGA AGAAAAAGAA TTGATCCGCA AAATGCCACT GTGTTTTTGT CATTGGATAC GGAAAATTTG GGACAAGTGG ACTCTCTTAT CAGTGTAAAC AAAAAGAATG TCAGCCTCAA TTTTAGGCTT GAAAAAAATG AAATCATAGA TTATATCAAA GAAAACTATA TTCAGCTTTA TGAAGGATTG GCCAAAAAAG GCTATAAACT TGTGGATATC AAATACAGGC TTATAGATGA AAAGGTAAAT CTGTTAAATG CCCGGGAGGT TTTGGAAAAA GAAATAGAAA GAACAAGAAA CAGAGGGTTT GACTGCAAAA TTTGA
Protein sequence | MRIESSENVL IQQTGALDIL SRLDTGDTLR ARVVDITANE LLLKLFDGTL INAGTMTPID AKKGELLDFI VKNKVNNQLF LEIMKDGVQN AAQPNVEDEI KNKLAQLGIK PDRRNMETAA ELRANGTQLN AENITKVVDA VLRFKNLGIA KAAYLVSNNI IPEEKSITSL NRFVEGRVRL SSELLDLASS LAEIQDKDVA FAILKKLNAM DYPFQKSGET SDVSTSPIHY ENGKIYNSQK NWHIEGNVAQ KSEKEHWSRK PELSRNGNIG TENLDKTVKN VDIEGNEILQ LKKKTADFVN YKENIEKSEK ELADFLNVIL ASQGSRSAGK GRSETNNVAQ VIKKSFEKMF AKINEEVEGR DINVKEFYRD IYKKLEIVRK VLEETDIPGK QEILNKVDNI KSDINFLNEL NKHTVYFQIP LKIFDKNTNG ELYILKRNNG RKRIDPQNAT VFLSLDTENL GQVDSLISVN KKNVSLNFRL EKNEIIDYIK ENYIQLYEGL AKKGYKLVDI KYRLIDEKVN LLNAREVLEK EIERTRNRGF DCKI
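
The gene sequence as listed begins with GTG and ends with TGA, so it reads as the coding strand even though the gene lies on the minus strand of the replicon. The sketch below shows one way to strip the 10-base grouping, compute GC content, and translate the CDS under translation table 11. It is a minimal illustration, not the pipeline that produced this record: the helper names (`ungroup`, `gc_percent`, `translate_cds`) are hypothetical, and the codon table and start-codon set are written out by hand from the NCBI bacterial code (table 11), in which an initiating GTG is read as methionine.

```python
from itertools import product

# Amino acids for all 64 codons in T, C, A, G order (same assignments in
# NCBI tables 1 and 11); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}
# Alternative initiation codons of table 11 (written out here as an assumption).
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def ungroup(seq: str) -> str:
    """Remove the spaces between 10-character groups and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = ungroup(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = ungroup(dna)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_30 = "GTGAGAATAG AAAGCTCGGA AAATGTATTA"
print(translate_cds(first_30))  # MRIESSENVL
```

The ten residues obtained from the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate_cds` would allow the GC content and Protein Length fields to be checked the same way.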