Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1888 |
Symbol | |
ID | 4809219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2241344 |
End bp | 2242927 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640107307 |
Product | hypothetical protein |
Protein accession | YP_001038302 |
Protein GI | 125974392 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACAAA TGGACAATGA AGCACAGTTT AAAGTGGAAT TTCATACCAC TAGCAACAAG TTGATAACAA AGATTTTATA TACCAGTGAT TTTTCTTCTA CCAGTAAGTT TAAGAGTGCC CTTAACCGCA ACAGCATAGA CCTTATATAC CAGGGAAATG ACCACGACCT TGAGATTATT AAGGATATCG TCTCCAGGAA GCCTTATGCT GTTAAGACAG GTGTAAGTTA TATAGGTATT GTGAAAAAAG AGTCACAGTA CATTTTTGTA TCAGACACAA AAGCCATTGA TAAAAATGAC AATATTGTTG ATGATATTGT AATCATGGAA AACTCAAAGT GTATCGGCAC TGGACTTGCT GAAGTACAGC CTATAGATAA TATTGGATTA GCGCAATTAT CAAAACTTCT TTTTAATTTT AATATTCTTC CTATTACAGC GACTATTATC GGTTTTTGTG CAAGTTGTTT TTTGAAAGAA AAATTATGGC AAAGCGGGAG AATAAAGCAT AACTACTTAA TTATTACAGG CGAATCAGGT TCGGGGAAAA GTGAAACAGT TGAAAATGTT ATTATGCCTA TGTTTGCTGC AACAAATTGC ACTATAAGCA GCGGGCAGAT AACAAGATTT GTAAATGCAA AAATGTCAGC AAGCAGCAAT CTTATCCCTA TGATTATAAC TGAATATAAG CCAACCAAGT TAAATAAGAA CAGAATGGAT GAAATTTCTG ACCTGTTAAG AAATACCTAT GACAGAACAC CTGCCTATAG AGGCAGGCCT GATTTGATAC TTAACGAATA TCTACCTCTT GCCCCAATAA TTCTAGTAGG GGAAATGGGC TTCGATGAAA CTGCTATAAA GGAACGCAGT CTGGAAGTCT TGTACTCTAA AGCGAATATT AGGGATGAGG TTATTCAAGG ACGATTTAAA ATGCTAAAAC GGCGTTCAAG AGAACTAAGA ATGCTTGGAA GAAGCCTATT AAACCAAGCA TTAAAAATAG AGCCTGATGA ATTGATAAGA AGGCATAGAG CAATTGAAGA AAGGATAAAT TTAAGTTTAC CATCCAGAGT TAAGAATTCT ATTGCAAATT GTATGCAGGG ATTGCTCCTT CTCAAGGATG TATATGATTC TCTGAATATG AACTTTGAGA AACAGGTAGG CTATAGCATT CAAGAACTCT TCAATTCTGT TTTAAGTGGA GTTCATGACT ACCTGCTTGA TGGGCAAAAT GAGGCTAAAG GAAGCATAGA GAAAATTCTA GAGGTTATCT GTCGTATGGA AGAATCAGGG GTACTTATAA GGGGTAATGA TTATCAGGTT ATAAATCAGG GTACTGAGTT GGCACTGAAT ATTAGTCCTT TATATGATAA ATTTACAAAG TATGTAAGAG AACATAACAT ATTAGATGTG GAAGTTTTAT CCCTTCCTCA ATTCAGAAAG CAATTAAGAA CAAAAGAGTA TTTCTCTGAC TACAAAACAG TAAGGTTCAA TACAGCCAAC TCTGCTGATA AGCCTGTAAA AGCATATACA CTTGATATTG CTAAGTTAAG AGAGATTCTT GATATTTCTG CACTGGTAGA ATAA
|
Protein sequence | MVQMDNEAQF KVEFHTTSNK LITKILYTSD FSSTSKFKSA LNRNSIDLIY QGNDHDLEII KDIVSRKPYA VKTGVSYIGI VKKESQYIFV SDTKAIDKND NIVDDIVIME NSKCIGTGLA EVQPIDNIGL AQLSKLLFNF NILPITATII GFCASCFLKE KLWQSGRIKH NYLIITGESG SGKSETVENV IMPMFAATNC TISSGQITRF VNAKMSASSN LIPMIITEYK PTKLNKNRMD EISDLLRNTY DRTPAYRGRP DLILNEYLPL APIILVGEMG FDETAIKERS LEVLYSKANI RDEVIQGRFK MLKRRSRELR MLGRSLLNQA LKIEPDELIR RHRAIEERIN LSLPSRVKNS IANCMQGLLL LKDVYDSLNM NFEKQVGYSI QELFNSVLSG VHDYLLDGQN EAKGSIEKIL EVICRMEESG VLIRGNDYQV INQGTELALN ISPLYDKFTK YVREHNILDV EVLSLPQFRK QLRTKEYFSD YKTVRFNTAN SADKPVKAYT LDIAKLREIL DISALVE
|
| |