Gene Information |
Locus tag | Cthe_0369 |
Symbol | |
ID | 4808446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 463483 |
End bp | 464706 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105783 |
Product | hypothetical protein |
Protein accession | YP_001036800 |
Protein GI | 125972890 |
COG category | [S] Function unknown |
COG ID | [COG1641] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00299] conserved hypothetical protein TIGR00299 |
|
|
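The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical (not database field names), and it assumes that coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 463483
end_bp = 464706

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1224 0 407
```

Under these assumptions the result agrees with the listed gene length (1224 bp) and protein length (407 aa).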
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.299825 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATAC TGTATTTTGA CTGTTTTGCG GGTGCCAGCG GTGATATGAT TCTGGGTGCC CTGCTGGATT TGGGAATTGA TGTCGGAATT TTTAAAAGGG AGCTTGCAGG ATTAAATTTG GACGGTTTTG ATATTGCTGT TGAAAAAAAA GTAATAAACT CAATAGCCGT AACCGATGTG AATGTTATTG TAAAAGAGGA ATGTAATCAT CATACCGGAC ACCATCATCA TTGTGAGCGC AATTTGGCGG ATATTGAGAA AATAATTGAC GAAAGCAGCC TGAAAGACAA TGTAAAAAGG CTTAGCAAAA AGATATTTTC AGAAATTGCC CGGGCTGAGG CAAAGGTCCA CAACAAATCC ATTGAAGATG TGCACTTTCA TGAAGTCGGT GCCATTGATT CCATTGTTGA TATTGTAGGT ACTGCAATTT GTCTGGACCT TTTGAAAGTT GACAAAATAT ACTCGTCACC GATGCATGAC GGCACGGGCT TTATAGAGTG TCAGCACGGA AAACTGCCGG TCCCGGTTCC TGCGGTTTTG GAAATGCTTA AGGAAAGCAA TATACCTTAC ATAACCGAGG ATGTGAACAC GGAATTGTTA ACTCCGACGG GCCTCGGAAT TATAAAATGT GTGGCTTCAA AGTTTGGCCC CATGCCCCCG ATGACTATTG AAAAAGTCGG ATATGGGGCA GGCAAAAGAC AGACGGGGCG TTTTAATGCC TTAAGGTGTA TTTTGGGAAA TGCTAAAGAA AAAGAAAAAA TTGATGATGA AATTTGTATG CTTGAGACAA ATATTGACGA TATGAATCCG GAGATTCTTG GCTATGTTAT GAACAGGCTT TTTGAGAACG GTGCACTGGA TGTATTCTAT ACGCCCGTTT ACATGAAAAA GAACAGACCG GGAGTTTTAT TGACGGTGCT TACGGACAAG GAGCATGAAG AGAAGCTTGT GGATATTATT CTGACAGAAA CGACCACTTT GGGAATCAGA AAGACCACCG CCCAAAGATA TGTTCTTGAA AGGGAAATAA AACATGTGAA TACTGAGTTT GGGAAAATAA GAGTGAAAGA GTCGTCCTTT GGCGATTACA AGAAATATTC GCCGGAATTT GAAGACTGTA AAAAAGTGGC CCAGGAATTG AAAATACCGC TGTCAAAGGT ATATGATGCC GTAAACAAAG CTATTTTAGT ATTTGAAGAA AGGAATGAAA ATGCTTTACA ATAA
|
Protein sequence | MRILYFDCFA GASGDMILGA LLDLGIDVGI FKRELAGLNL DGFDIAVEKK VINSIAVTDV NVIVKEECNH HTGHHHHCER NLADIEKIID ESSLKDNVKR LSKKIFSEIA RAEAKVHNKS IEDVHFHEVG AIDSIVDIVG TAICLDLLKV DKIYSSPMHD GTGFIECQHG KLPVPVPAVL EMLKESNIPY ITEDVNTELL TPTGLGIIKC VASKFGPMPP MTIEKVGYGA GKRQTGRFNA LRCILGNAKE KEKIDDEICM LETNIDDMNP EILGYVMNRL FENGALDVFY TPVYMKKNRP GVLLTVLTDK EHEEKLVDII LTETTTLGIR KTTAQRYVLE REIKHVNTEF GKIRVKESSF GDYKKYSPEF EDCKKVAQEL KIPLSKVYDA VNKAILVFEE RNENALQ
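The gene and protein sequences above can be checked against each other with a short sketch. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and written for this example only. The codon table is the standard one, which is assumed to be adequate here because translation table 11 uses the same codon assignments and differs only in its permitted start codons.

```python
# Standard codon assignments, built in TCAG order; translation table 11
# uses these same assignments and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record (30 bases).
prefix = clean_sequence("ATGAGAATAC TGTATTTTGA CTGTTTTGCG")
print(translate(prefix))  # MRILYFDCFA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the reported GC content.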