Gene Information |
Locus tag | Cthe_2495 |
Symbol | |
ID | 4809433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2963546 |
End bp | 2964928 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640107910 |
Product | hypothetical protein |
Protein accession | YP_001038890 |
Protein GI | 125974980 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
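The reported lengths are consistent with the coordinates above. Assuming the start and end positions are 1-based and inclusive, the gene length is End - Start + 1, and for an unspliced CDS the protein length is the number of codons minus the terminal stop codon. The following is a minimal sketch of that check; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2963546, 2964928)
print(length_bp, protein_length(length_bp))  # 1383 460
```

Both values match the Gene Length (1383 bp) and Protein Length (460 aa) rows.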
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCTTTTC AGAAAAAAGT CTGGCAGTTT AACGACATAA TCACAGAAGG CGAATTGAAT CGCATGGAAC AAGGTATTGA AGATTCTATA ACTGCCGCGA ATCAAGCTGA AGTAAATGCA AAGGCTTATA CTGACCAAGA AGTAGGTGAA GTTGCCCAAG AACTTGCTGC ACATAAGGCG GAAAGTACGC AGAACGCTCA TTTGGCGAAA AACATCGGGA TTGAAGACGC TGCGGGTAAC TTCACAGCGA CCGACGTGGA AGGGGCACTG GCCGAGCTTT TTACGTCTGT CAGTAATGGT AAGACTCTTA TCGCTGGGGC CATTACTGAC AAAGGAGTGC CGACCAATCC CAGCGATACA TTCCAGCAAA TGGCAACAAA TATTCAAGCA ATTCCTGTTG GAGATTATGC TGTAGGGGGT ACAATCCGTG ATTCTGTCTT GCGTTTTTTG CCGGGCGGTA TGGGTGTAGA AATCTGGTCG AAGACGGACG TGGCGAGAGG GCAGGGCATC GCCGTAGACA GTGCAGGAAA CGTATATGTC GCTCACTCTG TGGGCAGCGG CGGAAAAGCC GTACGAAAGT TGGATTCAGC AGGAAACGAA ATCTGGTCGA AGACGGACGT GGCGTATGGG CAGGGCATCG CCGTAGACAG TGTAGGAAAC GTATATGTCA CTCATTTTGT GAGCAGCAGC GAAAAAGCCG TACGGAAGCT GGACCCGAAC GGAAACGAGA TCTGGTCGAA GACGGACGTG GCGTATGGGT GGGGCATTGC CGTAGACAGT GCAGGAAACG TATATGTCGC TCACTCTGTG GGCAGCGGCG GAAAAGCCGT ACGAAAGTTG GATTCAGCAG GAAACGAAAT CTGGTCGAAG ACGGACGTGG CGAATGGGCG GTACATCGCC GTAGACAGTG CAGGAAACGT ATATGTCGCT CACAATGTGA GCAGCGGAAA AACCGTACGA AAGTTGGATT CAGCAGGAAA CGAAATCTGG TCGAAGACGG ACGTGGCGTA TGGGTGGGGC ATTGCCGTAG ACAGTGCAGG AAACGTATAT GTCGCTCACA ATGTGAGCAG CGGAAAAACC GTACGAAAGT TGGATTCAGC AGGAAACGAA ATCTGGTCGA AGACGGACGT GGCGTATGGG CAGGGCATCG CCGTAGACAG TGTAGGAAAC GTATATGTCA CTCATTTTGT GAGCAGCAGC GAAAAAGCCG TACGGAAGCT GGACCCGAAC GGAAACGAGA TCTGGTCGAA GACGGACGTG GCGAGAGGGC AGGGCATCGC CGTAGACAGT GTAGGAAACG TATATGTCAC TCACGATGTG AGCAGCGGCG AAAAAGCCGT ACGAAAGCTG GATGGGAACA GATATTTTCA AATAGTGGGG TGA
|
Protein sequence | MPFQKKVWQF NDIITEGELN RMEQGIEDSI TAANQAEVNA KAYTDQEVGE VAQELAAHKA ESTQNAHLAK NIGIEDAAGN FTATDVEGAL AELFTSVSNG KTLIAGAITD KGVPTNPSDT FQQMATNIQA IPVGDYAVGG TIRDSVLRFL PGGMGVEIWS KTDVARGQGI AVDSAGNVYV AHSVGSGGKA VRKLDSAGNE IWSKTDVAYG QGIAVDSVGN VYVTHFVSSS EKAVRKLDPN GNEIWSKTDV AYGWGIAVDS AGNVYVAHSV GSGGKAVRKL DSAGNEIWSK TDVANGRYIA VDSAGNVYVA HNVSSGKTVR KLDSAGNEIW SKTDVAYGWG IAVDSAGNVY VAHNVSSGKT VRKLDSAGNE IWSKTDVAYG QGIAVDSVGN VYVTHFVSSS EKAVRKLDPN GNEIWSKTDV ARGQGIAVDS VGNVYVTHDV SSGEKAVRKL DGNRYFQIVG
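The sequences above are displayed in space-separated blocks of 10 characters. Before computing anything from them (length, GC content), the blocks need to be joined into one string. The sketch below shows one way to do that, using only the first three blocks of the gene sequence as a small example; the helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Join a whitespace-separated, blocked sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, used here only as a short example.
example = clean_sequence("GTGCCTTTTC AGAAAAAAGT CTGGCAGTTT")
print(len(example), round(gc_fraction(example), 2))  # 30 0.4
```

Running the same two functions on the full gene sequence gives its length and GC fraction for comparison with the Gene Length and GC content rows.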