Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1166 |
Symbol | |
ID | 4810834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1390751 |
End bp | 1391629 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106588 |
Product | hypothetical protein |
Protein accession | YP_001037591 |
Protein GI | 125973681 |
COG category | [S] Function unknown |
COG ID | [COG1624] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00159] conserved hypothetical protein TIGR00159 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000552202 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTTC TGGTAGGAAC AACAAATTTT TGGGATATCA TTGCAAATTT GTCAACAAAC TTAGATATAA AAAGCCCGTG GGATTTGATA AAGACAATTA TAGATATAGG TATAGTGTCA TTTGTAATAT ACAAATTAAT AAAACTGATA AGGGAGACAC GGGCGTGGCA GCTTATAAAA GGTATTCTTG TTATTGTGAT TGCGGCCAGG GCAAGTGAAC TGATTGGTTT TAAAACTCTG TCTTTTATAC TGAGACTTAC CATAGAGTAT ATGGCTATAA TACTTGTGGT ACTGTTTCAG CCTGAGTTCA GAAGAGGACT GGAGCAGCTG GGAAGGAGCA GGTTTAGAAA TCTTTTCAGC TTTGAAGAAG AAGACAGTAC CATTAAAGTA AAGTCGCTGA TTGAGGAAAT AATAAAGGCC GTAACGGAGA TGTCAAGGAC CTTTACAGGA GCGCTTATTG TAATTGAAAG AGAAACGAAG TTGGGGGAGA TTATCAACTC GGGAATTAAC CTGGATTCAA ATGTTACCTC CGAGCTTTTG ATAAATATTT TCACGCCGAA TACGCCTTTA CACGACGGTG CCGTAGTAAT CAGGGACAAC AAGATAAAAG CGGCGGCATG CTTTTTGCCC CTTACGGAAA ATCCGAATCT CAGCAAGGAA CTGGGGACAA GGCACCGGGC TGCACTTGGC ATAAGTGAAG TTTCAGACGC AATTGTTGTG GTGGTTTCCG AAGAGTCCGG AAGGATTTCG GTTGCCCTAA ACGGCGGCCT TACCAGGAAC TTGACTTCGG ATACTTTGAG AAAGGCTTTA AGCAAAAATC TTTTAGATAA AGAGAATCCA AGCAAGAAAC TGGGGATATG GAAGGTGAAG GCCAAATGA
|
Protein sequence | MFFLVGTTNF WDIIANLSTN LDIKSPWDLI KTIIDIGIVS FVIYKLIKLI RETRAWQLIK GILVIVIAAR ASELIGFKTL SFILRLTIEY MAIILVVLFQ PEFRRGLEQL GRSRFRNLFS FEEEDSTIKV KSLIEEIIKA VTEMSRTFTG ALIVIERETK LGEIINSGIN LDSNVTSELL INIFTPNTPL HDGAVVIRDN KIKAAACFLP LTENPNLSKE LGTRHRAALG ISEVSDAIVV VVSEESGRIS VALNGGLTRN LTSDTLRKAL SKNLLDKENP SKKLGIWKVK AK
|
| |