Gene Information |
Locus tag | Cthe_1756 |
Symbol | |
ID | 4810186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2075853 |
End bp | 2077019 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107169 |
Product | putative virion core protein (lumpy skin disease virus)-like protein |
Protein accession | YP_001038170 |
Protein GI | 125974260 |
COG category | [S] Function unknown |
COG ID | [COG4260] Putative virion core protein (lumpy skin disease virus) |
TIGRFAM ID | |
|
|
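The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS includes its stop codon. The record does not state either convention, but both match the listed values. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cthe_1756.
length_bp = gene_length(2075853, 2077019)
print(length_bp, protein_length(length_bp))
```

The strand field does not affect this arithmetic, because the start and end positions are given on the replicon regardless of orientation.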
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000281872 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTGA TAAACTTTAT CAAAGGACAA TTCATTGAAG TTATCGAATG GACGGATTCT TCTGCAGATA CCATAGTTTA CCGCTTTCCT GTAGCAAACA AGGAAATAAA AATGGGAGCC CAGCTAACTG TGAGAGAATC GCAGGTGGCT ATATTCATAA ACGAAGGTCA GTTGGCGGAC GTTTTTGAAC CTGGAAGATA CGAACTTACG ACTGAAAACT TGCCGATTCT GACAAAGCTG AAATCATGGA AGTACGGATT TAATTCACCT TTTAAAGCAG AAGTGTATTT TATAAACACC AGACTGTTTA CCGAACAGAC TTGGGGTACC CAGTCACCCC TTCCGATGTT GGATCCTATG TTCGGCCCAA TTGAGGTCGG AGCAAGGGGA ACATATGCCT TCAGAGTGTC AGACCCGGTA AAATTCTTAA AAGACGTGTC AGGAACAAGG GGTACCATGG CAACTCAGGA TTTGACCAGG ACTTTAAGGT CATATATTAT GACGTACCTT AAAGATACGG TTGCAGAATC CAAAAAGTCC TTCTTTGAGA TGCAGAGCAA CATGCCTGAG TTTGCCGAAA TGGTTAAGGT GCATGCAAAG AGCAAATTTG AAGCTCTTGG GCTTGAACTG GTTGAATTTA CGATTGAATC GCTGATTCTT CCGGAAGAGC TCAGAAAAGC TTACCAGGAA GGCGCACAGA TAAATTTGAT GGGCGGTATG GACACTTACG CCAAAAAGAG AGCTCTCGAC GCAATGAACT CAGCCGCGTC AAACCAGGGG GGTGGCTCTT TTGCAAGCAT GGGTGCCGGA ATGGGAGCCG GAGCTGCGAT AGGCAATATA ATGGGACAGG TTTTTGGCGG AGGTTTTCAG CAGAATCCCC AATACTATCA GCCTCAGCCA CAGCCTCAGG TGCAGCCTCC GCAGCAGAGT GTGGTGTGTC CTTCGTGTAA AACCGGTGTG CCCGCCGGGA CAAAATTCTG TCCAAACTGC GGCAAGTCTC TTGTGGAAGA GAAGGACAGG TGTATTAAAT GCAATCATGA GATCAGCAAA GGAGCAAAGT TCTGTCCTGA ATGCGGAGAA AAGCAGGAGG TAGTGTGCAA CAACTGCGGT GCGAAGCTTT CACCGGGAAC AAAATTCTGT TCCGAATGCG GAACAAAGGT AGAGTAA
|
Protein sequence | MGLINFIKGQ FIEVIEWTDS SADTIVYRFP VANKEIKMGA QLTVRESQVA IFINEGQLAD VFEPGRYELT TENLPILTKL KSWKYGFNSP FKAEVYFINT RLFTEQTWGT QSPLPMLDPM FGPIEVGARG TYAFRVSDPV KFLKDVSGTR GTMATQDLTR TLRSYIMTYL KDTVAESKKS FFEMQSNMPE FAEMVKVHAK SKFEALGLEL VEFTIESLIL PEELRKAYQE GAQINLMGGM DTYAKKRALD AMNSAASNQG GGSFASMGAG MGAGAAIGNI MGQVFGGGFQ QNPQYYQPQP QPQVQPPQQS VVCPSCKTGV PAGTKFCPNC GKSLVEEKDR CIKCNHEISK GAKFCPECGE KQEVVCNNCG AKLSPGTKFC SECGTKVE
|
| |
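The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how that could be done, together with a GC-fraction calculation and a stop-codon check. The helper names are hypothetical. The stop codon set assumed here is the one for translation table 11 (TAA, TAG, TGA).

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of three and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustration using only the first two blocks of the gene sequence above.
fragment = clean_sequence("ATGGGTTTGA TAAACTTTAT")
print(len(fragment), gc_fraction(fragment))
```

Applied to the full gene sequence, `len(clean_sequence(...))` would be expected to match the Gene Length field and `gc_fraction` to land near the GC content field, although the record does not say how its percentage was rounded.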