Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2151 |
Symbol | |
ID | 4811199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2557230 |
End bp | 2559425 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107555 |
Product | hypothetical protein |
Protein accession | YP_001038547 |
Protein GI | 125974637 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000594133 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATGGA CCACATATAT AAGTGCCTAT TTGTTCAGAA CGACCGGCGA ACCAATGTTG TATGGCAATA TGACTTCCTT CCTGATTTTT TCATTTATTA TTATCTTTTT CATTGCAAAA GACGAAACGG ATATTCCAGC TTTATTTAAA TCGTTAAAAT CTGTAGTTTC AAAGATAAAA GACAATTCGT TTTTGAAAAG CAACGTTATT GAAATCGCTT ATGTTTTGGC TTTTTTGAGC ATATGGACAT TTTTGGTGAT TGGGTCTTTT AACATAAAAG ACGGTGTAAT GAGAATTGGA TGGTCAGTCT CAAGTGATTT TTCAACCCAT CTTGCGGTGA TTAGGTCTTT TTCCTATGGT TCTAACTTTC CCTCGGGATA CCCTCACTTT GCAGACGGTA ATATGAGATA TCATTTTATG TTCATGTTTT TGGCGGGAAA TCTGGAGTTT TTAGGCTTAA GACTTGACTG GGCGTTTAAT CTGCCGTCAA TTTTGTCGAT CGTTTCATTT TTGATGCTGC TTTATTCATT TGCCGTGCTG CTGCTGGGCG AAAAAATTAT CGGAGTTGTG ACGGGAATAT TGTTCTTTTT CAGAAGTTCC TTTGCTTTCT TTACTTTTAT CACGGGAAAA CCTTCGATAA AAGAAGCTTT GAAAGAATTA AAAAATTTAA AAGAGCATAT TGGTAATACG TTGAATGAAG AATGGGGACT TTATGCGCAG AAGGTTTATG TAAATCAGCG GCATTTGCCT TTTGTATTGG GAATTATGAT GCTTGTTCTG ATAGTGATTC TTCCTTTGTT TATTAAGATG ATGACAAGCA TTGGCGAGCT TTATCAGGAA AGAGTGAAAA AATCCGATGA AGAACAAATG CCCGATGAAA ACAAAGACTC TGAAAGTTTT TGGAAATCAT ATGTTAAAGA ATTTATTTTC AGTAAAAACG CGTGGATTCC GGAAAGTATA TTTACATCTG TGGTGCTGGG GGTTATTCTT GGGCTTTCAA GTTTTTGGAA TGGAGCGGTG GTTATAGCGG CGCTTTTAGT GCTTTTTGTT ATGGCTGTTT TCTCGAAACA CAGACTTCAG TATCTGATTA TGGCGTCAAT TACTGTCGTG TTGGCTTATT GCCAGACTCA ATTTTTTGTA AAAAGCGGCG GTTCTGTGGT TTCACCCAGC ATATATATTG GTTTTCTGGC AAACACCAAC GGGCTGGAAC AGGATTTGTC CAGATATTTT GCGAATAATG GTTTGTGGGC TACACTCGAG CATTTCTTTA AACTGATTCC TTACGTTACG GCTTTTTATA TTGAACTTCT CGGACTTCTT CCTTTTATTG TGGCCATAAA TTTGCTCTCA AGAAACAGCA AATACAAATA TCATATAAGC TTGTTGTTTA TAGCTGTGAC ACAAGTTATT GGCTACTTCT TCATAAAGTA CAAACTGGAC AGTGACGACA AAATTATAAA CAAGCAGCTT TTTACTGGTT CAATGCTGTT TGCTTTGCTG TTGGTGGTGT TTGCATGCAT GGCATATATG TTTTACGAAA ATTCTCCCGT GCCCCGGGGG ACAAGGGCGT TAATACTGGC ATTTAGCACT CCGATTATCT TTGCAAGCTC TGTCAAGCTG ACATTAGGCG TTGATATAAA CCATAAATAT GTGATTATTG GTTCCATACT TGTAAATATT TTTGTGGCAT CTTTTATATT CTTCCTTTTC AAAGTAAAAA AACCTTTTGC CACCCTTGTG GCTGTACTTG TTTCCGTGAT GATTACAATT ACAGGTTTTG CGGATCTGAA AGTGCTTTAC AATCTTAACA ATTCTTATGT TACAATCAAT TGTGATGATC CTTTGTTGGT GAAAGTGAAA AATGATACCG GCAAAGATGA AATTTTCCTG ACGGATAATT ACCATCTTCA TCCTCTGCTT CTGTCAGGTA GAAAGATATT CTGCGGTTGG CCGTATTTTG TGGCGTCTGC AGGTTACGAC TGGGATCTTC GGAATGACAT AAGGAGAAGA ATTCTTACTG CCACTGACCA AAACACACTT AAAAAACTTG TGGAGGAAAA CAATATCAGT TACATAGTTA TTGACAACGG GCTGAGAAAT TCTCAAGGGT TTACCGTGAA TGAAGAGCTT ATCAGAAATA CTTTCAGTGT TTTCTACGAT GACGGAGTCG ACGTTGTTAT TTACAAAACA CATTAG
|
Protein sequence | MTWTTYISAY LFRTTGEPML YGNMTSFLIF SFIIIFFIAK DETDIPALFK SLKSVVSKIK DNSFLKSNVI EIAYVLAFLS IWTFLVIGSF NIKDGVMRIG WSVSSDFSTH LAVIRSFSYG SNFPSGYPHF ADGNMRYHFM FMFLAGNLEF LGLRLDWAFN LPSILSIVSF LMLLYSFAVL LLGEKIIGVV TGILFFFRSS FAFFTFITGK PSIKEALKEL KNLKEHIGNT LNEEWGLYAQ KVYVNQRHLP FVLGIMMLVL IVILPLFIKM MTSIGELYQE RVKKSDEEQM PDENKDSESF WKSYVKEFIF SKNAWIPESI FTSVVLGVIL GLSSFWNGAV VIAALLVLFV MAVFSKHRLQ YLIMASITVV LAYCQTQFFV KSGGSVVSPS IYIGFLANTN GLEQDLSRYF ANNGLWATLE HFFKLIPYVT AFYIELLGLL PFIVAINLLS RNSKYKYHIS LLFIAVTQVI GYFFIKYKLD SDDKIINKQL FTGSMLFALL LVVFACMAYM FYENSPVPRG TRALILAFST PIIFASSVKL TLGVDINHKY VIIGSILVNI FVASFIFFLF KVKKPFATLV AVLVSVMITI TGFADLKVLY NLNNSYVTIN CDDPLLVKVK NDTGKDEIFL TDNYHLHPLL LSGRKIFCGW PYFVASAGYD WDLRNDIRRR ILTATDQNTL KKLVEENNIS YIVIDNGLRN SQGFTVNEEL IRNTFSVFYD DGVDVVIYKT H
|
| |