Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1654 |
Symbol | |
ID | 4808904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1980282 |
End bp | 1982201 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107069 |
Product | hypothetical protein |
Protein accession | YP_001038070 |
Protein GI | 125974160 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGCG TCAGCTCGTT CAGAGATGTT ATTGCAAATA TGTATTACAA TGATCTCTTT GACGAATTGT CTGAATATAT AGAGGACAAC CCGGATAAGC TTGAATCCAA CTCATACCAT GTGCAATCAC CGGATGAAGC AGCATTATCT GATTTTGACA TCATAACGAT AGATATAACC GACTCGCCAG GTAACAGTAT TTTATTTGAC GTAATTGTTT CTGCCGAAAT TGAAATTGCA GAAACAGTAC GAAGGAATCG CGAGACTGAT GGTATAGAAC AATGGTTCCG TATCTCCTGC AGAGCTGACC TTGATGACGG AATTCAGAAT TTTCAAATCA ACTCTGTTTC AATATACAAC AAGTACAGAG AAAGCAAATT AGGCAGACTG TCTGAGTATT TAGTACCAAT TATAGAAAAG GAACAGTTTG ACAATGTTGC TACTGAATTT CTAAATGAGT TTTGCCCAGA AGCATTAAGT ACTCCTATGC CCATTCCAGT AGATGAAGTA GTGAAAAGAA TGGGGCTTAA GGTTAAGGAA ATCCAGCTTA CAAAGCATTT CACTATATTT GGTCAAATAG TCTTTGGCGA TTGCACAATA GAGTATTACG ACAGAAATGA AAGAACATAT AAGCCTTTGG AAGTTTCAAG AGGAACAATT CTCGTGGATC CTAATGTGTA TTTCATGCGA AACATAGGGT GCATGAACAA TACCATTATT CATGAGTGTG TCCACTGGTA TAAGCATAGA AAATACCATG AGTTAGTTAA GACGTATAAC AGCGATGCTT TGCTCATAAG CTGCAGGGTA AACGAAACAA CTAAATACAA ACAGCAATGG ACGCCAGAAG ACTGGATGGA ATGGCATGCT AACGGAATTG CACCACGAAT CCTTATGCCT AGATCAATGA CCATTAAAAA GATTGAGGAG CTAATTAAAA AGAATGAGCT CCTTTTTGGT ACTTACGACA GGCTAAATAT AATGGAAAAT GTCGTGTATG AATTAGCTGA CTTCTTCCAG GTGTCAAGGA TAGCGGCCAA AATAAGGATG CTTGACCTTG GATATAAGGA AGTTGAAGGT GTATATACCT ACGTAGATGA CCATTTTATC AGCAATTATT CATTTAAGGC AGACTCATTA CATAAGAATC AAACATACAG TATTAGCCTA AGTGATTCTT TTTTTGAATA CTATGCAAAT CCGGAATTCG CAAAGATTAT AGACAGCGGT AATTTTATTT ATGTTGATGG TCATTACGTT ATTAACGACT CCAAATACAT TAAAAAGTTA GAAAATGGAA GCATTGATCT TACAGACTAT GCAAAACTGC ATGTAGATGA ATGCTGCCTT CTGTTTGATT TAAAATTAAA TAAAGCCTCA AAAATGGACA TTGTAGTATA CCTCGATTCT ATAATGTTCC GTAAAGCTAC ACCGGATTAT AACAGAGTGC CGACATTTAA TCCGGACAAG CATAATATGG AAGTATTTAA TCGTTCAGAA GAGCTAAAGA AGTTTCACGA AGAATTCGTC GAAGAAGGTC AGCATTTGAG CCGTACAACC CAGACATTTT CCCAAGCGGT ATACGGACAT ATCAAAAGGA AAGGCTACAA TAAGGTTGTT TTTATAGAAA AGACTTTGCT TTCAGGAAAA ACATATGACA GAATAAAAAA CAATGAACTT AACAATCCAA CTTTAGAAAC CGTTGTTGCA ATCTGCATCG GATTGGAGCT AAGCCCTACA TACAGTGAAG AAATATTAAG GCTTGCCGGA TATACTCTCA ATAACACTCC ACAGCAATTG GCGTATAAAA AGCTAATCCA TTCGTATAGA GGGCATTCAA TATATGAATG CAATGAAGTT TTGGAAGCCT TGGGACTTTC CCCTCTTTGT GCAAAGGCAT ATAAAGAAAT GATAAGTTAA
|
Protein sequence | MASVSSFRDV IANMYYNDLF DELSEYIEDN PDKLESNSYH VQSPDEAALS DFDIITIDIT DSPGNSILFD VIVSAEIEIA ETVRRNRETD GIEQWFRISC RADLDDGIQN FQINSVSIYN KYRESKLGRL SEYLVPIIEK EQFDNVATEF LNEFCPEALS TPMPIPVDEV VKRMGLKVKE IQLTKHFTIF GQIVFGDCTI EYYDRNERTY KPLEVSRGTI LVDPNVYFMR NIGCMNNTII HECVHWYKHR KYHELVKTYN SDALLISCRV NETTKYKQQW TPEDWMEWHA NGIAPRILMP RSMTIKKIEE LIKKNELLFG TYDRLNIMEN VVYELADFFQ VSRIAAKIRM LDLGYKEVEG VYTYVDDHFI SNYSFKADSL HKNQTYSISL SDSFFEYYAN PEFAKIIDSG NFIYVDGHYV INDSKYIKKL ENGSIDLTDY AKLHVDECCL LFDLKLNKAS KMDIVVYLDS IMFRKATPDY NRVPTFNPDK HNMEVFNRSE ELKKFHEEFV EEGQHLSRTT QTFSQAVYGH IKRKGYNKVV FIEKTLLSGK TYDRIKNNEL NNPTLETVVA ICIGLELSPT YSEEILRLAG YTLNNTPQQL AYKKLIHSYR GHSIYECNEV LEALGLSPLC AKAYKEMIS
|
| |