Gene Information |
Locus tag | Cthe_0017 |
Symbol | |
ID | 4808782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 24056 |
End bp | 25918 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105427 |
Product | hypothetical protein |
Protein accession | YP_001036452 |
Protein GI | 125972542 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01131] ATP synthase subunit 6 (eukaryotes), also subunit A (prokaryotes) |
|
|
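The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


length = gene_length(24056, 25918)
print(length)                  # 1863
print(protein_length(length))  # 620
```

Both values match the Gene Length (1863 bp) and Protein Length (620 aa) rows of the record.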
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000261883 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAT TAAATTTGAA AAGTAAACTC GCCATTTTTG CCACAACGTT AAAAGAAGTT TTTATATCTT CGCTGCCTCT TGCGGCAATT ATGATTATTG TGTGCGGTTT TATCGCACCT TTGGACAGTG GGGCGGAGTA TGTCAAATTA TTTGTCGGCT ATGCCAGTGT TGTTTTTGGC CAGGCATTGT TTTTGGACGG TTTAAATATT AGTATTCTTC CCATAGGAAA ATTGGTTGGG GGTTCGCTAA TAAAGCTTAA AAAATCAATC TTTGTTATTT TCTTCGGACT TCTTTTTGGC GTGCTTGCCA CTGTCGCGGA ACCTGCACTG ACCGTTTTGG CCAAGCAGAC CAACATGATT ATGCCGATTA TCAACGAAAC CGTGTTTATC TGGATTATGG GTTTTGGAAT CGGCGTGATG CTTGCATTCT CCCTCTTTCG AATTATGAAG GACTTAAATA TTAAAGTGGT TTTTGCCATA TTGTATGTCA TTACTTTTCT GTTGATTATA TTTGTTCCTG ATGAATTTGT TGCTTTGGCT TTTGACGGAA GCGGTGCAAC TACGGGGGAC ATTTCGGTTC CGTTTATTTT GGCTTTGGGT ATGGGTGTTT CCACTACCAT GTCCAGGCAC AAAAGCAATG ACGACAGCTT TGGGATTATT GGCCTTGCTC CGGTGGGTCC GATTATTTCA TTGGCCATCT ACGGTATAGT ACTTAAGTTT CTTTACAACG GTGTATTTCC TCCTGAACAG GTATACTCTC CTGAGACGGT GGGAACCGTT GGTGAGATAA TTGTAAATAA TTTGTGGGGA GTTACATTGG CACTTTTGCC GGTTATCATA GTGTTTTTAC CGTTTCAGTT TTTGTTAATC AAACAGCCGA AGAAAGAATT TGTGAAAATT TTGTTAGGTA CTGTTGTAGT TTTTATAGGC TTGCTGATTT TTCTGGCAGG CATAGATTAT GGATTTGCAT TTGCAGGCAA ATACATCGGG GAGGTTTTCC TGGATCCTTC ACGTCCCGAG TGGTTTAAGT GGTTGCTTTT GATTGTTGCG TTTATTTTAG GTGCCGCCAT TACCTTGTCG GAGCCTGCTG TTACGGTGCT GGGAGAACAG TTGGAAGAGA TGACCAACGG ACATATTGCA AAGATGACAA TTCGCATGAC TCTTGCCATA GGTATTGGCT TTGCTGCTTT GCTTGGAATG TTGAAAATAT TGACGGAGAT TAACATATTG TGGTTCTTAA TACCCCTGTA CGCCGTCGCT CTTATCATGA TGATATTTGC GCCAAAGCTG TTTGTCGGTC TTGCTTTTGA CTCGGGAGGA GTGGCCGGCG GAGCTTTAAC TTCTGCATTT TTGACGCCGC TTACCCTTGG GGTGGCACAG GCTGTGGCTG CGACATCACC TTCCGGCGGA CAGCCGATTT TGGTCAACGG TTTTGGAATT ATTGCATTTA TTTCAGTTAC CCCATTGATT GCTGTACAAT TTTTAGGTAT AGTGTATAAT ATAAATATTA AGAAAGCGGA AAAAGCTCTG AAAGATGCTG AAATGAATGA TATAAAAGAG TTGGCGTCCC TTGCGGGTAT CGTTGAAGAA GTCGCAGCTG AAAAAAGTGC GACTCAAGAA AGTATAGCTG AGAAAGCTGC AGTTCAGGAA AGTATAGTTG AAAAAGGTGG GATTGAAAAA AGTGTAGTGG ATGAAGAAAG TATAACCGAA AAAAGCATAG TTAAGGAAAA TATAGATGAA GACGATATAG CGGCTAAAAT GGATGAACAG CGTCAAAATG AGCAGCATCA TGAAGATAAT 
ATTGTGAAGG ACATGAAAGA CGGACACGAG GAACTTAAGG CAGGTAAAAA CAGTGCGGAG TAG
|
Protein sequence | MKKLNLKSKL AIFATTLKEV FISSLPLAAI MIIVCGFIAP LDSGAEYVKL FVGYASVVFG QALFLDGLNI SILPIGKLVG GSLIKLKKSI FVIFFGLLFG VLATVAEPAL TVLAKQTNMI MPIINETVFI WIMGFGIGVM LAFSLFRIMK DLNIKVVFAI LYVITFLLII FVPDEFVALA FDGSGATTGD ISVPFILALG MGVSTTMSRH KSNDDSFGII GLAPVGPIIS LAIYGIVLKF LYNGVFPPEQ VYSPETVGTV GEIIVNNLWG VTLALLPVII VFLPFQFLLI KQPKKEFVKI LLGTVVVFIG LLIFLAGIDY GFAFAGKYIG EVFLDPSRPE WFKWLLLIVA FILGAAITLS EPAVTVLGEQ LEEMTNGHIA KMTIRMTLAI GIGFAALLGM LKILTEINIL WFLIPLYAVA LIMMIFAPKL FVGLAFDSGG VAGGALTSAF LTPLTLGVAQ AVAATSPSGG QPILVNGFGI IAFISVTPLI AVQFLGIVYN INIKKAEKAL KDAEMNDIKE LASLAGIVEE VAAEKSATQE SIAEKAAVQE SIVEKGGIEK SVVDEESITE KSIVKENIDE DDIAAKMDEQ RQNEQHHEDN IVKDMKDGHE ELKAGKNSAE
|
| |
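The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which this sketch does not model; the gene here starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is ignored, so sequences printed in blocks of ten
    bases (as in the record above) can be pasted directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGAAGAAAT TAAATTTGAA AAGTAAACTC"))  # MKKLNLKSKL
```

The output matches the first ten residues of the protein sequence listed above. Applying the same function to the last bases of the gene (`... AGTGCGGAG TAG`) yields the C-terminal residues `SAE` followed by the stop codon TAG.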