Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0571 |
Symbol | |
ID | 4808246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 698163 |
End bp | 699530 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640105985 |
Product | sun protein |
Protein accession | YP_001037000 |
Protein GI | 125973090 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases [COG0781] Transcription termination factor |
TIGRFAM ID | [TIGR00446] NOL1/NOP2/sun family putative RNA methylase [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB [TIGR01951] transcription antitermination factor NusB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0809795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATGA GGACAAAAGT GGACAAAGTA AGGGAGACTG CACTTAAGAT ATTGTACGAT ATCAATGAAA AGGGAGCATA TTCGAATATC TCCCTGAATA AATATTTGAA TGGCCAGGAA TTTGAAAGTA TTGACAGGGC GTTTATCACT GACATTGTGT ACGGTACGTT AAAGTGGCAA TATACCATTG ATTATTTAAT TGAAAAGTTT TCGTCAGTCA AAATTAAAAA GATTTCTCCG TGGATATTCA ATATTTTGAG GATGGGTATT TACCAGTTGA TTTACACGGA CAAAATACCT TTTTTTGCTG CGTGCAATGA AAGTGTGAAG CTTGCGGCAA AGTATGGCCA TGCTGCCAGC AGCAAATATG TTAATGCTGT TTTGAGAAAT ATAGCGAGAA ACAAGGAGAA TCTGCCGTAT CCCGACAGAA ACAATGATAC GGCACACTAT CTTTCTGTAA AGTATTCCCA TCCAGTATGG ATGGTAAAGG ATTGGCTTGA CTGCTTTGGT GAGGAATTTA CCGAAGGGCT TTTGAAAGCC AATAATGAAG TTGCACCGTT TACTGTAAGA GTAAATGATT TAAAAATATC TAAAAAAGAG CTGGTGGATA TTTTAACAAA GGACGGTTTT GAGGTTGAAA ACGGCAAGTA TCTGGATGAA GCACTGATAA TAAGGAATCC TTCGGCGGTT CAAAAGATGG ATGCTTTTGC GAAGGGATAT TTTCAAGTAC AGGACGAAAG CTCCATGCTT GTGGCAAAGG TATTGGATCC AAAGCCGGGA GAGACAATAC TTGATGTCTG CAGTGCGCCA GGAGGAAAGT CCACCCATAT AGCACAGATT ATGAAAAACC GTGGTACTGT GATATCCAGA GACATTCATG AACATAAAAT TAAACTGATA GAACAGGCAA AAGAAAGACT GGGTCTGGAA ATAATAAAAA CTGAGGTGTT TGACGCCGCA GTTCTGGACG GTAAATTAAT AGAAAAAATT GACAGGGTTT TAGTGGATGC TCCGTGTACC GGTTTTGGTA TAATAAGAAG GAAGCCTGAT ATAAAGTGGT CAAAAAATTC GGAAGACAAG GCTGAGATTG TGAGCCTTCA GCATAAAATA CTTTCAACGG CGTCAAAATA TGTAAAAGAC GGTGGTGTGC TGGTATACAG CACCTGTACG TTAGAGCCGG AAGAGAACGA AAAAGCGGTG GAAAGGTTTA TTGAAGAGAA CAAGGACTTT TATTTGGAAG ATATAACAGA GTTTCTTCCT GATGCTTTAA GAAAAGAAAG CGCAGGCAAA GGATACATTC AGCTATATCC GAATATAGAC GGAATCGATG GATTTTTTAT TGCAAGAATG AGAAAAAGGA GCAAGTAA
|
Protein sequence | MDMRTKVDKV RETALKILYD INEKGAYSNI SLNKYLNGQE FESIDRAFIT DIVYGTLKWQ YTIDYLIEKF SSVKIKKISP WIFNILRMGI YQLIYTDKIP FFAACNESVK LAAKYGHAAS SKYVNAVLRN IARNKENLPY PDRNNDTAHY LSVKYSHPVW MVKDWLDCFG EEFTEGLLKA NNEVAPFTVR VNDLKISKKE LVDILTKDGF EVENGKYLDE ALIIRNPSAV QKMDAFAKGY FQVQDESSML VAKVLDPKPG ETILDVCSAP GGKSTHIAQI MKNRGTVISR DIHEHKIKLI EQAKERLGLE IIKTEVFDAA VLDGKLIEKI DRVLVDAPCT GFGIIRRKPD IKWSKNSEDK AEIVSLQHKI LSTASKYVKD GGVLVYSTCT LEPEENEKAV ERFIEENKDF YLEDITEFLP DALRKESAGK GYIQLYPNID GIDGFFIARM RKRSK
|
| |