Gene Cthe_0571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0571 
Symbol 
ID4808246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp698163 
End bp699530 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content39% 
IMG OID640105985 
Productsun protein 
Protein accessionYP_001037000 
Protein GI125973090 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases
[COG0781] Transcription termination factor 
TIGRFAM ID[TIGR00446] NOL1/NOP2/sun family putative RNA methylase
[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB
[TIGR01951] transcription antitermination factor NusB 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0809795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATGA GGACAAAAGT GGACAAAGTA AGGGAGACTG CACTTAAGAT ATTGTACGAT 
ATCAATGAAA AGGGAGCATA TTCGAATATC TCCCTGAATA AATATTTGAA TGGCCAGGAA
TTTGAAAGTA TTGACAGGGC GTTTATCACT GACATTGTGT ACGGTACGTT AAAGTGGCAA
TATACCATTG ATTATTTAAT TGAAAAGTTT TCGTCAGTCA AAATTAAAAA GATTTCTCCG
TGGATATTCA ATATTTTGAG GATGGGTATT TACCAGTTGA TTTACACGGA CAAAATACCT
TTTTTTGCTG CGTGCAATGA AAGTGTGAAG CTTGCGGCAA AGTATGGCCA TGCTGCCAGC
AGCAAATATG TTAATGCTGT TTTGAGAAAT ATAGCGAGAA ACAAGGAGAA TCTGCCGTAT
CCCGACAGAA ACAATGATAC GGCACACTAT CTTTCTGTAA AGTATTCCCA TCCAGTATGG
ATGGTAAAGG ATTGGCTTGA CTGCTTTGGT GAGGAATTTA CCGAAGGGCT TTTGAAAGCC
AATAATGAAG TTGCACCGTT TACTGTAAGA GTAAATGATT TAAAAATATC TAAAAAAGAG
CTGGTGGATA TTTTAACAAA GGACGGTTTT GAGGTTGAAA ACGGCAAGTA TCTGGATGAA
GCACTGATAA TAAGGAATCC TTCGGCGGTT CAAAAGATGG ATGCTTTTGC GAAGGGATAT
TTTCAAGTAC AGGACGAAAG CTCCATGCTT GTGGCAAAGG TATTGGATCC AAAGCCGGGA
GAGACAATAC TTGATGTCTG CAGTGCGCCA GGAGGAAAGT CCACCCATAT AGCACAGATT
ATGAAAAACC GTGGTACTGT GATATCCAGA GACATTCATG AACATAAAAT TAAACTGATA
GAACAGGCAA AAGAAAGACT GGGTCTGGAA ATAATAAAAA CTGAGGTGTT TGACGCCGCA
GTTCTGGACG GTAAATTAAT AGAAAAAATT GACAGGGTTT TAGTGGATGC TCCGTGTACC
GGTTTTGGTA TAATAAGAAG GAAGCCTGAT ATAAAGTGGT CAAAAAATTC GGAAGACAAG
GCTGAGATTG TGAGCCTTCA GCATAAAATA CTTTCAACGG CGTCAAAATA TGTAAAAGAC
GGTGGTGTGC TGGTATACAG CACCTGTACG TTAGAGCCGG AAGAGAACGA AAAAGCGGTG
GAAAGGTTTA TTGAAGAGAA CAAGGACTTT TATTTGGAAG ATATAACAGA GTTTCTTCCT
GATGCTTTAA GAAAAGAAAG CGCAGGCAAA GGATACATTC AGCTATATCC GAATATAGAC
GGAATCGATG GATTTTTTAT TGCAAGAATG AGAAAAAGGA GCAAGTAA
 
Protein sequence
MDMRTKVDKV RETALKILYD INEKGAYSNI SLNKYLNGQE FESIDRAFIT DIVYGTLKWQ 
YTIDYLIEKF SSVKIKKISP WIFNILRMGI YQLIYTDKIP FFAACNESVK LAAKYGHAAS
SKYVNAVLRN IARNKENLPY PDRNNDTAHY LSVKYSHPVW MVKDWLDCFG EEFTEGLLKA
NNEVAPFTVR VNDLKISKKE LVDILTKDGF EVENGKYLDE ALIIRNPSAV QKMDAFAKGY
FQVQDESSML VAKVLDPKPG ETILDVCSAP GGKSTHIAQI MKNRGTVISR DIHEHKIKLI
EQAKERLGLE IIKTEVFDAA VLDGKLIEKI DRVLVDAPCT GFGIIRRKPD IKWSKNSEDK
AEIVSLQHKI LSTASKYVKD GGVLVYSTCT LEPEENEKAV ERFIEENKDF YLEDITEFLP
DALRKESAGK GYIQLYPNID GIDGFFIARM RKRSK