Gene Cthe_0461 details


Gene Information

Locus tag: Cthe_0461
Symbol:
ID: 4808389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 576255
End bp: 577565
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 42%
IMG OID: 640105875
Product: tRNA (uracil-5-)-methyltransferase Gid
Protein accession: YP_001036892
Protein GI: 125972982
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1206] NAD(FAD)-utilizing enzyme possibly involved in translation
TIGRFAM ID: [TIGR00137] tRNA:m(5)U-54 methyltransferase
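
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the source database.

```python
def inclusive_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the final codon is the stop codon, not a residue


# Values copied from the record above.
span = inclusive_span(576255, 577565)
protein_length = expected_protein_length(span)
print(span, protein_length)
```

Under these assumptions the span agrees with the reported gene length of 1311 bp, and the derived residue count agrees with the reported protein length of 436 aa.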


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGATT ATATAAATGT GATAGGCGCC GGACTTGCCG GCTGTGAGGC GGCATGGCAG 
ATTGCCAAAA GAGGAATAAA AGTTAAGCTT TTTGAAATGA AACCGAAAAA ATTTTCACCG
GCACATCATA TGGAAACTTT CGCAGAGCTT GTTTGCAGCA ATTCCTTGCG TTCAAACCAG
CTGGAAAATG CAGTCGGACT CTTAAAAGAG GAAATGAGAC TTTTGAACTC CATAATTATG
AAATGTGCCG ATGCTGCCCA GGTTCCTGCC GGAGGAGCTC TGGCAGTGGA CAGAACAAAG
TTTTCTCAAA TGGTGACGGA GCTTATAAAA CAAAATGAAA ATATAGAAGT GATAAATGAA
GAGGTACGGG AGCTGCCGAA GGAAGGAATT ACAATTGTTG CCACAGGGCC TCTTACATCC
GGGGATCTGT CAAAACATTT GGCGGATTTT GTAGGTGAAG GCTATCTTCA TTTCTTCGAT
GCTGCGGCTC CTATTGTTAC TTTCGAGTCA ATAGACATGA ACAAGGCATT TAAGGCCGCA
CGTTATGGAA GGGGAACGGA TGATTACATA AACTGTCCCA TGAACAAAGA AGAGTACGAG
ATTTTCTGGA ATGAACTGGT GAATGCAGAA CTTGCGGAAG TAAAGGACTT TGACCGCGAA
GTGGTTTTTG AAGGGTGTAT GCCTGTTGAA ACCATGGCCA AAAGAGGAAA GGATACTCTA
AGGTTCGGTC CTTTAAAACC GGTAGGTCTT GTTGACCCCA ATACCGGAAA GGAACCTTAT
GCGGTTGTTC AGTTAAGACA GGACAACAGT GAGGGAACCA TGTATAATAT GGTAGGTTTT
CAAACAAGGC TTAAGTGGCC GGAGCAAAAA AGAGTGTTTC GATTAATACC GGGACTTGAA
AATGCTGAAT TTGTAAGGTA TGGTGTAATG CACAGAAATA CTTTCATAAA TTCGCCGGTA
CTTTTGGATG CCACATACTG CCTTAAAAAA TCTCCCAATA TTTATTTTGC GGGGCAGATT
ACCGGAGTTG AAGGATATGT GGAATCAGCC TCTTCAGGCA TGGTTGCGGG AATTAATGCT
GCCATGGATT TTCTGGGAAA GGATAGGGTT GTATTTCCGA AAAGTACAGC CATCGGTGCT
TTGAGTCATT ATGTTTCCGA CAGCAGTATA AAAAATTTTC AGCCCATGAA TGTTAATTTT
GGCATAATGG AGAGCTTTCC TCTCAAAATA CGGGATAAAA GAAAAAGAAA TTATGAGACG
GCAATGCGGG CTCTTAAGAT TTTAAAAGAG TATGTTTCAA AGTATTCGTA A
 
Protein sequence
MIDYINVIGA GLAGCEAAWQ IAKRGIKVKL FEMKPKKFSP AHHMETFAEL VCSNSLRSNQ 
LENAVGLLKE EMRLLNSIIM KCADAAQVPA GGALAVDRTK FSQMVTELIK QNENIEVINE
EVRELPKEGI TIVATGPLTS GDLSKHLADF VGEGYLHFFD AAAPIVTFES IDMNKAFKAA
RYGRGTDDYI NCPMNKEEYE IFWNELVNAE LAEVKDFDRE VVFEGCMPVE TMAKRGKDTL
RFGPLKPVGL VDPNTGKEPY AVVQLRQDNS EGTMYNMVGF QTRLKWPEQK RVFRLIPGLE
NAEFVRYGVM HRNTFINSPV LLDATYCLKK SPNIYFAGQI TGVEGYVESA SSGMVAGINA
AMDFLGKDRV VFPKSTAIGA LSHYVSDSSI KNFQPMNVNF GIMESFPLKI RDKRKRNYET
AMRALKILKE YVSKYS
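
The gene and protein sequences above are linked by the record's translation table (11). The following is a minimal Python sketch of translating the gene sequence and computing its GC fraction. It assumes the sequence is pasted in as shown, with spaces and line breaks between 10-base groups. The codon table is built from the standard amino acid assignments, which translation table 11 shares with table 1 for internal codons; the two tables differ in their permitted start codons, which this sketch does not model. The function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First ten codons of the gene sequence above.
print(translate("ATGATCGATT ATATAAATGT GATAGGCGCC"))
```

Applied to the first 30 bases of the gene sequence, `translate` gives the first ten residues of the protein sequence, MIDYINVIGA. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the reported GC content.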