Gene Cthe_0036 details

Gene Information

Locus tag: Cthe_0036
Symbol:
ID: 4808801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 44775
End bp: 46403
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 44%
IMG OID: 640105445
Product: hydroxylamine reductase
Protein accession: YP_001036470
Protein GI: 125972560
COG category: [C] Energy production and conversion
COG ID: [COG1151] 6Fe-6S prismane cluster-containing protein
TIGRFAM ID: [TIGR01703] hydroxylamine reductase
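
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record: Start bp 44775, End bp 46403.
length = gene_length_bp(44775, 46403)
print(length, protein_length_aa(length))
```

Under these assumptions the results agree with the Gene Length (1629 bp) and Protein Length (542 aa) fields of the record.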


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCATGT TTTGTTATCA ATGTCAGGAA ACTGCAGGAG GAAAAGGCTG CACAGTACGC 
GGGGTGTGCG GCAAGAATGA GGAAGTTGCA AAGCTCCAGG ATCTTTTGCT TTACACCGTT
AAAGGCATCT CATATATCGT TACCAAGGGA AATATTGATG CCGCAAAACT TGGAAATACA
AACCATGAAG TATTAAGCAG CCTGTTTATG ACAATTACCA ATGTGAATTT TGACGATGGT
TCAATTGAAA AACAGATAAG AAAAATGCTT GCCGTAAGGG ACGAAATGAA AAAGTCCGTA
CAGGCGGAAG GCCTTCATGA TGCGGCGGTA TTTTCTGTGG ATTCAAGAGA ATCCATGTTA
AAAAAAGCCG ATTCTGTGGG GGTACTCTCC ACTCAAAATG AAGATATACG TTCATTAAGA
GAAATGATTA CCTATGGTGT CAAGGGTATG GCAGCCTATG CCGAGCATGC GAAAAATATA
GGAAAAGAAG ACAAGGAAAT CTATTCCTTT ATATATGAGG CTTTGGCGGC AACTTTGGAC
GACTCTCTTT CAGTTGACGA CCTTTTTGCG CTGACACTCA AAACCGGTGA ATACGGTGTA
AAGGTTATGG CGCTTCTGGA CGAGGCCAAT ACATCAAGGT TCGGAAATCC GGAAATTACC
GAAGTTAATA TCGGAGTCAG AAAAAATCCG GCCATCCTTG TTTCCGGGCA TGACCTTACC
GATCTTGAAC AGCTTTTGGA GCAGACAAAA GGAACAGGCG TGGATGTATA CACCCATGGC
GAAATGCTTC CGGCCCACTA CTATCCGGCT TTCAAAAAGT ATGACAATTT CGTGGGCAAC
TACGGAAACG CATGGTGGAA GCAGGTGGAA GAATTTGAGT CCTTCCACGG GCCTATTTTG
TTTACTACAA ACTGTATTGT TCCGCCAAGG AGCGAAGAGG TCAGAAGAAG GATATTTACA
ACAGGTTCGG CAGGTTTCCC CGGATGCAAG CATATTGAAG CCGATGAAAA CGGCAAGAAG
GATTTTTCGG AGATAATTGA GCTTGCGAAA ACTTTGCCTG CTCCTGATGA AATTGAAACG
GGAAGCATTG TCGGCGGATT TGCCCACAAT CAGGTAATGG CCCTTGCCGA CAAGGTGGTT
GAAGCCGTCA AATCCGGAGC TGTTAAAAAG TTCTTTGTTA TGGCCGGATG CGATGGGCGC
ATGAAGTCCA GAAGTTATTA CACCGAGTTT GCCCAAAACC TTCCTAAAGA CACTGTGATT
TTAACGGCAG GATGTGCAAA ATATCGTTAT AACAAGTTGG GACTGGGCGA TATAGGCGGT
ATTCCGAGGG TGCTTGATGC CGGACAGTGC AACGATTCGT ATTCTTTGGC GGTTATTGCC
CTGAAACTTA AAGAAGTATT TGGACTGGAC GATATCAACA AGCTTCCGAT TGCCTTTAAT
ATAGCGTGGT ATGAGCAGAA AGCCGTAATA GTTCTTTTGG CTCTCTTGTA TCTGGGCGTG
AAGAATATTC ATCTTGGACC TACGCTTCCG GGATTCCTTT CACCCAATGT GGCAAAGGTT
CTGGTGGAGA AGTTTGGCAT CGCAGGTATT GGCACTGTTG AAGATGATAT CAAATTGTTT
ATGTCTTAA
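
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks might be joined and a GC percentage computed; the helper names are hypothetical, and only the first line of the sequence is used here for brevity. Whether the result for the full sequence matches the 44% listed in the record depends on how the database rounds the value.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGCATGT TTTGTTATCA ATGTCAGGAA ACTGCAGGAG GAAAAGGCTG CACAGTACGC"
seq = join_blocks(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To process the whole gene, pass all sequence lines to `join_blocks` as one string; the joined length should then equal the Gene Length field.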
 
Protein sequence
MSMFCYQCQE TAGGKGCTVR GVCGKNEEVA KLQDLLLYTV KGISYIVTKG NIDAAKLGNT 
NHEVLSSLFM TITNVNFDDG SIEKQIRKML AVRDEMKKSV QAEGLHDAAV FSVDSRESML
KKADSVGVLS TQNEDIRSLR EMITYGVKGM AAYAEHAKNI GKEDKEIYSF IYEALAATLD
DSLSVDDLFA LTLKTGEYGV KVMALLDEAN TSRFGNPEIT EVNIGVRKNP AILVSGHDLT
DLEQLLEQTK GTGVDVYTHG EMLPAHYYPA FKKYDNFVGN YGNAWWKQVE EFESFHGPIL
FTTNCIVPPR SEEVRRRIFT TGSAGFPGCK HIEADENGKK DFSEIIELAK TLPAPDEIET
GSIVGGFAHN QVMALADKVV EAVKSGAVKK FFVMAGCDGR MKSRSYYTEF AQNLPKDTVI
LTAGCAKYRY NKLGLGDIGG IPRVLDAGQC NDSYSLAVIA LKLKEVFGLD DINKLPIAFN
IAWYEQKAVI VLLALLYLGV KNIHLGPTLP GFLSPNVAKV LVEKFGIAGI GTVEDDIKLF
MS