Gene Moth_1600 details

Gene Information

Locus tag: Moth_1600
Symbol:
ID: 3832746
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1635185
End bp: 1636372
Gene length: 1188 bp
Protein length: 395 aa
Translation table: 11
GC content: 55%
IMG OID: 637829529
Product: sulfite reductase, dissimilatory-type beta subunit
Protein accession: YP_430449
Protein GI: 83590440
COG category: [C] Energy production and conversion
COG ID: [COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits
TIGRFAM ID: [TIGR02066] sulfite reductase, dissimilatory-type beta subunit
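
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1635185, 1636372)
print(length)                     # 1188
print(protein_length_aa(length))  # 395
```

Both results agree with the gene length and protein length fields in the record.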


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000981714
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATTAT CACAAGGGAG ACTGGGTAAA TACAACCCGG AGAAACCGAC GGAAAACAGG 
ATTACGGATA TCGGTCCCAG GCACTTTTGG GATTTCTTCC CACCGGTTAT CCAGAAAAAC
TACGGCAAGT GGGCTTATCA TGACATCCTG GAACCCGGTA TCATGGTCCA TGTCTCCGAG
ACCGGGGACA AGGTCTTTAC CGTGCGTTGC GGCGGCTCAC GACTTATGAC GGCGGAAAAT
ATCCGGGAGA TCTGCGACAT TGCCGATAAA TACTGCGATG GTTATGTGCG CTTTACCACC
CGGAATAATA TTGAATTCAT GGTAACAAGT TATGAAAAAG TCCAGGAGTT GAAGAAAGAC
CTCCTGAGCC GGAAATACAT CTCCGGGAGC TACAAATTTC CCATCGGCGG CACCGGCGCC
GGGGTTACCA ACATCGTTCA TACCCAGGGC TATATCCATT GCCATACCCC GGCAACCGAC
GCTTCCTCCA TGGTCAAGGC AGTCATGGAT GAACTCATGG ACTACTTCAC CGGCATGACC
CTGCCGGCCC ACGTCCGGAT TTCCATGGCC TGCTGCCTGA ATATGTGCGG CGCCGTTCAC
TGCTCGGATA TTGCCCTCCT GGGCATCCAC AGGAAACCGC CGATTGTAGA CGATAATATT
GATTCCATTT GTGAAATACC CCTGGCCATT GCCGCCTGTC CCGTGGGTGC CATCTCGCCG
GCCAAGACGG AAGACGGCCG GAAGTCGGTG AAGATCAAAG AAGATCGGTG CATGTTCTGC
GGCAATTGCT ATACCATGTG CCCGGCCCTG CCTCTGGCCG ACAAGGAGGG CGACGGCGTG
ACCATCCTGG CCGGCGGCAA GATCTCCAAC CGGATCAGCG AGCCCAAGTT CTCCAAGGTC
ATTGTTCCCT GGTTGCCCAA CAACTTCCCC CGCTTCCCGG AAGTGGTGGC CACAGTTAAG
AAGATCATCG AAGTCTATGC CGCCAATGCC CGCAAGTACG AGCGCATTGG TGACTGGGCG
GAGAGAATCG GCTGGGAGAA GTTCTTCGAG CTGTGTGATT TGCCCTTTAC CGAGCACCTC
ATTGATGATT ACCGCCTGGC CTATGATACC TACCGGACCA GCACCCAGTT TAAATTCACC
GGAAAAGCCT GGGAAGTTTC CAGGGCGGCA GGGGGTATCG ACGACTAG
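
The gene sequence is printed in blocks of ten bases, so it has to be joined before any statistic such as GC content can be computed. The following is a minimal sketch of one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(blocked):
    """Join a blocked sequence (space-separated groups, line breaks) into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCATTAT CACAAGGGAG ACTGGGTAAA TACAACCCGG AGAAACCGAC GGAAAACAGG"
seq = clean_sequence(first_line)
print(len(seq))                       # number of bases on this line
print(round(100 * gc_fraction(seq)))  # GC percentage of this line only
```

Running the same functions on the full sequence text would give a value to compare against the GC content field in the record.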
 
Protein sequence
MALSQGRLGK YNPEKPTENR ITDIGPRHFW DFFPPVIQKN YGKWAYHDIL EPGIMVHVSE 
TGDKVFTVRC GGSRLMTAEN IREICDIADK YCDGYVRFTT RNNIEFMVTS YEKVQELKKD
LLSRKYISGS YKFPIGGTGA GVTNIVHTQG YIHCHTPATD ASSMVKAVMD ELMDYFTGMT
LPAHVRISMA CCLNMCGAVH CSDIALLGIH RKPPIVDDNI DSICEIPLAI AACPVGAISP
AKTEDGRKSV KIKEDRCMFC GNCYTMCPAL PLADKEGDGV TILAGGKISN RISEPKFSKV
IVPWLPNNFP RFPEVVATVK KIIEVYAANA RKYERIGDWA ERIGWEKFFE LCDLPFTEHL
IDDYRLAYDT YRTSTQFKFT GKAWEVSRAA GGIDD
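
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. This is a sketch rather than the database's own method: it builds the codon table from the standard NCBI ordering, which for internal codons gives the same amino acid assignments as translation table 11, and it does not handle alternative start codons (not needed here, since the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGGCATTAT CACAAGGGAG ACTGGGTAAA"))  # MALSQGRLGK
```

The output matches the first block of the protein sequence above.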