Gene Moth_1630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1630 
Symbol 
ID3831259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1666191 
End bp1667213 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content63% 
IMG OID637829555 
Producthydrogensulfite reductase 
Protein accessionYP_430475 
Protein GI83590466 
COG category[C] Energy production and conversion 
COG ID[COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits 
TIGRFAM ID[TIGR02064] sulfite reductase, dissimilatory-type alpha subunit 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGAAG CAGGGCTGGC AGAGGGAGAA TCCGCTTATG GCCACGGCGG CTATATCAGT 
GTCCCGGAAC TGCCTGCCGG GGTGGCTGTG CGCAACAGCC GTCGTCCGGA CATCATTCAG
GAATCGTCCT ATCTCCGCAT CCTGGCGCCG GCGGGGGGAT GGCTCGCGGC TGCAACCCTG
GAGAAACTGG CCGATTGTGC TGAGACCTAC GGCCGGGGGC TGGTTCACCT GACCAGCGGC
GGCACCATAG AGATCTACAC CGGCCGCGAA CAGATGCTAC CCCTGGTCCG GGAGTTAAAC
AGCGGGGGTT TGGATGTGGG TTCAACGGGG GATGATCTAC GTTGCCTGAC GGCCTGCTGC
GGACCGGTAC GCTGCGAGCA TGCCCTGGTG GATGCTCCGG CCCTGGCCAC CTACCTGGGG
CAGCGCTTTA TTGACGACCA GCAGTATCCC GGCTACCCCC AGAAGGTTAA GAGCGCCGTG
GCCGGTTGTC CCAATGATTG TATCCGGGCC ATGATGCAAA AGGACCATTC CTTTATCGGC
GTTTACCGGG ACTGGCCCAT GGTAGACGGC GAGATGCTGG CCGCCTGGCG GGAGAAGGGC
GGCGACCTGA ACGGCCTCCT GGCCGCCTGC CCGGCAGGGG CCCTGACCCT GGTGGGGGAT
ACCTTGGAGG TAGACAAGGA ACGCTGCTGG CGTTGCATGG CCTGTATCAA CTACTGCCCG
GCCATCCGTC CCGGCCGCGA GCGGGGGGTC GCCTGGGTTG CCGGGGGCAA ATACGGTCAC
CGGGGCCCCC AGGGCCCCAT GGTAGGTTTT GTCCTGGTGC CCTTCATCCC CGTTGCCGAT
GACGATTACA CCGCCGTCGG CGATATCTTC GGCGCTTTCC TGGAGTTCTG GGCGGATAAC
GCTAAGAAAA AGGAGCGGGT GGGGGACTTT ATCGCCCGCG TTGGCCCCCT GAAGATTTTA
CGGGAACTCG GCCTCGAGGC CCAGCCCCAG ATCCTCTTGA GCCCCAAGAG GAATGTGTAC
TAG
 
Protein sequence
MYEAGLAEGE SAYGHGGYIS VPELPAGVAV RNSRRPDIIQ ESSYLRILAP AGGWLAAATL 
EKLADCAETY GRGLVHLTSG GTIEIYTGRE QMLPLVRELN SGGLDVGSTG DDLRCLTACC
GPVRCEHALV DAPALATYLG QRFIDDQQYP GYPQKVKSAV AGCPNDCIRA MMQKDHSFIG
VYRDWPMVDG EMLAAWREKG GDLNGLLAAC PAGALTLVGD TLEVDKERCW RCMACINYCP
AIRPGRERGV AWVAGGKYGH RGPQGPMVGF VLVPFIPVAD DDYTAVGDIF GAFLEFWADN
AKKKERVGDF IARVGPLKIL RELGLEAQPQ ILLSPKRNVY