Gene Moth_1629 details


Gene Information

Locus tag: Moth_1629
Symbol:
ID: 3831258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1664993
End bp: 1666060
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 63%
IMG OID: 637829554
Product: hydrogensulfite reductase
Protein accession: YP_430474
Protein GI: 83590465
COG category: [C] Energy production and conversion
COG ID: [COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits
TIGRFAM ID: [TIGR02066] sulfite reductase, dissimilatory-type beta subunit
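
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length counts the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above; names are hypothetical helpers.
START_BP = 1664993
END_BP = 1666060


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1068
print(protein_length(length_bp))  # 355
```

Under these assumptions the computed values agree with the Gene Length (1068 bp) and Protein Length (355 aa) fields.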


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.501606
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATG ATCTGCCGGC CGGGATTATG CCCGACTACC GCCAGCAGAT ACCACCAGAA 
CTCCTCGCCT GCCGGGGAAA GTGGGTCAGG CATGAAATGG TCCGGCCAGG AGTGATCCGT
CATCTGGCCG GCGATGGTAC CGCCATCCTG ACGGTCCGCA TTTTACTGCC ACCCAACGGC
CTGTTGAGCG CGGCTACCCT GCGTCAGCTG GCCCGCTGGA TCCGGGTTTA TGCCCTGACT
GGCCGGCGCA CCAGCCGCCA GGGTTTCGAG TTTGTCGGCG TCCGGCCGGA ACTCCTGGAT
AACTTCCTGG CCGAGCTGGC GGCGAGCGGT TTTCCGGCCG GGGGTACTGG TAATAGTCTG
CACCAGATTA AGTGCTGTAC CTCCTTCATC CATTGCCAGA ATGCCGCCGT CGATGCGCCC
AGCATCGCCA AAACCCTGGC GGACTATTTG TACCCGGCGT TTTTCCACCA GGACCTGCCG
GCACCGCTGA AGATATCGGT AACCGGCTGC CCCAACCAGT GTGGCGGCGG TGTCGAGGCG
GACATCGGCA TATCCGGATA TTTTGCCACC GTACCCAGGG TGGACGACGC CGGCCTGATG
GCAGCCAATA TCGACTTCGG CCACCTGATC TCCGGTTGCC CGGTAGGGGC CATTCGTCCC
CGGCAGGTCG AGGGCGGGAC GACAGTGGTT ATTAACGCCG AACGTTGCAT CCGCTGCACC
TCCTGTATCC AGGTGGCGCC TGAAGGGATA AAGCCCGGCC CGGAGCGCTT TGTCGCCATC
GCCGTCGGCG GCCACGGCGG TAATAACCGG CGGGGCCCAG AAATGGCAGC CGTGGTGTTT
TCCCGGGTAC CGGCCAGGCC CCTGGACTAC GCGGCCATTT GCGAACGGGT GCAACGGATT
ATCGCCTGCT GGCGGGACAG GGGGAAACGC GGGGAAAGGC TCGCCGGCTT CCTGGAACGC
CTTGGCTGGC CGGCTTTTCT CAAGGCCGTA GGGGCTAGCC CGGTAGCAGA CATTTTTGAT
AACCATTTCC CCTTTGCCGG CCTACGCCGG GACCTGCACC TGCGGTAG
 
Protein sequence
MKNDLPAGIM PDYRQQIPPE LLACRGKWVR HEMVRPGVIR HLAGDGTAIL TVRILLPPNG 
LLSAATLRQL ARWIRVYALT GRRTSRQGFE FVGVRPELLD NFLAELAASG FPAGGTGNSL
HQIKCCTSFI HCQNAAVDAP SIAKTLADYL YPAFFHQDLP APLKISVTGC PNQCGGGVEA
DIGISGYFAT VPRVDDAGLM AANIDFGHLI SGCPVGAIRP RQVEGGTTVV INAERCIRCT
SCIQVAPEGI KPGPERFVAI AVGGHGGNNR RGPEMAAVVF SRVPARPLDY AAICERVQRI
IACWRDRGKR GERLAGFLER LGWPAFLKAV GASPVADIFD NHFPFAGLRR DLHLR
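
The relationship between the gene sequence and the protein sequence can be checked by translating codon by codon. The sketch below is an illustration only, not the method used to produce this record. It builds the standard codon table, whose codon assignments translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard genetic code in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = "ATGAAAAATG ATCTGCCGGC CGGGATTATG CCCGACTACC GCCAGCAGAT ACCACCAGAA"
print(translate(first_line))  # MKNDLPAGIMPDYRQQIPPE
```

The 20 residues printed match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.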