Gene Moth_1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1601 
Symbol 
ID3832747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1636385 
End bp1637803 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content55% 
IMG OID637829530 
Productsulfite reductase, dissimilatory-type alpha subunit 
Protein accessionYP_430450 
Protein GI83590441 
COG category[C] Energy production and conversion 
COG ID[COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits 
TIGRFAM ID[TIGR02064] sulfite reductase, dissimilatory-type alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000022809 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTTA AGCCCAAGCG GCCGCAGAAG GACTTAAAGT ACGACGAATT GCGCATTTAT 
ACCGACGAAG AATTGCATAA CTACTCGGAA GAAGAACTAA AAAACTTTAA GCTCAAACAC
GACATTCCCG ACCTGGACGA ACTGGAAAAG GGACCGTGGC CCAGCTTTGT CGCCGATGCC
AAGCGGGAAG CCCTGCATCG CAAGAAGCTC GCCGATGACC GGCTTATGAT CGACAAGGAC
GTAGTTGACG ATTTACTCGG ACAGCTGCAA TTATCCTTTG ACGACGGAGA AACCCACTGG
AAGCACGGCG GTATCGTCGG CGTCTTCGGT TACGGCGGCG GCGTCATCGG CCGGTACTCG
GACGTTCCCG AAAAATTCCC TTCGGTGGCC CAATTCCATA CCCTGCGGGT CAACCAACCG
GCCAGCAAGT TCTATAAGAC TGATTTCTTG CGGGCCCTGG CCGACCTGTG GGAGTACCGC
GGCAGCGGTA TGTTCAATCT GCATGGCTCC ACCGGGGACA TCATTCTCCT GGGAACCTCT
ACCGAACAGC TGGAACCTAT CTTTTATGAT CTGACCCACG AGCTGGATCA GGACCTTGGC
GGGTCCGGTT CCAACCTGCG GACACCTTCC TGCTGCATCG GCAAGGCCAG GTGCGAGTTT
GCCTGCATCG ACACCCAGGA TTTATGTTAT GAGATAACCA CCCACTACCA GGATGAGCTG
CACCGCCCGG CCTTCCCCTA CAAGTTTAAG ATTAAGGTCG ACGGTTGCCC CAACGGTTGC
GTAGCTTCCA TTGCCCGTTC TGACATGTCC CTCATTGGCA CCTGGCGGGA CGATATCCGC
ATTGACCAGG AGGCTGTACG GGCCTACATG GCCGGCGATA TTGAACCCAA CGGGGGCGCC
CATAAGGGCC GCGATTGGGG CAAATTTGAT ATCCAGAAAG AGGTTATTGA TCTCTGCCCG
ACTGGCTGTA TGGCCCTGGA AGACGGCCAG CTGAAAATCA ATAATAAAGA ATGCAACCGC
TGCATGCACT GCATCAACGT CATGCCGCGA GCCCTGAAAC CGGGAAGGGA TACCGGCGTC
AGCGTCCTCT TCGGGGCCAA GGCACCCATC CTGGAGGGCG CCCAGCTGGC GGTATTAACA
ATACCCTTCA TGAAGGCCGA AGCGCCCTAC GATAATATTA AAGAGCTGGT TGAAAAGGTC
TGGGATTGGT GGATGGAAGA GGGCAAAAAC CGTGAGCGCC TGGGCGAACT GATCCAGCGC
AAGGGTTTAC CCAAGTTCCT GGAGGTTATC GGCGTACCGG CCGCACCCCA AATGGTTCGC
CATCCCCGGA CCAATCCTTA TATCTTCTGG AAGGAAGAAG ACGTACCCGG CGGCTGGAAA
CGCGATATCA ACGAATACCG GCAGCGGCAC AAGAGATAG
 
Protein sequence
MEFKPKRPQK DLKYDELRIY TDEELHNYSE EELKNFKLKH DIPDLDELEK GPWPSFVADA 
KREALHRKKL ADDRLMIDKD VVDDLLGQLQ LSFDDGETHW KHGGIVGVFG YGGGVIGRYS
DVPEKFPSVA QFHTLRVNQP ASKFYKTDFL RALADLWEYR GSGMFNLHGS TGDIILLGTS
TEQLEPIFYD LTHELDQDLG GSGSNLRTPS CCIGKARCEF ACIDTQDLCY EITTHYQDEL
HRPAFPYKFK IKVDGCPNGC VASIARSDMS LIGTWRDDIR IDQEAVRAYM AGDIEPNGGA
HKGRDWGKFD IQKEVIDLCP TGCMALEDGQ LKINNKECNR CMHCINVMPR ALKPGRDTGV
SVLFGAKAPI LEGAQLAVLT IPFMKAEAPY DNIKELVEKV WDWWMEEGKN RERLGELIQR
KGLPKFLEVI GVPAAPQMVR HPRTNPYIFW KEEDVPGGWK RDINEYRQRH KR