Gene Moth_1168 details


Gene Information

Locus tag: Moth_1168
Symbol:
ID: 3833102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1199708
End bp: 1200691
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 59%
IMG OID: 637829101
Product: nitrite and sulphite reductase 4Fe-4S region
Protein accession: YP_430025
Protein GI: 83590016
COG category: [C] Energy production and conversion
COG ID: [COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits
TIGRFAM ID: [TIGR02912] sulfite reductase, subunit C
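
The coordinates and lengths listed above agree with each other, which a short check makes explicit. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1199708
end_bp = 1200691

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 984
print(protein_length_aa)  # 327
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.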


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00249386
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATAATA CGAAAGAATT GATCAAGAAC GCCTACCGCA TTACCAGCCG CAGGGGTTAT 
ACGGCCCTGC GTCTCCGGGT GCCCGGCGGG CACCTGGCGG CCGAATATTT AGGCTTAATC
CAGGATATAG CCCGGCGCTA CGGCAACGGT ACCGTTCATC TGACTACCCG CCAGGGTTTT
GAAATCCCCG GCATACCCCT TGACAAAGTA CCGGAGGTAA ACCAACTGCT GGCACCTATG
TTGCAACAAG AGGCCGCCCT GGGAGTGGCC ATCGAGAATA TCCATGCCGG TTACCCGGCG
GCGGGCACCC GGAACGTCAC GGCCTGCATT GGCAGCAGAG TCTGCCCCTT TGCCAACTTT
GACACCACGG CCCTGGCGCA AAAAATCGAA GGTCTCATCT ATCCCAACCA CTACCACGTC
AAGATTGCCA TTACCGGTTG CCCCAACGAC TGCATCAAGG CCCACCTCCA GGACATCGGC
ATCATCGGCC AGGTGGAGCC GGAGTATGAT CCCGGCCGCT GTATCGCCTG CCAGGCTTGT
GTCAAGAACT GCCGCCAGTT TATCGTCGGC GCCCTGGAGC TGGTCAATTA CCAGGTGGAA
CGCGACGGCA AGCGCTGCCT GGGCTGCGGC GAGTGTATCC TGCAATGTCC CATGGCGGCC
TGGACCAGGG GGCGCCAGTA CTACCGGATC GTGGCCCTGG GTCGCACCGG GAAAAAGAGC
CCGCGCCTGG CGGCCAACTT CCTGGAATGT ATTGATGAAA AGGCCGTCCT GCAGGTAATC
GCCAACCTGT ACCGTTATAT TGAGCGGCAC ATTGATCGCT CCCTGCCCAA GGAGCACGTT
GGCTATATCG TCGACCGCAC GGGCTACCAG GTCTTTAGGG ATGAACTCCT GGATGGTGTT
GATCTGGGTC CGAAAGGCCG GGTGGCCCGG GAATTGCCCT TTTACGGCTA CAGCTACGAC
CGCGACTTGC TGTGGAGCAA ATAG
 
Protein sequence
MYNTKELIKN AYRITSRRGY TALRLRVPGG HLAAEYLGLI QDIARRYGNG TVHLTTRQGF 
EIPGIPLDKV PEVNQLLAPM LQQEAALGVA IENIHAGYPA AGTRNVTACI GSRVCPFANF
DTTALAQKIE GLIYPNHYHV KIAITGCPND CIKAHLQDIG IIGQVEPEYD PGRCIACQAC
VKNCRQFIVG ALELVNYQVE RDGKRCLGCG ECILQCPMAA WTRGRQYYRI VALGRTGKKS
PRLAANFLEC IDEKAVLQVI ANLYRYIERH IDRSLPKEHV GYIVDRTGYQ VFRDELLDGV
DLGPKGRVAR ELPFYGYSYD RDLLWSK
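
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative example, not the method used to produce the record: the function names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is the amino acid assignment of NCBI translation table 11, which matches the standard code for internal codons. The sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGTATAATA CGAAAGAATT GATCAAGAAC GCCTACCGCA TTACCAGCCG CAGGGGTTAT"
print(translate(first_line))  # MYNTKELIKNAYRITSRRGY
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.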