Gene Moth_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1603 
Symbol 
ID3832749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1639124 
End bp1640281 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content53% 
IMG OID637829532 
Productpolysulphide reductase, NrfD 
Protein accessionYP_430452 
Protein GI83590443 
COG category[C] Energy production and conversion 
COG ID[COG5557] Polysulphide reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00601969 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGCCA GGGCTTTTAA CGGCAGCAAA AGGTACTGGA GCTGGGTAGT ATTCCTGCTA 
GTCCTGGCCG GCATAGGGTT CGCCTGTTAC CTGGTCCAGT TCAACCGGGG CCTGACAGTT
ACCGGTATGA GCCGGGACGT TTCCTGGGGC CTGTATATTG GCCAGCTAAC CTTCCTGGTA
GGGGTGGCCG CTTCGGCAGT AATGCTGGTG CTGCCCTACT ATCTTCATAA TGTCAAAGCC
TTTGGGCGTA TTACTATCCT GGGGGAGTTC CTGGCGGTGG CCACCATCAT CATGTGCTTA
CTCTTTGTTT TCGTCGACCT TGGGAAACCC ATGCGCCTGC TGAACATGAT CCTCTACCCG
ACCCCCAACT CCATCCTCTT CTGGGATATG GTTGTGTTAA CTGGTTACCT CCTTCTCAAT
ATTGTCATCG GCTGGACCAT GCTGGAAGCG GAATATAAGA CGGTAGCCCC GCCAAAATGG
GTTAAGATTT TGATCTATAT ATCCATCCCC TGGGCCATCA GCATCCATAC GGTTACGGCT
TTTCTTTACG CCGGCCTGCC GGGGCGCCAT TACTGGCTGA CAGCCATTAT GGCCGCCCGC
TTCCTGGCCT CGGCCTTTGC CTCGGGTCCG GCCCTGCTGA TTGTCCTGTG CTTCATTATC
AGGAAGGTCA GCAAGTTTGA CCCCGGCCGT GAGGCTATCG ACAAGCTGGC AGCCATCGTT
ACCTATGCCA CCATTATCAG CGTCTTCTTT GTGGGCCTGG AATTCTTCAC TGCCTTTTAT
AGCCAGGTTC CGGCCCATGG TATGGATAGC CTTATCTATC TCTTTGCCGG CCTGGATGGC
CATGCTAAAT TAGTTCCTTT GATGTGGCTG TTTGCCGTCC TGGCTGTAAT AGCCCTTGTT
TTGCTAATCA ATCCCCGCAC CCGGAACAGG GAGGCTACCC TGATAGCCGC CTGCGGCGCG
GTATTTATCT CCATGTGGTT GGAAAAGGGT ATTGGTTTGG TAATCGGCGG CTTTATTCCC
AACGCCTTCA ACCGGGTGAC GGAATACAGC CCTACTCCCC TGGAGATGTT GATCACCCTG
GGTATCTGGG CCGTCGGGGC CCTCATCCTG ACTTTTCTTT ATAAAATTGC TATTGCGGTT
AAAGAAGATT TGCTTTAA
 
Protein sequence
MIARAFNGSK RYWSWVVFLL VLAGIGFACY LVQFNRGLTV TGMSRDVSWG LYIGQLTFLV 
GVAASAVMLV LPYYLHNVKA FGRITILGEF LAVATIIMCL LFVFVDLGKP MRLLNMILYP
TPNSILFWDM VVLTGYLLLN IVIGWTMLEA EYKTVAPPKW VKILIYISIP WAISIHTVTA
FLYAGLPGRH YWLTAIMAAR FLASAFASGP ALLIVLCFII RKVSKFDPGR EAIDKLAAIV
TYATIISVFF VGLEFFTAFY SQVPAHGMDS LIYLFAGLDG HAKLVPLMWL FAVLAVIALV
LLINPRTRNR EATLIAACGA VFISMWLEKG IGLVIGGFIP NAFNRVTEYS PTPLEMLITL
GIWAVGALIL TFLYKIAIAV KEDLL