Gene Moth_1235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1235 
Symbol 
ID3833177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1275310 
End bp1276722 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content59% 
IMG OID637829170 
Productradical SAM family protein 
Protein accessionYP_430092 
Protein GI83590083 
COG category[C] Energy production and conversion 
COG ID[COG1625] Fe-S oxidoreductase, related to NifB/MoaA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATAACT TTCACCTGGC CCTAATGACA GCTTCCAGGT ACAATATCCT CCCCCTCACC 
TCCACCTGCA ACCTGGGCTG CCTTTTTTGC AGCCATCGCC AGAACCCCCC GGGAGTTGAA
ACCTGGCGCC TGCAGCCACT CAAGGGCGAG GAGATAGACA ACCTCCTGGA TTACCTGGAC
GGTGACCGAA AAATAGTAAT CGGCGAGTCA GCCACCCGGC TGATTGAGGG AGAACCTCTG
ACTCATCCCG ATTTTCTGGC AATCATACGT AAAGTACGAC GGCGTTTTCC CCGGGCAAGG
CTGGAGATTA CCACTAACGG CACCCTCTTA ACCCCGAACT TGATCAGAGA ACTGGCTGAT
TTGCAGCCCC TGGAAATAAA TCTCTCCCTC AACAGTGCCA GCCCGGAGGG ACGCCGGCGG
CTGATGGGAG ATAGGAATCC CGGTGCTGCT CTCCAGGCTC CCATGGCTTT ACAGCAGGCA
GGGATAATCT ACCAAGGCAG CCTGGTGGCC TGTCCATGGC TGGTGGGCTG GGACGATTTT
CGTGAAACTA TCCTCTACCT GGCCCGGGCG GGGGCCAGGA CCATCCGTGT TTTTTTACCC
GGTTACACCC GGCTGGCTCC GCCAGAGCTA CGTTTTCCCC CTGGCCTCCG CCGGCAAATA
GAGGAGGAGC TGGAACAACT CCGTTCTTTA ACCGATGTCC CCCTGTTACT GGAACCTCCT
CTTCTTGACA ACCTGCTGCC GGAAATAGAA GGAGCTATCC CCGGAACGCC AGCAGCCAGG
GCCGGGTTGA AGCGGGGCGA TCTGATCCTG GAAATAGATG GCCAAAAGCC CCGCAGCCGG
GTAGAGGCTT ACCGGTGGGC CGCCGTCCCC GGCCGGCGGC GCCTCCTGGT CGGCAGGCAT
CAGGGAAAAA ATTACGGCCC CGCTGAAATA AAGTTAACCG GCGGGCAACA AGTGAAAAAT
ACGAGAAGCA GCTCAGAACT GGAGGGTATC CGGGGGGCAC TTTATGAGCT GGAGGTGGGT
CGGGAGGGCA GCGGTTTGAC CTTTGCCTGG GATTTCGACC CGGACCTGCT GCCGGAAGTA
GAAAAGGCAT GCCGGCGCCA CGGCGCCCGG AAGGTTTTAA TCCTTACTTC CAGGCTGGCT
GTGGCAGTGA TAAAGGAGGC CGTGGCCCGA CTATCTCTTT CCCTGGAGGT AGTCGTCACC
CCCAGCCGTT TCTTCGGCGG GTCCATCGGC TGTGCCGGCC TGTTAACCCT GGCTGATTTC
CAGGCCGCCT GGCAGGATTG GCAGAAGAAT AATGGCCCGG CTGATCTCAT CATCCTCCCG
TCCATCGCCT TCGACTACCG GGGACGGGAC CTGGTTGGTG AGCATTACCT GAGCCTGGCG
GCAAGTACCG GCGTTCCAGT GGAACTGGTA TAA
 
Protein sequence
MDNFHLALMT ASRYNILPLT STCNLGCLFC SHRQNPPGVE TWRLQPLKGE EIDNLLDYLD 
GDRKIVIGES ATRLIEGEPL THPDFLAIIR KVRRRFPRAR LEITTNGTLL TPNLIRELAD
LQPLEINLSL NSASPEGRRR LMGDRNPGAA LQAPMALQQA GIIYQGSLVA CPWLVGWDDF
RETILYLARA GARTIRVFLP GYTRLAPPEL RFPPGLRRQI EEELEQLRSL TDVPLLLEPP
LLDNLLPEIE GAIPGTPAAR AGLKRGDLIL EIDGQKPRSR VEAYRWAAVP GRRRLLVGRH
QGKNYGPAEI KLTGGQQVKN TRSSSELEGI RGALYELEVG REGSGLTFAW DFDPDLLPEV
EKACRRHGAR KVLILTSRLA VAVIKEAVAR LSLSLEVVVT PSRFFGGSIG CAGLLTLADF
QAAWQDWQKN NGPADLIILP SIAFDYRGRD LVGEHYLSLA ASTGVPVELV