Gene Moth_2191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2191 
Symbol 
ID3832866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2289914 
End bp2291935 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content52% 
IMG OID637830113 
Producthydrogenase 4 subunit B 
Protein accessionYP_431023 
Protein GI83591014 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.660892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTACGC AACAGCTTCT GCTACTCTCT GTGCTCTTGT ACGTTGCCGG AGCCCTTGCC 
TCCCTGGCTC TTAATCGGGC CGGTAAAATT GCCAACTATG CTTCAGGTAT AAGTGCCCTT
GCAGCAGCAG GTACCGGGAT GGCCTCGGCC GTCCAGGTAC TTGCCGGCGG AGCAGCTTTT
ACCTGGGAAG CGGCGGGGTT TATACCCTTT GCCAAGTTTA TTATAAAGGT TGATCCCCTC
TCTGCCTTTA TGTTACTGGT TATTTCCCTT CTGACAGGGG CTACGGCTCT ATATTCCCTC
TCGTACCTGG ATGAGTATAC CGGTAAAGGC GCAGGGGTTA TGGGTTTTTT CAATAACCTC
TTTATTGCCT CTATGGTATT AGTGGTCATT AGTGGGAATG CTTTTTATTT TCTAATTTTC
TGGGAACTGA TGACGCTGGC CTCTTATTTC CTGGTTAGCT TTGATCAGGA AGACAGTGAA
GCTGTCAAGG CCGGGTTCAT CTATCTTTTT ATGGCCCACG CGGGAACGGC TTTGATTATG
CTGGCTTTTA TCTTATTCTT TGTCTATACA GGTACCTTCG ATTTCGCTTC CTTCCGTGGG
GCGAACCTCC CGGTGTTTAC AAAGAGCTTG ATCTTCCTGC TAGCTTTCCT GGGATTCGGG
GCCAAGGCCG GTATTATTCC GCTCCATATC TGGCTGCCGA AGGCTCACCC GGCTGCTCCG
TCCAACGCTT CCGCTCTCAT GTCGGGTGTC ATGATTAAAA CCGCTATCTA TGGTATTCTC
AGGGTCAGTG TCGATTTCCT GGGGGCTTCT GTTTGGTGGT GGGGATTTAT TGTCCTGGCC
TCCGGAGCGA TTTCAGCAGT TCTGGGTGTT CTCTACGCCC TGGGGGAGCA CGATATAAAG
CGGCTGCTGG CCTATCACAG TGTTGAAAAC GTTGGGATTA TATTGATGGG AGCCGGCGCC
GGCATGATCG GCATCGCTGC CGGCCAGCCT GTTTTAGGAG TACTCGGGAT CCTGGCAGGC
CTCTACCACT TGTTAAACCA TGCCGTCTTT AAAGGCTTGC TCTTTCTTGG GGCAGGTTCG
GTAATATATC GAACCCATAC GAAACATATG GAGGAACTTG GCGGACTGGC CAGGCGCATG
CCCTGGACGG CACTCGCTTT TCTGGTTGGT GCTGTAGCCA TCTCAGCCAT CCCTCCCCTC
AACGGATTTG TCAGTGAGTG GTTTACCTAC CAGTCGTTGT TTATTGCGAG CACCAGCAGC
ATCCTGGCTG TGAGAGTGTT TGCGCCCCTG TTTGTTGTTA TGTTAGCCCT GACGGGCGCG
CTGGCGGCGA TGTGTTTTGT GAAGGCATAT GGGGTTACTT TTGGCGGTCC CTGTCGCAGC
GGGCATGCCC GTGAGGCCAG GGAGGTTCCC ATACCGATGC TTGCCGGGAT GGCAATTCTG
GCAATTAGCT GCATTATCCT CGGTGTGGGT GCACCGGTTG TTGCTCCTTA TATTGGAAAG
GTGGCTTCGG CATTATTAGC CATTACTGCG GTCCAGGTAA GCGACGGTTT GCTGGTATTT
CCTGCAAACA GCATGCAGGC CATGCTCTCC ACACCGCTCA TCGCCATTCT CCTCGTTGGT
CTTGCTACGT TACCCTTGTT AATTATCGGG ATCCAAGGTG GTTTCCAGGC CGGACGGCGT
ATCGATGCTG AGCCGTGGGC ATGTGGATAC AAGTATTCAC CGCGGATGGC CTATACCGCA
ACTGCCTTTG CTCAGCCGTT GCGCGTACTT TTTCGGCCGG TTTATTCGCT CAGGACCACC
CTCGATGGAC CTGGCTATAC CGTTGCATCG TATTTCAAAG GAGCAGTGGT CTACATCGCC
AGTGTAGAAT CGTTGTGGGA ACGTTACATT TACGCTCCTC TGGCACGGGG CACGGTATAT
CTGGGTAAAA AATTGCAGGC CTTCCAGATA GGGAACGTCA GGCTATATTG TCTCTATATA
ATCATAACCC TCGTAGTCCT GTTATTAGCG ACGGTTAGAT AG
 
Protein sequence
MLTQQLLLLS VLLYVAGALA SLALNRAGKI ANYASGISAL AAAGTGMASA VQVLAGGAAF 
TWEAAGFIPF AKFIIKVDPL SAFMLLVISL LTGATALYSL SYLDEYTGKG AGVMGFFNNL
FIASMVLVVI SGNAFYFLIF WELMTLASYF LVSFDQEDSE AVKAGFIYLF MAHAGTALIM
LAFILFFVYT GTFDFASFRG ANLPVFTKSL IFLLAFLGFG AKAGIIPLHI WLPKAHPAAP
SNASALMSGV MIKTAIYGIL RVSVDFLGAS VWWWGFIVLA SGAISAVLGV LYALGEHDIK
RLLAYHSVEN VGIILMGAGA GMIGIAAGQP VLGVLGILAG LYHLLNHAVF KGLLFLGAGS
VIYRTHTKHM EELGGLARRM PWTALAFLVG AVAISAIPPL NGFVSEWFTY QSLFIASTSS
ILAVRVFAPL FVVMLALTGA LAAMCFVKAY GVTFGGPCRS GHAREAREVP IPMLAGMAIL
AISCIILGVG APVVAPYIGK VASALLAITA VQVSDGLLVF PANSMQAMLS TPLIAILLVG
LATLPLLIIG IQGGFQAGRR IDAEPWACGY KYSPRMAYTA TAFAQPLRVL FRPVYSLRTT
LDGPGYTVAS YFKGAVVYIA SVESLWERYI YAPLARGTVY LGKKLQAFQI GNVRLYCLYI
IITLVVLLLA TVR