Gene Moth_1717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1717 
Symbol 
ID3833167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1759099 
End bp1760820 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content60% 
IMG OID637829642 
ProductIron hydrogenase, small subunit 
Protein accessionYP_430562 
Protein GI83590553 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.118526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.410336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCACCG TAAAACTGAC CATTGACAAT ATACCGGTGG AAGTTGAGGC CGGAACGACG 
ATCTTAAAGG CCGCCGAGGA GGCTGGTATT CATATACCCA CCCTTTGTTA CCTGGAAGGC
ATCAACGAGA TCGGCGCCTG CCGGGTCTGC GTGGTTGAAG TTGAAGGGGC CAGGAACCTG
ATGGCCTCCT GTGTAGCCCC GGTAGCCGAA GGTATGGTGG TAAAGACCAA CAGCCCGAGG
GTAAGGATGG CCCGGCGCCT GAACGTTGAG CTCCTCCTTT CCAACCACGA GATGGAGTGC
CCGACCTGCA TCCGTAACTT GAACTGCGAA CTCCAGTCCC TGGCCCGGGG GCTGGGTATC
CGCCAGGTAC GCTTTAAGGG CAAAAAGAGC GAGCACCCCG TGGACGATTC GACCCCAGCC
CTGGTGCGCG AGCCTGACAA GTGCATTCTC TGCCGCCGTT GCGTGGCCGT TTGCGAAAAG
GTCCAGGGAG TTATGGCTAT AGCCCCCCTG GGGCGGGGCT TTGACACCGT CATCGCCCCG
GCCTTCCAGG AGAAGCTCGT GGACATCGCC TGCGTGGAAT GCGGCCAGTG CACCCTTGTC
TGCCCGGTGG GCGCCCTGTA CGAAAAAGAT TACACCAGCG AAGTCTGGGC GGCCCTGGCC
GACCCGGAGA AGTTCGTCGT CGTCCAGACG GCGCCGGCCA CCCGGGTGTC CATCGGCCAG
GAGTTCGGGT TAGCACCGGG GAGCATCAAC ACCGGCCAGA TGGTGGCGGC TTTAAGGCGC
CTGGGCTTTG ACAGGGTCTT TGATACCGAC TTTTCCGCCG ACCTGACCAT TATGGAAGAA
GGCTCCGAGT TTATTGAGCG CTTTACCAAA GATGGCCCCC TGCCGTTGAT CACCTCCTGC
AGCCCGGGCT GGATCAAGTT TATGGAGCAC TTCTACCCGG AGCTTATACC CAACGTCTCC
ACCTGCAAGT CGCCCCAGCA GATGTTCGGC GCCGTGGCCA AGACTTACTA TGCCCGGAAG
GCCGGTGTAG ATCCGGCCAG GATGGTGGTC GTCTCCATCA TGCCCTGCAC TGCCAAGAAG
TTCGAGTGCC AGCGGCCGGA GATGCGGGAC AGCGGCTATC AGGACGTGGA CTACGTCCTC
ACCACGCGCG AGCTGGCGCG GATGATCAGG GAAGCCGGGA TTGATTTCAA AAACCTCCCG
GAAGAGCAGT ACGACGATCC ATTAGGCGAA TCCACCGGAG CGGGGGTCAT CTTCGGTGCC
ACAGGCGGGG TCATGGAGGC GGCCTTGCGT ACGGCCTACG AACTAATTAC CGGCGAGACC
CTGCCCGCCC TGGACTTCTA TGATATCCGC GGCCTCAAGG GCATCAAGGA AGCCACGGTA
GACATCAAGG GTACCAAAGT CCGGGTGGCT GTAGCCCACA GCCTGGGCCA TGCCCGGCAG
CTTTTAGAGC GGGTCAAGGC CGGGGAGCAG TATCACTTCA TTGAAATCAT GTGCTGCCCC
GGCGGCTGCA TTGGCGGCGG CGGGCAGCCC ATCCCCACCA ACACCGAGAT CAGGGAGCAG
CGCATCAAGG GTATTTATCA GGTCGACATG GAGATGCCCA TCCGCAAGTC CCACGAGAAC
CCGTCCGTTC AAGCCCTTTA CCGCGAGTTC CTGGGCAAGC CTTTGAGCGA GAAGTCCCAC
CACTTATTGC ACACCGAATA TACGCGGCGG GGGAAATACT AG
 
Protein sequence
MSTVKLTIDN IPVEVEAGTT ILKAAEEAGI HIPTLCYLEG INEIGACRVC VVEVEGARNL 
MASCVAPVAE GMVVKTNSPR VRMARRLNVE LLLSNHEMEC PTCIRNLNCE LQSLARGLGI
RQVRFKGKKS EHPVDDSTPA LVREPDKCIL CRRCVAVCEK VQGVMAIAPL GRGFDTVIAP
AFQEKLVDIA CVECGQCTLV CPVGALYEKD YTSEVWAALA DPEKFVVVQT APATRVSIGQ
EFGLAPGSIN TGQMVAALRR LGFDRVFDTD FSADLTIMEE GSEFIERFTK DGPLPLITSC
SPGWIKFMEH FYPELIPNVS TCKSPQQMFG AVAKTYYARK AGVDPARMVV VSIMPCTAKK
FECQRPEMRD SGYQDVDYVL TTRELARMIR EAGIDFKNLP EEQYDDPLGE STGAGVIFGA
TGGVMEAALR TAYELITGET LPALDFYDIR GLKGIKEATV DIKGTKVRVA VAHSLGHARQ
LLERVKAGEQ YHFIEIMCCP GGCIGGGGQP IPTNTEIREQ RIKGIYQVDM EMPIRKSHEN
PSVQALYREF LGKPLSEKSH HLLHTEYTRR GKY