Gene Information |
Locus tag | Moth_1717 |
Symbol | |
ID | 3833167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1759099 |
End bp | 1760820 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829642 |
Product | Iron hydrogenase, small subunit |
Protein accession | YP_430562 |
Protein GI | 83590553 |
COG category | [R] General function prediction only |
COG ID | [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | [TIGR02512] hydrogenases, Fe-only |
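The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length, and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Moth_1717.
length_bp = gene_length(1759099, 1760820)
print(length_bp)                  # 1722
print(protein_length(length_bp))  # 573
```

Both results agree with the "Gene Length" and "Protein Length" fields of the record.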
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.118526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.410336 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCACCG TAAAACTGAC CATTGACAAT ATACCGGTGG AAGTTGAGGC CGGAACGACG ATCTTAAAGG CCGCCGAGGA GGCTGGTATT CATATACCCA CCCTTTGTTA CCTGGAAGGC ATCAACGAGA TCGGCGCCTG CCGGGTCTGC GTGGTTGAAG TTGAAGGGGC CAGGAACCTG ATGGCCTCCT GTGTAGCCCC GGTAGCCGAA GGTATGGTGG TAAAGACCAA CAGCCCGAGG GTAAGGATGG CCCGGCGCCT GAACGTTGAG CTCCTCCTTT CCAACCACGA GATGGAGTGC CCGACCTGCA TCCGTAACTT GAACTGCGAA CTCCAGTCCC TGGCCCGGGG GCTGGGTATC CGCCAGGTAC GCTTTAAGGG CAAAAAGAGC GAGCACCCCG TGGACGATTC GACCCCAGCC CTGGTGCGCG AGCCTGACAA GTGCATTCTC TGCCGCCGTT GCGTGGCCGT TTGCGAAAAG GTCCAGGGAG TTATGGCTAT AGCCCCCCTG GGGCGGGGCT TTGACACCGT CATCGCCCCG GCCTTCCAGG AGAAGCTCGT GGACATCGCC TGCGTGGAAT GCGGCCAGTG CACCCTTGTC TGCCCGGTGG GCGCCCTGTA CGAAAAAGAT TACACCAGCG AAGTCTGGGC GGCCCTGGCC GACCCGGAGA AGTTCGTCGT CGTCCAGACG GCGCCGGCCA CCCGGGTGTC CATCGGCCAG GAGTTCGGGT TAGCACCGGG GAGCATCAAC ACCGGCCAGA TGGTGGCGGC TTTAAGGCGC CTGGGCTTTG ACAGGGTCTT TGATACCGAC TTTTCCGCCG ACCTGACCAT TATGGAAGAA GGCTCCGAGT TTATTGAGCG CTTTACCAAA GATGGCCCCC TGCCGTTGAT CACCTCCTGC AGCCCGGGCT GGATCAAGTT TATGGAGCAC TTCTACCCGG AGCTTATACC CAACGTCTCC ACCTGCAAGT CGCCCCAGCA GATGTTCGGC GCCGTGGCCA AGACTTACTA TGCCCGGAAG GCCGGTGTAG ATCCGGCCAG GATGGTGGTC GTCTCCATCA TGCCCTGCAC TGCCAAGAAG TTCGAGTGCC AGCGGCCGGA GATGCGGGAC AGCGGCTATC AGGACGTGGA CTACGTCCTC ACCACGCGCG AGCTGGCGCG GATGATCAGG GAAGCCGGGA TTGATTTCAA AAACCTCCCG GAAGAGCAGT ACGACGATCC ATTAGGCGAA TCCACCGGAG CGGGGGTCAT CTTCGGTGCC ACAGGCGGGG TCATGGAGGC GGCCTTGCGT ACGGCCTACG AACTAATTAC CGGCGAGACC CTGCCCGCCC TGGACTTCTA TGATATCCGC GGCCTCAAGG GCATCAAGGA AGCCACGGTA GACATCAAGG GTACCAAAGT CCGGGTGGCT GTAGCCCACA GCCTGGGCCA TGCCCGGCAG CTTTTAGAGC GGGTCAAGGC CGGGGAGCAG TATCACTTCA TTGAAATCAT GTGCTGCCCC GGCGGCTGCA TTGGCGGCGG CGGGCAGCCC ATCCCCACCA ACACCGAGAT CAGGGAGCAG CGCATCAAGG GTATTTATCA GGTCGACATG GAGATGCCCA TCCGCAAGTC CCACGAGAAC CCGTCCGTTC AAGCCCTTTA CCGCGAGTTC CTGGGCAAGC CTTTGAGCGA GAAGTCCCAC CACTTATTGC ACACCGAATA TACGCGGCGG GGGAAATACT AG
|
Protein sequence | MSTVKLTIDN IPVEVEAGTT ILKAAEEAGI HIPTLCYLEG INEIGACRVC VVEVEGARNL MASCVAPVAE GMVVKTNSPR VRMARRLNVE LLLSNHEMEC PTCIRNLNCE LQSLARGLGI RQVRFKGKKS EHPVDDSTPA LVREPDKCIL CRRCVAVCEK VQGVMAIAPL GRGFDTVIAP AFQEKLVDIA CVECGQCTLV CPVGALYEKD YTSEVWAALA DPEKFVVVQT APATRVSIGQ EFGLAPGSIN TGQMVAALRR LGFDRVFDTD FSADLTIMEE GSEFIERFTK DGPLPLITSC SPGWIKFMEH FYPELIPNVS TCKSPQQMFG AVAKTYYARK AGVDPARMVV VSIMPCTAKK FECQRPEMRD SGYQDVDYVL TTRELARMIR EAGIDFKNLP EEQYDDPLGE STGAGVIFGA TGGVMEAALR TAYELITGET LPALDFYDIR GLKGIKEATV DIKGTKVRVA VAHSLGHARQ LLERVKAGEQ YHFIEIMCCP GGCIGGGGQP IPTNTEIREQ RIKGIYQVDM EMPIRKSHEN PSVQALYREF LGKPLSEKSH HLLHTEYTRR GKY
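The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The following is a minimal sketch of that behaviour, not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand, as displayed above even though the gene lies on the minus strand. The helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# The amino-acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initial start codon as Met.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("TTGAGCACCG TAAAACTGAC CATTGACAAT"))  # MSTVKLTIDN
```

The output matches the first ten residues of the listed protein sequence. Passing the complete gene sequence to the same function should reproduce the full protein up to the terminal TAG stop codon.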
|
| |