Gene Information |
Locus tag | TRQ2_1319 |
Symbol | |
ID | 6092761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1348851 |
End bp | 1350788 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642488521 |
Product | hydrogenase large subunit |
Protein accession | YP_001739346 |
Protein GI | 170289108 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | [TIGR02512] hydrogenases, Fe-only |
|
|
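The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
START_BP = 1348851
END_BP = 1350788


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1938 645
```

Under these assumptions the computed values match the Gene Length (1938 bp) and Protein Length (645 aa) fields.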
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTT ACGTTGATGG AAAAGAAGTT ATCATAAATG ACAACGAGCG TAACCTCCTT GAAGCGCTGA AGAACGTGGG GATAGAGATT CCGAATCTGT GTTATCTTTC GGAGGCTTCT ATATATGGAG CCTGTAGAAT GTGCCTTGTG GAGATCGACG GTCAGATCAC CACTTCCTGT ACCCTGAAAC CTTACGAAGG TATGAAGGTA AAAACGAACA CCCCCGAAAT ATACGAAATG AGAAGGAACA TCCTCGAACT CATCCTTGCA ACTCACAACA GAGACTGTAC CACCTGCGAT AGAAACGGAA GCTGTAAACT TCAGAAGTAC GCTGAAGACT TCGGCATAAG AAAGATCAGA TTCGAAGCTC TCAAGAAAGA ACACGTCAGG GACGAATCCG CTCCGGTAGT GAGAGATATA TCCAAGTGTA TCCTCTGCGG TGACTGTGTT CGCGTGTGTG AAGAGATCCA GGGAGTCGGT GTTATCGAGT TCGCCAAGCG CGGTTTTGAA AGCGTCGTGA CGACCGCTTT TAATACTCCC CTCATAGAGA CAGAGTGTGT GCTCTGCGGA CAGTGTGTAG CATACTGTCC AACGGGAGCT CTGAGCATCA GAAACGATAT AGACAAGTTG ATTGAAGCTC TTGAAAGCGA CAAGATCGTG ATAGGAATGA TCGCACCCGC GGTGAGAGCT GCGATTCAGG AAGAGTTTGG AATAGACGAA GACGTCGCAA TGGCGGAAAA ACTCGTCTCT TTCCTGAAAA CGATAGGCTT CGATAGAGTT TTCGATGTGT CGTTCGGAGC AGATCTTGTC GCCTACGAAG AAGCCCACGA GTTCTACGAA AGACTCAAAA AAGGAGAAAG ACTTCCACAG TTCACCTCAT GCTGTCCTGC GTGGGTGAAG CACGCTGAGC ACACCTATCC TCAGTACCTT CAGAATCTCT CGAGCGTGAA ATCACCTCAA CAGGCACTTG GCACGGTGAT AAAGAAGATC TACGCAAGAA AACTCGGTGT TCCTGAAGAA AAGATCTTCC TCGTTTCGTT CATGCCGTGT ACCGCTAAAA AGTTCGAAGC AGAAAGAGAA GAACACAAAG GAATCGTTGA CATTGTCCTC ACAACAAGAG AACTTGCTCA ACTCATCAAG ATGAGCAGAA TAGACATAAA CAGAATAGAA CCCCAGCCGT TCGACAGACC TTACGGAGTG TCTTCGCAGG CGGGTCTCGG TTTTGGAAAA GCCGGTGGGG TCTTCTCCTG TGTTCTTTCT GTGTTGAACG AGGAAATCGG CATAGAAAAA GTCGACGTGA AATCTCCAGA AGATGGCATC AGGGTAGCGG AAGTTACACT CAAAGATGGT ACGTCTTTCA AAGGAGCCGT CATATACGGT CTTGGTAAGG TGAAGAAGTT CCTCGAAGAA AGAAAAGACG TGGAGATTAT CGAAGTAATG GCCTGTAACT ACGGATGTGT GGGTGGAGGA GGACAGCCTT ACCCGAACGA TTCCAGAATC AGAGAACACA GGGCAAAAGT GCTAAGAGAC ACCATGGGAA TAAAATCTCT CCTCACACCC GTGGAAAACC TCTTCCTCAT GAAACTCTAC GAAGAAGATC TGAAAGACGA ACACACAAGA CATGAGATTC TCCACACCAC CTACCGACCG AGGAGAAGAT ACCCGGAAAA AGATGTGGAA ATACTTCCCG TTCCAAACGG TGAAAAGAGA ACGGTGAAAG TCTGTCTTGG AACCTCCTGT TACACGAAAG GATCTTACGA AATACTGAAA AAGCTCGTTG ATTACGTCAA AGAGAACGAT 
AGGGAAGGAA AAATAGAAGT GCTGGGAACG TTCTGTGTGG AAAACTGCGG GGCTTCTCCG AACGTGATCG TGGATGATAA AATCATAGGT GGTGCCACTT TTGAGAAGGT GCTGGAGGAG CTTTCGAAAA ATGGCTGA
|
Protein sequence | MKIYVDGKEV IINDNERNLL EALKNVGIEI PNLCYLSEAS IYGACRMCLV EIDGQITTSC TLKPYEGMKV KTNTPEIYEM RRNILELILA THNRDCTTCD RNGSCKLQKY AEDFGIRKIR FEALKKEHVR DESAPVVRDI SKCILCGDCV RVCEEIQGVG VIEFAKRGFE SVVTTAFNTP LIETECVLCG QCVAYCPTGA LSIRNDIDKL IEALESDKIV IGMIAPAVRA AIQEEFGIDE DVAMAEKLVS FLKTIGFDRV FDVSFGADLV AYEEAHEFYE RLKKGERLPQ FTSCCPAWVK HAEHTYPQYL QNLSSVKSPQ QALGTVIKKI YARKLGVPEE KIFLVSFMPC TAKKFEAERE EHKGIVDIVL TTRELAQLIK MSRIDINRIE PQPFDRPYGV SSQAGLGFGK AGGVFSCVLS VLNEEIGIEK VDVKSPEDGI RVAEVTLKDG TSFKGAVIYG LGKVKKFLEE RKDVEIIEVM ACNYGCVGGG GQPYPNDSRI REHRAKVLRD TMGIKSLLTP VENLFLMKLY EEDLKDEHTR HEILHTTYRP RRRYPEKDVE ILPVPNGEKR TVKVCLGTSC YTKGSYEILK KLVDYVKEND REGKIEVLGT FCVENCGASP NVIVDDKIIG GATFEKVLEE LSKNG
|
| |
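The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes that the sense codons of translation table 11 carry the same amino-acid assignments as the standard genetic code (table 11 differs in its permitted start codons), and it builds the codon table from the conventional TCAG ordering. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only short excerpts of the sequence are used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAAATTT ACGTTGATGG AAAAGAAGTT"))  # MKIYVDGKEV
# Last nine bases of the gene sequence, ending in the TGA stop codon.
print(translate("AATGGCTGA"))  # NG
```

The two excerpts reproduce the first ten and last two residues of the protein sequence. Applying `gc_percent` to the full gene sequence, rather than to an excerpt, is how the GC content field could be rechecked.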