Gene Information |
Locus tag | Hore_01810 |
Symbol | |
ID | 7313309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 187499 |
End bp | 189499 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643610604 |
Product | NADH dehydrogenase I subunit G |
Protein accession | YP_002507938 |
Protein GI | 220931030 |
COG category | [C] Energy production and conversion; [R] General function prediction only |
COG ID | [COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit; [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | [TIGR02512] hydrogenases, Fe-only |
|
|
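The coordinate and length fields in the Gene Information table are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 187499
end_bp = 189499

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2001 666
```

Under these assumptions the results match the listed Gene Length (2001 bp) and Protein Length (666 aa).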
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.000616939 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGATA CTTTAGTAAT AAACGGAAAA ACTGTTGCGA TAAATGGTGA GAAGAATCTG CTTGAGCTGA TAAGAAAACA GGGGATTGAG CTACCAACCT TTTGTTACCA TTCTGAATTA AGTGTTTATG GAGCCTGTAG GATGTGTCTT GTTGAAGAGG AGAATATGGG GATTATTGCA ACCTGTTCAA CGGCACCCCA ACCGGGGATG AATATAAAAA CCCATTCTCC CAGGGTACAA CGGATAAGAA AGATGGCTCT GGAATTACTA CTGGCTAACC ATGATCGGGA TTGTACTACC TGTTCCCGGA ATGGGAATTG TAAGCTTCAG GACCTGGCCC AGAGGTTTGG CATAGATGAG GTAAGGTTTG GTACCAGGGA AGAAGTACTA CCGGTTGATA ACAGTAGTCC TTCAATTGTA CGCAATCCCA ATAAGTGTAT ACTGTGTGGG GACTGTGTCC GGACCTGTCA GGAAATTCAG GGCATCGGGG TACTGGATTT TGCCAACCGT GGTTCCAAAT CTATAGTGGC TCCAGCTTTT AATAAAGACC TGATTGAGGT TGACTGTGTG GCCTGTGGTC AGTGTGTTGC CGTATGTCCG ACCGGAGCTT TGACAATAAA ATCAGATATT AGTCGGGTCT GGGAAGCAAT TAATGATCCC GCGAAAACCG TAGTTGTCCA GATTGCGCCT GCTGTCAGGG TAGCTATCGG TGAAGAATTT GGCATGGAAC CTGGAGAAAT CCAGATGGGA CAGCTGGTTG CTGCCCTCAA AAGGCTTGGG TTTGATAAAG TCTTTGATAC CAGCTTTGCA GCTGACCTGA CAGTTATGGA AGAAACGGCT GAATTTATTG AGAGATTTAA AGAAGGTAAG AAATTGCCAC AGTTTACTTC CTGCTGTCCG GCCTGGGTTA AATATGCTGA AGAGTATGCC CAGGACTTCC TGGATAACCT GTCCAGCTGT AAGTCACCAC AGCAAATGTT CGGTTCTGTA GCCAAGAGGT TCTATAGTCA GGATCTCGGT ATTGATCCGG AAGATATGGT TGTTGTTTCC ATAATGCCCT GTACAGCCAA AAAATTTGAA GCTCAGCGTC CTGAATTTAT AACAGATGGG GTACCTGATG TTGACGTAGT TATAACCACC CAGGAAGTAG CAAGCATGAT TAAAAAGTCT GGCCTGGTTT TCTCTGAACT GGGGATAGAA TCCCTCGATA TGCCTTTAGG ATTTTCTACC GGGGCCGGTG TTATTTTCGG TGTTACCGGT GGGGTTTCAG AGGCTGTTCT TAGAAATGCC TATGAAAAGA TAACCGGAGA TAACCTTGAT GATGTAGAAT TTAAGGAAGT AAGGGGCTTT GATGGAATAA AAGAAGCAGA AGTAGAACTG GATGGTAAAA CTGTAAGACT GGCGGTTGTC CACGGTCTGA GTAATGTCGG TGATCTTATT AAGGCGATTA AAAAAGGTGA AAAAGAATAT GACCTGATTG AAGTTATGGC CTGTCCTGGC GGTTGTATTG GCGGTGGAGG ACAGCCTACT CCTAATAACA CAGAGGTCAG AGAAAAGAGG GCGCAGGGTA TGTACAACTG TGATAAGTTA TCAGCCCTTC ACAAATCCCA GGAAAACCCC ATGGTCAACG ACTTTTACCG CCGCTGGTTT GGTGAAGAGA ATAGTGATGT AACCCATAAA CACTTACACA CCTCCTATGA AGAAAAGCAG AGGATCAATA CTAAAGGAAT TGAATTAAAC ACAAGTGAAA GTAGTGAGGA GGTTGTACCG GTTCAGGTCT GTGTAGGAAC CTGCTGTTAT 
CTTCATGGTT CCTACGACCT CTTACAGGGT CTTATAGAAA GGGTTGAAGA AGAGGGGTTA AGTGATAAGG TAGATATAGA AGCAACTTTC TGTTTTGAAA ACTGTAAAAA CGCGCCTTCT GTTAAAGTAG GCAATCAGCT TTTAAGTAAG GTTGAAAGTG TTGATGATAT TTTAAAACAC TTAAAGCCTG CTTTAAAATA A
|
Protein sequence | MSDTLVINGK TVAINGEKNL LELIRKQGIE LPTFCYHSEL SVYGACRMCL VEEENMGIIA TCSTAPQPGM NIKTHSPRVQ RIRKMALELL LANHDRDCTT CSRNGNCKLQ DLAQRFGIDE VRFGTREEVL PVDNSSPSIV RNPNKCILCG DCVRTCQEIQ GIGVLDFANR GSKSIVAPAF NKDLIEVDCV ACGQCVAVCP TGALTIKSDI SRVWEAINDP AKTVVVQIAP AVRVAIGEEF GMEPGEIQMG QLVAALKRLG FDKVFDTSFA ADLTVMEETA EFIERFKEGK KLPQFTSCCP AWVKYAEEYA QDFLDNLSSC KSPQQMFGSV AKRFYSQDLG IDPEDMVVVS IMPCTAKKFE AQRPEFITDG VPDVDVVITT QEVASMIKKS GLVFSELGIE SLDMPLGFST GAGVIFGVTG GVSEAVLRNA YEKITGDNLD DVEFKEVRGF DGIKEAEVEL DGKTVRLAVV HGLSNVGDLI KAIKKGEKEY DLIEVMACPG GCIGGGGQPT PNNTEVREKR AQGMYNCDKL SALHKSQENP MVNDFYRRWF GEENSDVTHK HLHTSYEEKQ RINTKGIELN TSESSEEVVP VQVCVGTCCY LHGSYDLLQG LIERVEEEGL SDKVDIEATF CFENCKNAPS VKVGNQLLSK VESVDDILKH LKPALK
|
| |
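The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first few codons. The sketch below is a hedged illustration rather than the database's own method: it builds the codon table from the standard NCBI ordering (the amino-acid assignments are the same for translation tables 1 and 11), and the `translate` helper is a hypothetical name introduced here. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter for this gene, since it begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, grouped as in the record.
prefix = "ATGTCTGATA CTTTAGTAAT AAACGGAAAA"
print(translate(prefix))  # MSDTLVINGK
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence (with the grouping spaces left in) to the same function should reproduce the whole protein, provided the sequence is copied without errors.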