Gene Hore_02980 details


Gene Information

Locus tag: Hore_02980
Symbol:
ID: 7314684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 307388
End bp: 309100
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 42%
IMG OID: 643610721
Product: NADH dehydrogenase I subunit G
Protein accession: YP_002508054
Protein GI: 220931146
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
        [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID: [TIGR02512] hydrogenases, Fe-only
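
The start, end, gene length, and protein length fields are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


# Values taken from the record above.
length = gene_length_bp(307388, 309100)
print(length)                     # 1713
print(protein_length_aa(length))  # 570
```

Both results agree with the Gene Length and Protein Length fields of this record.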


Plasmid Coverage information

Num covering plasmid clones: 59
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTAAAAA TAAATATTGA TGGGCTTGAT CTGGAGGTCA GAGACGGAAT TAGTATCCTG 
GAAGCTGCTC GCCAGGCCCA TATTAAAATA CCAACCCTAT GCTATGAAGA AGGATTATCC
GTATATGGTG GTTGTCGGTT ATGTGTGGTT GAAATTGAAG GAGAAGCCCT TTTAAAACCT
GCCTGTGCTA CAGAAGTTAA TGATGGTATG GTAATCAAAA CCCACTCCCC TAAAGTCAGA
GATGTTAGAA GAACTTTATT TGAATTAATA GTTGCCTCCC ATAATATTGA TTGTCAGTTA
AATTGCCTAA CCTGTAGTCG GGCCGGAAGC TGTGAACTGA GGGAAATTGC CGAGGATATA
GGTGTCAGTA ATATCAGATT AGCTACCTAT GACAAAGGTT ACAGGGATGA CCGGTCCAGT
TATTCAATTG TGAGGGAGCC TAATAAATGT ATAACCTGTG GTCGGTGTAT CAGAAAATGT
GAAGAGGTCC AGGAGGTAGG TATTTTTACA ACAGCCAATA GAGGACCGGC AACAGTGGTA
ACAACCTTCA AAGAAAAAGG AATGGGGAAT GTGGAGTGTA CTAACTGTGG TCAGTGTATC
CATGCCTGTC CGACAGGGGC CTTACATGAG GTGTATCACT ATGAAAAAGT CTGGGAGGTG
CTTCATGACA GGGATAAATA TGTGGTAGTC CAGACGGCAC CTGCTGTCAG GGTGGCCCTG
TCTGAGGCCT TTGGATTAAA ACCGGGAACT ATTTTTACAG GGCAGATGGT TGCTGGATTG
AGGCGACTGG GCTTTGACCG TGTCTTTGAT ACTAATTTTA CAGCTGATTT AACCATCATG
GAAGAGGGAA CTGAATTGAT AGAGCGGTTA AATAATAATG GTGAGCTACC GATGTTTACT
TCCTGTAGTC CGGGCTGGAT TAAATATATA GAACACTTCT ACCCTGAATT TATAGATAAT
TTGTCTACCT GTAAGTCACC ACAGCAGATG TTTGGAGCGA TTGCCAAATC TTATTATGCT
GATAAAAGTA ATATACCGAG GGATAAAATA GTTGTAGTTT CGGTTATGCC CTGTACTGCC
AAAAAATTTG AAGCCAGAAG ACCGGAAATG GAGGGCGATG TTGATTATGT TCTGACTACC
AGGGAACTGG CCCGGATGAT TAAAGAGGCA GGAATTGATA TTACTAACCT TAATGCTGAA
GAGCATGATA AACTAATGGG AACATCATCG GGGGCTGCTG ATATTTTCGG CGCAACCGGT
GGGGTAATGG AAGCTGCCCT GCGGACAGCC TATGAACTGG TAACCGGTGA AAAACTCGGT
CAGCTGGACT TTAAGAATGT AAGGGGTGAA GCCGGAATTA AGGAAGCAGA AGTTACCTTA
AATGGAACTC AGTTGAAGGT AGCAGTAGCA CATGGTCTGG GAAATGTCAG AAAACTGATG
GAGCTGATAA AATCAGGAAA AGAGTATCAT TTTGTTGAGT TAATGGCCTG TCCCGGGGGA
TGTATTGGTG GTGGTGGTCA ACCTATTCCC ACCAGTGAGG ATATAAGAAG AAAGCGAATT
GAAGCTTTAT ATAAGATAGA TAAAAATAAA AAATTAAGGA AATCCCATGA AAATCCGTAT
ATTAAAAAAC TATATCAAGA ATTCCTGGAT AAACATGGAA GCCATAAAGC CCATGAACTG
CTCCATACCC ATTATATAAA CAGGGGTGTT TAA
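
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction. It uses only the first row of the sequence as sample input, so its result describes that row and not the whole gene. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First row of the gene sequence above (sample input only).
first_row = "GTGGTAAAAA TAAATATTGA TGGGCTTGAT CTGGAGGTCA GAGACGGAAT TAGTATCCTG"
cleaned = clean_sequence(first_row)
print(len(cleaned))          # 60
print(gc_fraction(cleaned))  # 0.4
```

Passing all rows of the gene sequence to the same two functions would give the whole-gene value to compare against the GC content field.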
 
Protein sequence
MVKINIDGLD LEVRDGISIL EAARQAHIKI PTLCYEEGLS VYGGCRLCVV EIEGEALLKP 
ACATEVNDGM VIKTHSPKVR DVRRTLFELI VASHNIDCQL NCLTCSRAGS CELREIAEDI
GVSNIRLATY DKGYRDDRSS YSIVREPNKC ITCGRCIRKC EEVQEVGIFT TANRGPATVV
TTFKEKGMGN VECTNCGQCI HACPTGALHE VYHYEKVWEV LHDRDKYVVV QTAPAVRVAL
SEAFGLKPGT IFTGQMVAGL RRLGFDRVFD TNFTADLTIM EEGTELIERL NNNGELPMFT
SCSPGWIKYI EHFYPEFIDN LSTCKSPQQM FGAIAKSYYA DKSNIPRDKI VVVSVMPCTA
KKFEARRPEM EGDVDYVLTT RELARMIKEA GIDITNLNAE EHDKLMGTSS GAADIFGATG
GVMEAALRTA YELVTGEKLG QLDFKNVRGE AGIKEAEVTL NGTQLKVAVA HGLGNVRKLM
ELIKSGKEYH FVELMACPGG CIGGGGQPIP TSEDIRRKRI EALYKIDKNK KLRKSHENPY
IKKLYQEFLD KHGSHKAHEL LHTHYINRGV
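
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The sketch below is a minimal illustration under two assumptions: the amino acid assignments and the start codon set are written out by hand from NCBI translation table 11, and an alternative start codon such as GTG in the first position is translated as M. Only the first row of the gene sequence is used as input, and the function name is illustrative.

```python
from itertools import product

# Amino acid assignments for table 11, in TCAG codon order (assumed from NCBI).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by table 11 (assumed from NCBI).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiator codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above (sample input only).
first_row = "GTGGTAAAAA TAAATATTGA TGGGCTTGAT CTGGAGGTCA GAGACGGAAT TAGTATCCTG"
print(translate(first_row))  # MVKINIDGLDLEVRDGISIL
```

The output matches the first 20 residues of the protein sequence listed above, including the leading M produced by the GTG start codon.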