Gene Hore_06330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_06330 
Symbol 
ID7314538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp682455 
End bp684191 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content42% 
IMG OID643611063 
ProductNADH dehydrogenase (ubiquinone) 75 kDa subunit 
Protein accessionYP_002508385 
Protein GI220931477 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00021405 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATA AAGTAAATAT AACATTAGAT GGTAAAAGTT TAACTGTAGA TAAAGATAAA 
ACTATTCTGG AAGTAGCCCG GGAAGCCGGG ATTAAGATTC CTACCCTCTG TTACCTGGAA
GAAATTAATG AAATTGGTAG CTGCAGGGTC TGTGTAGTGG AAGTGAATGG AAAAATACAG
CCTGCCTGCG TTACTCCGGT AAGTGAAGGC CTGGAAATCA CAACAACTTC ACCCAGGATT
CGTGAAGCCA GGAGGATATC CCTCGAGTTA ATAATTTCAG ACCATCCTAT GGAATGTTTG
ACCTGTAGTC GGAATGGAAA TTGTGAACTC CAGAGACTGG CAGAGGACTT TGGAATAAGT
GAGATAACTT ATGAGGGTGA ACAGTCACAT TTTGAACCTG ATCTTTCATC ACCTTCAATT
GTCAGGGATC CCGATAAATG TATTTTATGC CGGCGTTGTG TTAGTGTCTG TGAACAGGTT
CAGGGGGTTG CTGCCTTAAC TCCCAATGAA AGGGGATTTT CTACCATAAT TACCCCTGCC
TTTGGTCAAA AACTGGGTGA AATAGCATGT GCTAACTGTG GTCAGTGTAT AAATGCGTGT
CCAGTTGGAG CCCTTTCTGA AAAGGATGAC ACCGAAAAGG TCTGGGAAGC CCTGGCTAAT
CCCGATAAAC ATGTAGTGGT CCAGACAGCA CCTGCTGTCA GGGTATCGAT TGGTGAAGTA
TTTGGAATGA AACCCGGTAG TCTGGTAACA GGAAAACTGA TGGCTGGTTT AAGACGGCTT
GGTTTTGATA AGGTTTTTGA TACCAACTTT ACTGCTGACT TGACCATAAT GGAAGAAGGT
CATGAATTAA TTGAAAGACT GAAAAACAAT GAAAGGCTAC CGTTGATTAC TTCCTGTAGT
CCGGGCTGGA TTAAGTTTAT TGAACACTTC TACCCAAGTT ACCTTGAGCA TATCTCAAGC
TGTAAATCTC CTCAACAGAT GTTTGGGGCC CTTGCCAAAA CTTATTACCC TGAAAATAAT
GGTATAGACC CGGAAGATGT ATTTGTAGTT TCGGTTATGC CCTGTACTGC TAAAAAATTC
GAAATAACAA GACCCGGTAT GGATAGTAGT GGGTATCAGG ACGTAGATGT GGTTCTTACC
ACAAGGGAGC TGGCAAAAAT GTTTAAACAG GCCGGGATTG ACTTTGTGAA TCTCCCTGAT
GAAGAATATG ATAAACCCCT CGGTATTTCG ACTGGTGCCG GTACTATTTT TGGAACAACA
GGTGGCGTTA TGGAAGCAGC CTTAAGAACT GCCTATGAGG TATTAACAGG GGAGGAATTA
CCCGGTCTGG AATTTGAGGA TGTAAGGGGT TTAGAGGGGA TTAAGGAATG TGAAATTGAA
ATTAACGGTC AGAAAATAAA AGTTGCCGTA GCTCATGGAC TTTCCAATGC TCATAAGGTA
CTTCAAAATA TAGACGACTA TCATTTTATT GAAATTATGG CCTGCCCTGG TGGTTGTGTT
GGTGGTGGTG GTCAGCCCTA TCCTACCAAT GAAGAAACTA TAAGATTAAG GGCCCAGGGC
CTTTACCGGG ATGATAAGGA ACATCAGATC AGGAAATCCC ACGAAAATCC TGTTGTCAAA
AAACTATATG AAGAATTTCT TGGCAAACCA TTGAGTCATA AGTCTCATGA ATTACTACAC
ACCGGGTATG TTGTAAGATC AAAATACCCG GCCAATGTTG AATCTGATGC GGTTTAA
 
Protein sequence
MSDKVNITLD GKSLTVDKDK TILEVAREAG IKIPTLCYLE EINEIGSCRV CVVEVNGKIQ 
PACVTPVSEG LEITTTSPRI REARRISLEL IISDHPMECL TCSRNGNCEL QRLAEDFGIS
EITYEGEQSH FEPDLSSPSI VRDPDKCILC RRCVSVCEQV QGVAALTPNE RGFSTIITPA
FGQKLGEIAC ANCGQCINAC PVGALSEKDD TEKVWEALAN PDKHVVVQTA PAVRVSIGEV
FGMKPGSLVT GKLMAGLRRL GFDKVFDTNF TADLTIMEEG HELIERLKNN ERLPLITSCS
PGWIKFIEHF YPSYLEHISS CKSPQQMFGA LAKTYYPENN GIDPEDVFVV SVMPCTAKKF
EITRPGMDSS GYQDVDVVLT TRELAKMFKQ AGIDFVNLPD EEYDKPLGIS TGAGTIFGTT
GGVMEAALRT AYEVLTGEEL PGLEFEDVRG LEGIKECEIE INGQKIKVAV AHGLSNAHKV
LQNIDDYHFI EIMACPGGCV GGGGQPYPTN EETIRLRAQG LYRDDKEHQI RKSHENPVVK
KLYEEFLGKP LSHKSHELLH TGYVVRSKYP ANVESDAV