Gene Hore_02090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_02090 
Symbol 
ID7312528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp214625 
End bp216100 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content43% 
IMG OID643610631 
Producthydrogenase large subunit domain protein 
Protein accessionYP_002507965 
Protein GI220931057 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones70 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGAG AAGTATCTGA AGTTACCAGG ATCAGGAGAA AGGTTTTGAC CGAGATAGCC 
CGTCTTACTT TTGAAGATCG TCTGGTGGAA GAGGTAGATT ATATTCCCCG GAGGTTGACT
GAAAATGGTA TTTCCAATTA TCGCTGTTGT GTCTATAAGG AAAGAGCAAT TTTAAAAGAG
AGGGTTAAAC TGGCCCTGGG GGTTGCCCCT GATGAAGTGG ATGATGAAGA GAAGAGTCTT
TCTGAAATAG CGAGTAATGT ATTACAGGGC CAACTGGGGA ACAGTGATAG AAATATTGCT
ATTATTGAAG AAGCCTGTGA CCAGTGTTCA ATCGATAAAA TAGTGGTAAC CAATGCCTGT
CGTAATTGTG TTGCCCACCA CTGTGTAAAT TCCTGTCCCC GGGGCGCCAT TACCATTGTT
AATAATCAGG CCTATGTTAT CAGGGAAAAG TGTGTGGAGT GCGGGCTCTG TGTTAAAGCC
TGCCCCTATG GCGCCATTCT TGAAGTTGAA AGACCCTGTA CCAGTGCCTG TTCCCTTGAT
GCAGTTGTTC CCGGGGAGAA GAGTACAGCA GAAATAGATG ATAATAATTG TATTGAATGT
GGCTCCTGTA TAGAGGCCTG TCCCTTTGGA GCAATTTCCT ATAAATCTGA AATTGTCAGG
GTTGTTCAAA TGTTAAAAAG TGGAGACATT AATACTGTAG CCCTGGTTGC TCCTTCATAT
ATTGGACAGT TTGGACCAAG GGTTGACTGG GAAAAACTAT GTACTGGTCT TAAAAGGCTG
GGATTTCACG ATGTAATACC GGTGGCCCTC GGTGCTGATA AGGTAATTGA AGAGGAAAGC
AGGGAATTTC AGGCGATGAC TGATAAACCA ATGTTTAATT CCTGTTGTCC TTCCTTTGTT
AACTTAATAG AATTAAAATT TCCTGAATAT TTATCACAGG TATCTACTAC AGAATCACCG
ATGCTTAAAG CTGCTTATCT GGCTAAAGAA GATGGTGAAC TGGTTCAGTG TGTCTTTATC
GGCCCCTGTC TTGCTAAAAA GAGTGAGGCC AGGAATAAGG GGCAGGGTTT GATAGATGCT
GTTTTGACCT TTGAAGAAAT TGCAGCCATG CTGGTGGCTA AAGGTATTAA CCTGGCAAAA
ATAACCGAAA CGGGGGATCT ATCTGAAGAA CACCGGTCCC CTTCCATACC GGCTCAGGCC
TTCTGTGAAG CCGGGGGAGT TGGAAGTGCT ATCAGTGGAA AACTTAATAA TAATGATAAT
GGTATAAGGT TTCACCAGGT TGATGGTGCT TCAGAATGTC TTAAAGTGTT AAATCAGATT
AAAGCAGGTA AGATTAAAGC AGATTTTGTT GAAGGTATGG GTTGCCAGGG GGGCTGTATT
GGTGGCCCCG GAACGCTGGT AAACCGGCGA GTGGCTTCCG GATTATTAAA AAAGCTTATA
AAATCCAGGG GTGGTGAAAA AGTTGGCACA AAGTAA
 
Protein sequence
MAGEVSEVTR IRRKVLTEIA RLTFEDRLVE EVDYIPRRLT ENGISNYRCC VYKERAILKE 
RVKLALGVAP DEVDDEEKSL SEIASNVLQG QLGNSDRNIA IIEEACDQCS IDKIVVTNAC
RNCVAHHCVN SCPRGAITIV NNQAYVIREK CVECGLCVKA CPYGAILEVE RPCTSACSLD
AVVPGEKSTA EIDDNNCIEC GSCIEACPFG AISYKSEIVR VVQMLKSGDI NTVALVAPSY
IGQFGPRVDW EKLCTGLKRL GFHDVIPVAL GADKVIEEES REFQAMTDKP MFNSCCPSFV
NLIELKFPEY LSQVSTTESP MLKAAYLAKE DGELVQCVFI GPCLAKKSEA RNKGQGLIDA
VLTFEEIAAM LVAKGINLAK ITETGDLSEE HRSPSIPAQA FCEAGGVGSA ISGKLNNNDN
GIRFHQVDGA SECLKVLNQI KAGKIKADFV EGMGCQGGCI GGPGTLVNRR VASGLLKKLI
KSRGGEKVGT K