Gene Hore_02060 details


Gene Information

Locus tag: Hore_02060
Symbol:
ID: 7312525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 211497
End bp: 213212
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 42%
IMG OID: 643610628
Product: putative PAS/PAC sensor protein
Protein accession: YP_002507962
Protein GI: 220931054
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
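
The coordinate and length fields above are consistent with one another, which a short check can show. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 211497
END_BP = 213212


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1716
print(expected_protein_length(length_bp))  # 571
```

Both results agree with the Gene Length (1716 bp) and Protein Length (571 aa) fields.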


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTCTGG TAATAACAAG TGAGGCTAAA TGCAGAGATT GTTATAAGTG TATACGGTAT 
TGCCCGGTTA AAGCTATTGG GATCAAGGAT GGGCAGGCCT GGGTAGATGA AGACAGGTGT
ATTCTGTGTG GCCGTTGTAT TGAAGCCTGT CCCCAGAATG CTAAAAAAAC TGAACAATAT
CTTGATAAAT TTAAAAACTA TATAGCCTCC GGTAATAAAG TAGTTGTTAG CCTGGCTCCC
TCGTATCTGG CTTCAGGTTA TTTCTCTACC CCCTGGAAGT TGGTAGGGGC TTTAAAAGAA
CTTGGTGTAG ATGTTGTTGA AGAAACAGCG ATTGGGGCAG AAGTTATCGC CGATGAGTAC
AGAGATTTAC TCAATAAAAA GGAACTAATT ATATCCAGCT GTTGCCCAAC TGTTGTAAAC
CTGGTTGAAA AATATTTTCC AGGATTATGT GATTATCTGG CACCTGTTAT ATCACCGATG
ATAGCCCACG GAAGACTGAT AAAACAGGAA ATGGGTGAAA ACTGCCGGGT TGTATTTATC
GGCCCCTGTT ATGCGAAAAA AGAAGAAGCG TTGACCCGGG GTGAAGGGTC AATTGATGCT
GTTTTGACCT TTGAGGAAAT ATTTAATTAC TTAGATAGAC TAAATCAGGA TATACCTCCC
CGTAATATTT TTCCTGACAG AACCTCTAAC AGGGCCCGCC GTTTTCCACT TCCCGGCGGG
GCCCTTAGAA CAGGTGGGCT AAATGAATTA AATATAAAGC CAGATGATAT TGTGTCAATA
TCAGGACTTG AAGATTGTAT GGAAACCTTC CGTGATATTG AAAAAGGTCT TATCAAGCCC
AGATTTATCG AGGCAATGGC CTGCCGGGGG GGTTGCCTCG GGGGGCCAGC CATGGGTGAA
GATGCTGGAA TCCTGGCCAG GAAACAAAGG TTATACAATA ATTTAAACAG GGGACAGCAG
AGCTGTCGGG GGGCTAACAA TAAAGGTATT ATAGTTTCCT GGAGCTACAC TCCCAGAGAG
ATAGATTATA AAGTTCCAGA TGAAGAAGAA ATAAAGAGAA TTCTGGCATT GACCGGTAAA
ACATCACCTG AAGATGAAAC AAACTGTGGT GGCTGTGGTT ATCCCAGTTG CCGGGATAAG
GCAATTGCTG TGTATAATGG TCTGGCCAAT CCGGAAATGT GTATTCCCTA TATGAGGGAA
AAGGCTGAAT CCTTATCCCA TGCTGTTGTT GATAGTACTC TTAATGGAAT TATTATTGTT
GATAAAGATA TGATTATTCA GGATTTTAAT CCGGCTGCAA ACCGGATGTT TAACCGCAGA
GATATTAAGG CTAAAGGTAA ACCATTGAGC ACTTTTATTG ATCCCGGGGA TTACATTGAT
GTCTGGGAGA ATCAGGAGAT GATAACTGAT AATTGTAAGC AATATCCCCA GTATGAGTTG
ATTACCAGGG AAACTATTTA TCCCCTCCCT AAATATGGAG TTGTCATCGG TATAATTACA
GATGTTACTG AAGAGGAAAA GACCAGGGCC GAAATTGATA ATATGAGACA GGAAGCTTTA
GACAGGGCCT CTCAGGTAAT TAAAGAACAG ATGCGGGTTG CCCAGGAAAT TGCAGGACTT
CTTGGTGAGA GTACGGCAGA TACTAAAGCA ACACTACTCG AACTAAAAGA GATTATAGGG
CAAAGAGAGG CGAAGACAAA TGGGGATGAA AGTTGA
 
Protein sequence
MGLVITSEAK CRDCYKCIRY CPVKAIGIKD GQAWVDEDRC ILCGRCIEAC PQNAKKTEQY 
LDKFKNYIAS GNKVVVSLAP SYLASGYFST PWKLVGALKE LGVDVVEETA IGAEVIADEY
RDLLNKKELI ISSCCPTVVN LVEKYFPGLC DYLAPVISPM IAHGRLIKQE MGENCRVVFI
GPCYAKKEEA LTRGEGSIDA VLTFEEIFNY LDRLNQDIPP RNIFPDRTSN RARRFPLPGG
ALRTGGLNEL NIKPDDIVSI SGLEDCMETF RDIEKGLIKP RFIEAMACRG GCLGGPAMGE
DAGILARKQR LYNNLNRGQQ SCRGANNKGI IVSWSYTPRE IDYKVPDEEE IKRILALTGK
TSPEDETNCG GCGYPSCRDK AIAVYNGLAN PEMCIPYMRE KAESLSHAVV DSTLNGIIIV
DKDMIIQDFN PAANRMFNRR DIKAKGKPLS TFIDPGDYID VWENQEMITD NCKQYPQYEL
ITRETIYPLP KYGVVIGIIT DVTEEEKTRA EIDNMRQEAL DRASQVIKEQ MRVAQEIAGL
LGESTADTKA TLLELKEIIG QREAKTNGDE S
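
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last lines of the gene sequence. This is a minimal sketch, not the pipeline that produced this record: it assumes the space-grouped blocks are in frame (each full line holds 60 bases) and builds the codon table from the compact NCBI ordering, whose amino acid assignments for translation table 11 match the standard code. The helper names `clean` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_line = "ATGGGTCTGG TAATAACAAG TGAGGCTAAA"
last_line = "CAAAGAGAGG CGAAGACAAA TGGGGATGAA AGTTGA"

print(translate(first_line))  # MGLVITSEAK
print(translate(last_line))   # QREAKTNGDES*
```

The first result matches the opening residues of the protein sequence, and the second matches its final residues followed by the TGA stop codon.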