Gene Hore_15370 details


Gene Information

Locus tag: Hore_15370
Symbol:
ID: 7313130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1644733
End bp: 1645980
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 40%
IMG OID: 643611979
Product: helix-turn-helix- domain containing protein AraC type
Protein accession: YP_002509281
Protein GI: 220932373
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
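
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1644733, 1645980)
print(length, protein_length_aa(length))
```

With the values listed above, this gives 1248 bp and 415 aa, matching the Gene Length and Protein Length fields.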


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAATG AAAAAATCAA GGATAAATTA AAAAAGGTAA TTAATATATA TTCTTACGCA 
ACAGGAATTG GCTGCCGGAT TATCTCTACC GAAGATATGA AGGAGGTCTT TCAGCCTCCC
CTGAGTAGAC CCTCCTTCTG TAAAATGGTT CATAGTTGTA GTCGCTGTGA TGATGACTGT
AAACAGTCTT ACATTTATGG AGGGTTACAG GCGGAAAAAC TGGGTGAACC CTATATTTAT
TTCTGCCCAT ATGGTCTTGT TAACTGGGCG GTACCCCTGT TTGGTGAAAA GAAGATGGAG
TATTTCTTTA CCGGGGGGCC GGTCTTGCTG CACCCGGTTG ATGATTTACT TATTGAGGAT
ATTCTTAATC AGAACCCACT CCTGGGTTCC AGTTATAGAG AGTTAAGGGA GCATTTAAAT
AAATTAATTC AGGTTGATAC GGTCAGGGCC CGGTATCTGA GTGAACTTTT GATGGAACTA
TCAAAAAATG TAATGCTGGA GAGGGGTTAC CGGCAAAACA GAAAACGGGA ATATAATAGT
ATAAATGCCC GGATTGCAGA AAAGATTCAC GAACTGAAGC AGGGTAAAGA TGACGAGGGT
ACTCTGTATC CAATTGAAAA GGAGCAGGAG TTAATTAAAA AGGTAAAACT GGGTGATAAA
CAGGGGGCGC GGGCTATACT GAATGATATC CTTGGATTTG TATATTTTCA GAGTGGCAAC
AAATTAGATA TTATCAAGGC CAAGGCGATT GAGTTGATGG TGGTTCTGGC CAGGGCTGCT
ATAGAAGTCG GGGCTGACCT TGAAGTTATT TTCGGTCTGG AGAATACTTA TTTTAATAAA
ATTGATAATA TTGAGGATGT GAACCGGTTG TCAGAAATTC TGGTCAGGGT TCTGGACCGG
TTTATAGAGT GTATTTTTTC TATCAAAAAT GTGAGAAAAA AGGATTTGAT GTATAAGGCA
ATGAATTATA TCAGGGACAA TTATGCCCAT AAATCTATAA GCCTGAATGA AGTAGCCGAT
GAAGTGGGAT TAAGTGCGGC CTATTTCAGT AAATTATTTA AAGAAGAGCT GGGTTTAACC
TATACTGAGT ATCTGAATAA AGTCCGGATT GAAGCCAGCA AGGAGTTATT AAAGCAGGGT
TGTTCCCTGG CCAGTATTGC CCAGACCGTT GGTTTTAATG ACCAGAGTTA TTTTTCCAAG
GTATTTAAAA AAATGGAGGG GTTATCACCG GGAAAATGGA GGGGATAA
 
Protein sequence
MINEKIKDKL KKVINIYSYA TGIGCRIIST EDMKEVFQPP LSRPSFCKMV HSCSRCDDDC 
KQSYIYGGLQ AEKLGEPYIY FCPYGLVNWA VPLFGEKKME YFFTGGPVLL HPVDDLLIED
ILNQNPLLGS SYRELREHLN KLIQVDTVRA RYLSELLMEL SKNVMLERGY RQNRKREYNS
INARIAEKIH ELKQGKDDEG TLYPIEKEQE LIKKVKLGDK QGARAILNDI LGFVYFQSGN
KLDIIKAKAI ELMVVLARAA IEVGADLEVI FGLENTYFNK IDNIEDVNRL SEILVRVLDR
FIECIFSIKN VRKKDLMYKA MNYIRDNYAH KSISLNEVAD EVGLSAAYFS KLFKEELGLT
YTEYLNKVRI EASKELLKQG CSLASIAQTV GFNDQSYFSK VFKKMEGLSP GKWRG
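
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: it assumes the sequences are pasted in as plain text in blocks of ten, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in its permitted start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base over T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Drop whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = clean_sequence("ATGATAAATG AAAAAATCAA GGATAAATTA")
print(translate(prefix))
```

The first 30 bases translate to MINEKIKDKL, the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_fraction` allows the same comparison against the complete protein sequence and the GC content field.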