Gene Hore_23020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_23020 
Symbol 
ID7313054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2514652 
End bp2515662 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content37% 
IMG OID643612754 
Producttranscriptional regulator, LacI family 
Protein accessionYP_002510042 
Protein GI220933134 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.000406909 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTGA CTATAAAAGA TATTGCCAGG GCTGCTGGTG TGTCAAAGAC TACTGTTTCC 
AGGGCTTTAA ATAATAAGGG TCGAATTAGT CCGGAAACCA GAGAAAAAAT CCTTAAAATC
GCTGCAAAAA TGAATTACCG TCATAATAAA ATAGCTACCA GTTTACGGTC CAATAAATCC
ATGGTAATTG GTCTAATTTT CCCTGGATTT ATGGCAGGCC ATTTTTATAG TGAAATTTTT
CATGGCATTG AGGCTTACTG TACTGAAAAA GGCTTTGGGG TCTTAATTGG TTGTTCTGAT
GGATTAGCTC AAAAAGAGGA AGAGATAATT AAATTACTTC AGGAACGCAG GGTTGATGGT
ATTATAATTG CCCCTACCCA TGGGGTTGAT CTGGATTATT ATCATCAGTT TAAAAAGGAA
AAACTGCCTT TTATCTTTAT AGATAAATAC ATACCGGGTA TTGAAGCAGA TAGAATAGTT
GTTGATAATA AAAAAGGTGC CTATCTTGCC GTAACTCACC TTATCGAGAG AGGTCATAAA
AAGATTGCTT TATTAAGTGG ACCGGAATAC CCCTGTACTA CAATTGAAGG AAGATTAGAA
GGATATCTTA AGGCTCTTGA AGATAATGGT CTTACATATA GGAAAATCAT AAAAACAGAT
AAAAATGTTT ATAACCAGAG AGAAAGTGGT TATAAGGCAA CAAAGGAATT TATAAAAGAT
AATGATGGTG TAACCGCCAT CTTTGCTATT AACGATAGTG TCGCTATAGG TGCCATGAGG
GCAATCAGGG AGGCTGGACT TCGAGTCCCG CAGGATATGG CCATTGTTGG CTTTAATGAT
GATGATATTT CTTTATATAT TGAAAATTCA CTTACAACTA TATCTGTTCC CAAATATAAA
TTAGGGGAAA AGGCCGCTCA ACTGATATTA GAACGTATTG AGGGGCAGGC CGATACAGGA
CAGAGGATTA TTACCCTGGA ACCTGATTTA GTGGTTAGAG ATACAACTTA A
 
Protein sequence
MAVTIKDIAR AAGVSKTTVS RALNNKGRIS PETREKILKI AAKMNYRHNK IATSLRSNKS 
MVIGLIFPGF MAGHFYSEIF HGIEAYCTEK GFGVLIGCSD GLAQKEEEII KLLQERRVDG
IIIAPTHGVD LDYYHQFKKE KLPFIFIDKY IPGIEADRIV VDNKKGAYLA VTHLIERGHK
KIALLSGPEY PCTTIEGRLE GYLKALEDNG LTYRKIIKTD KNVYNQRESG YKATKEFIKD
NDGVTAIFAI NDSVAIGAMR AIREAGLRVP QDMAIVGFND DDISLYIENS LTTISVPKYK
LGEKAAQLIL ERIEGQADTG QRIITLEPDL VVRDTT