Gene Hore_19800 details


Gene Information

Locus tag: Hore_19800
Symbol:
ID: 7312795
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2131696
End bp: 2132706
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 38%
IMG OID: 643612426
Product: transcriptional regulator, LacI family
Protein accession: YP_002509722
Protein GI: 220932814
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
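
The coordinates and lengths listed above are mutually consistent, which a short check makes explicit. This is a minimal sketch that assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2131696, 2132706)
print(length)                  # 1011
print(protein_length(length))  # 336
```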


Plasmid Coverage information

Num covering plasmid clones: 73
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACAA TAAAAGACAT TGCAAAACTT GCAGGTGTTT CAGTAACTAC AGTATCAAAA 
GTTATCAATA ATTATCCTGA TATTAGTGAT AAAACAAAGG AAAAGGTTAT AAAAATCATG
GAACAGCAAA ATTACCGCCC TAATGCTATT GCCAGAAGCC TGTCAACAAG CCGTTCGCGG
TCAATAGGAG TATTTTTTAC AGACCATTTA AATAGTGGGT TGAGGCACCC CTTTTTCAGA
GATATTATTT ACGGGATCGA AAAGACATTT TTCCGAAAAG GGTATGACCT GATTTTATTT
GCCCATCAAT GGGGAGACAG GTTCAGTTAT ACAGAAAAGT GTAAAAGTCG TCATGTTGAT
GGTGCCATCT TAATGGGGAT GCCGAGGACT GATCCCAATC TTGATAAATT AGTTAATTCA
AATATACCAA CAGTATTTAT AGACCTCGAT ATAGTTGGCA AAAATGCTAC GTATGTGATA
TCCGATAATG TTCAGGGGGC AAAACAGGCT GTGAATTATC TTTATTCCCT TGGCCATATA
AAAATAGGTA TGATTATGGG ACAGCGGATT ACTAAACCGG CACAGGATCG CCTGATTGGT
TTTCAGGAAG AGTTAACGAA TTTAGGTCTG GAGTATAACC CGGAATGGAT TATAGAGGCT
GAATTCGGAG AAGAAGGCGG TTATCAAGCT ATGAAAAGGA TTATTACCCA GGAGATAAGA
CCATCTGCTG TGTTTTGCCA GGGTGATGAA ATGGCCATTG GAGCTATTAA CGCTATAAAA
GAACATGGTT ACAATGTACC TCAAGATTTT TCTATAGTTG GCTTTGATAA TATTGAAATA
AGTAGTTATG TTTCCCCTGG TCTTACTACA ATCCATCAGG ATAAATTGAC TATGGGAAAG
AAGGCCGCCA GTATTCTTCT GGAAATGATT AATAACCCAA ACAAAACCTT TTCTCCCGTA
GTGTTACCAA CAAAATTAAT CGAGAGGGAG TCATGTAGAA AGATTGGATA G
 
Protein sequence
MATIKDIAKL AGVSVTTVSK VINNYPDISD KTKEKVIKIM EQQNYRPNAI ARSLSTSRSR 
SIGVFFTDHL NSGLRHPFFR DIIYGIEKTF FRKGYDLILF AHQWGDRFSY TEKCKSRHVD
GAILMGMPRT DPNLDKLVNS NIPTVFIDLD IVGKNATYVI SDNVQGAKQA VNYLYSLGHI
KIGMIMGQRI TKPAQDRLIG FQEELTNLGL EYNPEWIIEA EFGEEGGYQA MKRIITQEIR
PSAVFCQGDE MAIGAINAIK EHGYNVPQDF SIVGFDNIEI SSYVSPGLTT IHQDKLTMGK
KAASILLEMI NNPNKTFSPV VLPTKLIERE SCRKIG
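
The sequences above are printed in space-separated blocks of ten residues. As a minimal sketch, the helpers below strip that layout and translate the opening codons of the gene. The sketch assumes the standard codon assignments, which translation table 11 shares with table 1 for sense and stop codons; `clean` and `translate` are hypothetical names chosen for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: first, second, then third base cycle through TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the residues."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGGCAACAA TAAAAGACAT TGCAAAACTT"
print(translate(first_blocks))  # MATIKDIAKL
```

The result matches the first block of the protein sequence listed above.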