Gene Hore_18230 details


Gene Information

Locus tag: Hore_18230
Symbol:
ID: 7313821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1946127
End bp: 1947287
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 44%
IMG OID: 643612270
Product: transcriptional regulator, LacI family
Protein accession: YP_002509567
Protein GI: 220932659
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
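
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information record above.
START_BP = 1946127
END_BP = 1947287
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1947287 - 1946127 + 1 gives 1161 bp, and 1161 / 3 - 1 gives 386 aa, matching the reported Gene Length and Protein Length.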


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTACAA TAAAAGATAT TGCAGAACGG GCCGGAGTCT CTACCGCTAC TGTATCCCGG 
GTTTTAAATA ACAGTCCCCG GGTTAAAGAG GAGACCAGGG TCCGGATTCT TGAGATTATT
AAGGAGACCG GTTATGGGCG GTACCGGAAG GGTATTAAAA AGAGCCCTGA TAGTAAGGTG
ACTATCAGGG AGGTAGCAAG GGAGGCCGGG GTATCAGTGG CCACTGTTTC CCGGGTCATA
AATGGAGACA GTGCTGTGAG TCCTCAAACC CGGAACAGGG TAAAAAAGGC CATGCAGGCC
CTGAACTATC ACCCCAATTT ACTGGGACGC CAGCTGAGGA GAAGGGAAAC AAAAAGTATT
GGAATTATTA TCCCCGAGAT ATCAAACTTT TTCTTTGCCC GGGTCATCAA AGGGATTGAA
AATGTGGCTG AATGTGAAAA CTATAACGTA ATTTTGATGG AGAGTAACAG AAAGGACCAT
ATCGCGGCCA TAAAGGCCCT GTATGAGCGG CGGATTGATG GTTTAATCTA TATGACCGGC
CACCTGACAA AACAGGAGAT TGATTTTTTC AGAGAGCTCA AATTACCGGT TGTACTCCTT
TCCCAGGACT TTTGTGCTCC TGATATACCT TCAGTAAATA TTAATAATAG GGAAGCAGCG
TTTGAAGCTG TGACCTACCT TCTAAAAAAG GGGTATAAAA GGATTGCTTT TCTGGGTGGT
CCTTTTTCTG ACAGGGTATC TGTTTTCAAC CGGTTTAAAG GATACTGTGG GGCTTTAAAA
GAGTATGGCC TAAAACCCGA TAAGCACTTG ATAAAAGAAG GTGAGTTTAG CCTGGAAAGC
GGTTATGATA TGTGCTACCG GCTCCTTCAG GAAGGTCCAG AAGTTGAGGC AATATTTGCT
GCCAATGATG AAATTGCCAT TGGTGTTATC AAGGCCCTTA CCGTGAAAGG ATATAAAATA
CCCCGGGATA TTGCTGTGAT TGGCTTTGAT GACCTCCCTG TGGCCAGGTT TACAGTTCCT
TCTCTCACCA CGGTCCATCA ACCTATTTAT AAAATGGGCC GGGAAGGAAT GAATCTCCTC
TTGAAACTTA TCAAAAATAT TCCTTTAAAG GAATCCCATG TGACCTTGAA TCATAAACTT
ATTATCAGGG ATTCGGCCTG A
 
Protein sequence
MVTIKDIAER AGVSTATVSR VLNNSPRVKE ETRVRILEII KETGYGRYRK GIKKSPDSKV 
TIREVAREAG VSVATVSRVI NGDSAVSPQT RNRVKKAMQA LNYHPNLLGR QLRRRETKSI
GIIIPEISNF FFARVIKGIE NVAECENYNV ILMESNRKDH IAAIKALYER RIDGLIYMTG
HLTKQEIDFF RELKLPVVLL SQDFCAPDIP SVNINNREAA FEAVTYLLKK GYKRIAFLGG
PFSDRVSVFN RFKGYCGALK EYGLKPDKHL IKEGEFSLES GYDMCYRLLQ EGPEVEAIFA
ANDEIAIGVI KALTVKGYKI PRDIAVIGFD DLPVARFTVP SLTTVHQPIY KMGREGMNLL
LKLIKNIPLK ESHVTLNHKL IIRDSA
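
The gene and protein sequences above are related by translation table 11, and the reported GC content can be recomputed from the gene sequence. The following is a minimal sketch of both calculations, not the database's own method. It relies on the fact that table 11 assigns codons to amino acids the same way the standard code does (the two differ only in permitted start codons); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate complete codons; unknown codons become 'X'."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGGTTACAA TAAAAGATAT TGCAGAACGG"))  # MVTIKDIAER
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported 44% GC content.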