Gene Hore_19830 details


Gene Information

Locus tag: Hore_19830
Symbol:
ID: 7312798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2135765
End bp: 2137360
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 36%
IMG OID: 643612429
Product: leucine-rich repeat protein
Protein accession: YP_002509725
Protein GI: 220932817
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID:
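
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2135765
end_bp = 2137360
gene_length_bp = 1596
protein_length_aa = 531


def span_length(start, end):
    """Length of a region given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def coding_codons(gene_length):
    """Number of amino acids, assuming one terminal stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(gene_length_bp)

print(computed_length == gene_length_bp)      # True
print(computed_protein == protein_length_aa)  # True
```

Under these assumptions the listed gene length and protein length are consistent with the start and end coordinates.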


Plasmid Coverage information

Num covering plasmid clones: 61
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAA AGCCTTTTTC AAAAATAGTT ACACTGTTTG TTATAATTAT TTTTTCAATC 
TTTCTTTTTG CCTGGAATGG GGAAGCTGTC GAAGATACAG GAATTATTGT TTTTGAAGAT
GAGGACCTGA AAAAAGTGGT AATGAGGTCA CTGAGCAAGC CGGAAGGACC TGTCTTCAGG
CCAGAGGTGG AAAATTTAAC TGAGTTCAGT ATCCCCTCTT TTCGTCATTA TGAAATAAAT
TCTATAAAGG GATTAGAAGC CTTTTTAAAC ATAAAAACCC TGCGGATTGG GCCAAATTAT
ATCAGTGATT TAACCCCCCT GGCCCATTTA ACAGACCTTG AAAGACTCTA TATCTTTGAA
AATCATATTG AAGATTTAAG TCCACTGGGA AAATTGAAGG AATTAAGGGA GTTAATAATC
AGGGGGTTAC CTCCATATAA AAAGGGATTG CCTTCAGGTA AATATTCAGG ACATTATATT
GAGGACATAA GTCCTCTGGC CGGTTTAGTA AAACTTGAAT ACCTTAAATT ATCCCATCAA
AAGATATCAA ATTTAGAGAC CCTGACTCAA CTACCAAACT TAAAAACCCT GAATGTAGCC
TATAACAGTA TATCTGACCT TAAACCCCTG ACTGCTTTGA CAGGGTTAAG CCACCTGGAT
CTGGAAGCCA ACAATATTAA AGATATATCT CCATTAAGAG GGTTAAAAAA ACTTACCTAT
TTAAATCTGA TCAGAAATGA GTTGACCGGT GTAAAACACC TTTCCAGTCT GGAAGGTTTG
CAGGTATTGC TGTTAAGCGG GAATGACCTC CGGAATATTG CCTCCCTTAC CCGACTGGTA
AACCTTGAGA AACTGGATAT CAGTGACAAT AATATCAGTG TTGCCCCCGG TTTAAAGGAA
TTTAAAGGTC TGAAGGAATT GAATATAAGT GGCAACCCCA TTGACGATAT TAATTTTATC
AGCGAGTGCA GGAAACTTGA AAGATTACTG GCCTTCAATT GTGAGATAAG GGACATATCA
CCTTTAAGGG GACATAACAG TTTAAAAGAG CTTTTTTTGC ATAACAACAG GATTACCGAT
ATTAGTCCCC TTGAAGGGCT GAACACTCTC GAAAGGCTTG ACCTGAGTGG AAATAGTATA
GAAAATGTTT CAGTCATATC TGGACTCAAT AAACTTAAAT ATTTAGACCT TGAGGGGTGT
GGTCTGACCG CGATAGAATT TTTAAAAGAC CTGGGATCCC TGGAATACCT TGAACTTGAA
AATAATAGAA TAAGCCAGAT TGAGCCTTTA AAAAAACATA TTAATTTAAA AACCCTGGTT
CTTGATAATA ACCAGATTAA AGATATAAGT ACCCTGGGTG AATTGATGAA CTTAAAGGTG
CTATCATTAA ACGATAATCA GATTGAAAAC ATCGATTCTT TGACTGGTTT AAACCAGCTG
GAAGTATTAT ATATTTCGGG CAATAGAATC AGGAATATTA AACCCCTTTT AAAATTAAAT
AATTTGAGTG TTGTAGCAAT AAAAAATAAT CAGTTTAAGC TTGATGAAGA TGTTATAAAA
AAGCTGGAGG ATAATAAAGT AACTGTTGTG TATTGA
 
Protein sequence
MSKKPFSKIV TLFVIIIFSI FLFAWNGEAV EDTGIIVFED EDLKKVVMRS LSKPEGPVFR 
PEVENLTEFS IPSFRHYEIN SIKGLEAFLN IKTLRIGPNY ISDLTPLAHL TDLERLYIFE
NHIEDLSPLG KLKELRELII RGLPPYKKGL PSGKYSGHYI EDISPLAGLV KLEYLKLSHQ
KISNLETLTQ LPNLKTLNVA YNSISDLKPL TALTGLSHLD LEANNIKDIS PLRGLKKLTY
LNLIRNELTG VKHLSSLEGL QVLLLSGNDL RNIASLTRLV NLEKLDISDN NISVAPGLKE
FKGLKELNIS GNPIDDINFI SECRKLERLL AFNCEIRDIS PLRGHNSLKE LFLHNNRITD
ISPLEGLNTL ERLDLSGNSI ENVSVISGLN KLKYLDLEGC GLTAIEFLKD LGSLEYLELE
NNRISQIEPL KKHINLKTLV LDNNQIKDIS TLGELMNLKV LSLNDNQIEN IDSLTGLNQL
EVLYISGNRI RNIKPLLKLN NLSVVAIKNN QFKLDEDVIK KLEDNKVTVV Y
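
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acid to every codon as the standard genetic code (the two differ in which codons may act as start codons), and it strips the spaces that split the sequence into 10-character groups. The helper names `clean` and `translate` are hypothetical.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as displayed above.
first_line = "ATGAGTAAAA AGCCTTTTTC AAAAATAGTT ACACTGTTTG TTATAATTAT TTTTTCAATC"
print(translate(first_line))  # MSKKPFSKIVTLFVIIIFSI
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` would apply the same check to the whole record.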