Gene Hore_16190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_16190 
Symbol 
ID7312655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1736004 
End bp1737209 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content42% 
IMG OID643612066 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002509363 
Protein GI220932455 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.00305219 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TTTTAACTTT ACTCTCTGTT TTAGTTTTAG TTGTTGGTAT GACTTTAAGT 
GCCAGTGCTG TAAAGCTGGT AGTCTGGGAA TCTCCCGGTC CTGAAGAAGA ATTTATTCAG
GAAATGGGTA AAATTTACAC TGAACAAACC GGTGTTGAAA TTGAGGTGCA ACCAGTAGAC
CAGATTAACC AGGATGATAA ACTGGCCCTG GATGGTCCTG CTGGAAAGGG AGCCGATATT
GTTGTCTGGC CCCATGATGG TATCGGTCGT TCTGTAGAGC AGGGTTTAAT CTGGCCTATT
CCTGAAGATA AGGTGGATAC CAGTGCCTTT ACAGAGTCTT CTCTCAACGC GCTGACTTAC
AAGGGTAAGC TATATGGATT ACCATATGCT GTTGAAAGTG TGGCCCTGTT ATATAACAAG
GATCTGTTAC CAGAAGTACC TGAAACCTTT GATGAGTTTT TGGCTAAAGT AAAAGAACTA
AATAAACCGG CTGAGGGCCA GTTTGGTTTT ATGGCCAACA TCGGTGACCT CTACCACGTT
TTCGGGTTTA TCTCCGGTTA TGGTGGTTAT ATCTTTAAAC AGACCGAAAA TGGTCTTGAC
ATAAATGATA TCGGTCTGGA TAGCCCTGGT GCTATTAAAG CCATGAAGTT TATAAAGAGC
TTCAGGACCT CAGGTTTAAT GCCTGAAGGT ACTACCGGTG ATGTTATGAA TGGTCTCTTT
TCCCAGGGTT CTCTGGCAGC TGTTATTGAC GGACTATGGG CTTTAGAAGG TTATCGTGAA
GCCGGGGTTA ACTTTGGTGT TGCTCCCCTG CCCAGGCTTG ATAATGGTGA ATATCCCCAT
ACCTTCATAG GCGTTAAAGG TTACTACATC AGTGCCTTCA GTGAACATAA AGAAGAAGCC
CTGAAATTCA TTCAGTGGTT AACCACTAAA GAGAATTCCT TTAAACATTA TCAGAAGACA
TATGTAATTC CTCCACGTAA AGATGTAATG GAAATGCCTG AATTTAAAGA AAATAAAGTT
GTTGAAGCTT TTGCTATTCA GGCTTCAAGG GGTATGCCAA TGCCGAATGT ACCTGAAATG
ATGGCTGTAT GGGAACCGGC TAATAATGCC CTTTCTTTCA TCCTTCAGGA TCAGGTTACA
CCTGAAGAAG CAGCTAAACT CTGTGTTCAG AGAATCCAGG ATAATATTGA AATGATGAAA
GAATAA
 
Protein sequence
MKKVLTLLSV LVLVVGMTLS ASAVKLVVWE SPGPEEEFIQ EMGKIYTEQT GVEIEVQPVD 
QINQDDKLAL DGPAGKGADI VVWPHDGIGR SVEQGLIWPI PEDKVDTSAF TESSLNALTY
KGKLYGLPYA VESVALLYNK DLLPEVPETF DEFLAKVKEL NKPAEGQFGF MANIGDLYHV
FGFISGYGGY IFKQTENGLD INDIGLDSPG AIKAMKFIKS FRTSGLMPEG TTGDVMNGLF
SQGSLAAVID GLWALEGYRE AGVNFGVAPL PRLDNGEYPH TFIGVKGYYI SAFSEHKEEA
LKFIQWLTTK ENSFKHYQKT YVIPPRKDVM EMPEFKENKV VEAFAIQASR GMPMPNVPEM
MAVWEPANNA LSFILQDQVT PEEAAKLCVQ RIQDNIEMMK E