Gene Hore_04270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_04270 
Symbol 
ID7314102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp458982 
End bp460238 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content43% 
IMG OID643610850 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002508180 
Protein GI220931272 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGT TTGCCGTGAT TGCTGTAGTG GCTCTGCTGG TAGTTTCAAT GTTCAGTCTT 
TCTATAGCTG CCAAGACAAA GATTAAGGTG GCCTGTTTCC CTGATCAGGA CTCTGGCTTT
GAAGCAATTT TAGATCAATT CCATGCTGAA CATCCTGATA TTGAGGTAGA GCTGGTAGTT
AATGGTTTTG CCGATCACCA CAACACTCTG CTTACCCAGA TTGCTGCCGG GGCAGAGGTA
CCTGATGTTG CCATGATTGA AATTGGGTAT ATTGCCAACT TTGTCTCCAA AGGTGGTTTT
GTAAATCTCC TCGAAGAACC ATATAACGCT GGCCAGTTTA AGGAAAATAT CGTTCCCTAT
AAATGGGCAC AGGGAAGTAC TGATGACGGC CGCTTAATTG CTTTTCCAAC TGATATTGCA
CCGGGTACAA TTTATTATCG CAGGGATAAA CTGGCTGAAC TTGGCTATGA AATTGAAGAT
ATGAAGACTT TAGAAGACTG GATTGAAGCC GGTTCTCAAT TTGCTAAAGA CTTAGATGGT
GACGGTGTCA ATGATCGCTG GTTACTGGCT GACGCTACTG ATATTTTCTT TATGATTGCT
AAAAGTGGTG AAGAACTTTA CTTTAATGAA GACGGTGAGT GTATAGTTGA TTCCCCAAGG
TTTATCAAGG CCTTTAAGGC TGCCAAGATG GTCAGGGATA TGGGACTCGA TGCCAGGATA
GGTGCCTGGA CCAATGAATG GTATTCTACC TTTAAAGATG GTACAGTCTT AATGCAACCT
TCAGGAGCAT GGCTTGGTGG CCATATCCGT AACTGGATTG CTCCTGACAC AGCAGGCAAA
TGGGGTGTAA CCAACCTCCC GGATGGAATG TACTGTAACT GGGGTGGATC CTTTGCAGCT
ATACCGGAGA AAGCTGAACA TAAGGAAGAA GCCTGGGAAT TTATTAAATT TATTGCCACC
AGAAAGGATA CCCAGATTGC CCAGTTTAAA GCTTCAAATA TCTTCCCGGC CTGGATGCCT
GCCTTTGATG ACCCGGTCTT TCAGGAAGAA ATGGAATTCT ATGGTGGACA GAAGGCCCGT
TTACTCTGGC TTGAAGCAGC CAAGAAGATT CCTAATGTTG TAACCAATAA ATATGATGTT
ATTGCTGAAG AGATTGTTAC TGCAGCCCTG ACAGATGTAC TTAATAATGA TGCTGATCCT
GTAGAAGCAC TCAGGGAAGC TAAAAGAATG ATTGAAAGAA GGATGAGAAG AAGGTAA
 
Protein sequence
MKKFAVIAVV ALLVVSMFSL SIAAKTKIKV ACFPDQDSGF EAILDQFHAE HPDIEVELVV 
NGFADHHNTL LTQIAAGAEV PDVAMIEIGY IANFVSKGGF VNLLEEPYNA GQFKENIVPY
KWAQGSTDDG RLIAFPTDIA PGTIYYRRDK LAELGYEIED MKTLEDWIEA GSQFAKDLDG
DGVNDRWLLA DATDIFFMIA KSGEELYFNE DGECIVDSPR FIKAFKAAKM VRDMGLDARI
GAWTNEWYST FKDGTVLMQP SGAWLGGHIR NWIAPDTAGK WGVTNLPDGM YCNWGGSFAA
IPEKAEHKEE AWEFIKFIAT RKDTQIAQFK ASNIFPAWMP AFDDPVFQEE MEFYGGQKAR
LLWLEAAKKI PNVVTNKYDV IAEEIVTAAL TDVLNNDADP VEALREAKRM IERRMRRR