Gene Hore_19500 details

Gene Information

Locus tag: Hore_19500
Symbol:
ID: 7312765
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2090139
End bp: 2091173
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 43%
IMG OID: 643612396
Product: D-xylose ABC transporter, periplasmic substrate-binding protein
Protein accession: YP_002509692
Protein GI: 220932784
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4213] ABC-type xylose transport system, periplasmic component
TIGRFAM ID: [TIGR02634] D-xylose ABC transporter, substrate-binding protein
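
The coordinate and length fields above are consistent with one another, which can be checked with a small sketch. It assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length_bp = gene_length(2090139, 2091173)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```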


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value: 0.497232
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TATTTGCAAT TTTAACACTG GTAATGCTTT TCACAGGACT GGTTGTAACA 
GAGGCACTGG CTGATGATGA ATTGGTAATC GGTCTTTCAA TGGACAACCT GAGGCTGGAG
AGATGGCAGC ATGACCGTGA CATCTTTGTT AAAAGGGCTG AAGAATTAGG TGCAAAAGTA
TTGGTACAGT CTGCCAATAG TGATGATATG GTGCAGTTAT CCCAGGCTGA AAACCTGATT
ACCCAGGGCA TCGACGTACT GGTAGTTGTG CCCCACAATG GTAAAATTAT GGGCAGTATT
GTCAGGGAAG CTCACCGTAA TGGAGTTAAG GTCCTGGCTT ATGATAGATT GTTAATGGAC
TGTGATGTTG ACCATTATAT TTCTTTCGAT AATATCCGGG TTGGAGAGTT ACAGGCCCAA
TACCTGGTTG ACAGGAAACC AAGTGGGAAG TACTTTCTTT TAGGTGGTTC TCCTACCGAC
AATAACGCTA AACTCTTCAG GCAGGGACAG ATGAATGTAC TTAACCCCTA TATTGAAAGG
GGAGATATTG AGGTAGTAGG TGATCAGTGG GCCAAGGACT GGTTACCTCA GGAAGCCATG
AAGATAATTG AGAATGCCCT GACTGCAAAC AACAATGATA TTGATGTAAT TGTTGCTTCC
AATGATAGTA CTGCCGGTGG AGCTATTGAA GCCCTGGCAG AGCAGAACCT TGACGGAAAG
GTGCTGGTAT CTGGTCAGGA TGCCGACCTG GCTGCCTGTC AGCGTGTTGT TGAAGGTACT
CAGACCATGA CCATTTATAA ACCAATTAGT AAGCTGGCTA ACAGGGCAGC TGAAGTCGCA
GTAGCCATGG CTAAAGGGGA AGAAGTTAAA ACAAATGGTA AGGTAAATAA TGGCAAGATC
GATGTTCCTT CCATTCTGCT GGAGCCAATT GCAGTCGATA AAGATAATAT GGTTGAAACA
ATTATCAAAG ATGGTTTCCA TAGTTTAGAA GACGTATATA AAAATGTACC CAGGGAAGAA
TGGCCTGAAC TATAA
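
The gene sequence is printed in blocks of ten bases with line breaks, so it needs to be joined before any computation such as re-deriving the GC content field. The following is a minimal sketch; the helper names are hypothetical, and the example string holds only the first two blocks and the last line of the sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustrative fragment only: paste the full gene sequence here to
# recompute the GC content for the whole gene.
example = clean_sequence("ATGAAAAAAA TATTTGCAAT\nTGGCCTGAAC TATAA")
print(len(example), round(gc_percent(example), 1))
```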
 
Protein sequence
MKKIFAILTL VMLFTGLVVT EALADDELVI GLSMDNLRLE RWQHDRDIFV KRAEELGAKV 
LVQSANSDDM VQLSQAENLI TQGIDVLVVV PHNGKIMGSI VREAHRNGVK VLAYDRLLMD
CDVDHYISFD NIRVGELQAQ YLVDRKPSGK YFLLGGSPTD NNAKLFRQGQ MNVLNPYIER
GDIEVVGDQW AKDWLPQEAM KIIENALTAN NNDIDVIVAS NDSTAGGAIE ALAEQNLDGK
VLVSGQDADL AACQRVVEGT QTMTIYKPIS KLANRAAEVA VAMAKGEEVK TNGKVNNGKI
DVPSILLEPI AVDKDNMVET IIKDGFHSLE DVYKNVPREE WPEL