Gene Hore_20870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20870 
Symbol 
ID7313320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2260408 
End bp2261700 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content43% 
IMG OID643612534 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002509827 
Protein GI220932919 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG GGTTATCTTT AATACTTGTT ATTTCATTAA TGCTGGTATT AAGTAGTGTT 
GTTTTTGCAG AAGAACAGGT TGAAATTACA ATTGCTGGTG GTAGTGTAGG TATCGAGCTC
GACCTGACCA AAGAAGCAGC TCAATTATAT ATGGAAAGAC ACCCCAATGT AAAGGTTAAT
GTACTGGACA CACCTGATTT AGCCAATGAC AGGCTCGGGT TATATCTTCA GTTTTTGGAA
GCAAAAAGCC CCAAAATTGA CGTCTACCAG ATTGATGTTA TCTGGCCCGG GGATTTAGCT
GAACATTTTG TTGATCTTTA TGAGTATGGG GCTGAAAAAT ATGTTGATGA TCACTTCCAG
CCTATTGTAG AAAATAATAC TGTTGAGGGC AGACTGGTAG CTATGCCCTG GTTTACTGAT
GCCGGTCTTC TTTACTACCG TAAGGATCTT CTTGAAAAAT ATGACCTGGA AGTCCCCAAA
ACATGGGAAG AATTAGAAAG GGCGGCCAAG ATTATTCAGA CCGGAGAAAG GGCTGCCGGA
AACCAGGACT TCTGGGGTTA TATCTGGCAG GGTAATGCTT ATGAAGGTTT AACATGTGAT
GCCTTAGAAT GGGTTGCCTC CAATGGTGGA GGAACTATTA TCAGTCCTGA CAAAAAAATT
ACTATTAACA ATGAAAAGGC AATAGAAGCC ATCGAAATGG CCGCTGACTG GGTAGGCTGG
ATTTCTCCTC CAGGAACAAC TGGTCTTGTT GAAGAAAGCA CCCGTAAGAT GTGGGAAGCA
GGTAATGCCG CCTTTATGAG AAACTGGCCT TACTGTTATA AACTTGGTAA TGCTGAAGGG
TCTGCCATCA AAGGCAAGTT TGATGTAGCT CCCCTACCGG CCGGTGATAG TGGTAACGGG
GCTGCTACCC TCGGTGGTTG GAACCTGGCT GTAAGCAAGT ACAGTGAACA CCCTGAAGTT
GCCGCTGATT TTGTTTTCTT CCTGACCGGT TATGAAATTC AGAAACTCCG GGCTACCAAA
GGTTCCTTTA ATCCGACCAT TAAAGCCCTT TATGAAGATG AAGAAGTCCT GGAAGCTAAC
CCCTTCTTTG GTAAGCTTTA TGATGTTTTT GTAAATGCTG TTGCCCGTCC TTCTACTGCC
ACTGCTCCTA ACTATAATGA AGTCTCCAGG TTATTCTTCC AGGCTGTACA TTCAGTCCTT
TCTGGTGAAA TGGATGCCAG GACTGCAGTG GAATACTTAG AATTAGATCT TCAGGATTTA
ACCGGTTTTG AAATTGGTGA ACCTCAAAAA TAA
 
Protein sequence
MKKGLSLILV ISLMLVLSSV VFAEEQVEIT IAGGSVGIEL DLTKEAAQLY MERHPNVKVN 
VLDTPDLAND RLGLYLQFLE AKSPKIDVYQ IDVIWPGDLA EHFVDLYEYG AEKYVDDHFQ
PIVENNTVEG RLVAMPWFTD AGLLYYRKDL LEKYDLEVPK TWEELERAAK IIQTGERAAG
NQDFWGYIWQ GNAYEGLTCD ALEWVASNGG GTIISPDKKI TINNEKAIEA IEMAADWVGW
ISPPGTTGLV EESTRKMWEA GNAAFMRNWP YCYKLGNAEG SAIKGKFDVA PLPAGDSGNG
AATLGGWNLA VSKYSEHPEV AADFVFFLTG YEIQKLRATK GSFNPTIKAL YEDEEVLEAN
PFFGKLYDVF VNAVARPSTA TAPNYNEVSR LFFQAVHSVL SGEMDARTAV EYLELDLQDL
TGFEIGEPQK