Gene Hore_15560 details


Gene Information

Locus tag: Hore_15560
Symbol:
ID: 7312591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1666490
End bp: 1667746
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 44%
IMG OID: 643612002
Product: extracellular solute-binding protein family 1
Protein accession: YP_002509300
Protein GI: 220932392
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
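
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1666490
end_bp = 1667746
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # coordinate span vs. reported gene length
print(span_bp % 3 == 0)                          # a CDS should be a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop vs. reported protein length
```

Under these assumptions the three checks agree with the reported values: 1257 bp is 419 codons, and 419 minus one stop codon gives 418 aa.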


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGGA GTCTTATTTT AACACTATCA GTTTTCCTGG TACTGGTCTT GTCCGTATCC 
GCATTTGCTG CTACTGAAAT TACCGTATGG TATCACTCCG GCAGGGGTGG GGAAAGAGAA
GTAATTGAGG ATCAGGTTAA AAGGTTTAAT GCCATGCAGG ATGAAGTCAA GATAAAACTT
GTTCAGTTGC CTGAAGGTAG TTATAACGAA CAGGTTCAGG CTGCTGCCAT GTCAGGGGAC
CTGCCCGATG TACTTGACCT TGATGGTCCT TTTATTGCCA ACTATGCCTG GTCCGGGTAC
CTTCGTCCCT TAGAAGATTA TGTTAGTCCG GAACTTAAAG AGGATCTCTT ACCCTCTATT
TTAGCCCAGG GAACCTACCA GGGTCACCTG TATGCTCTGG GAACCTTTGA TTCAGGACTG
GCTATCTGGG GTAACAAGGA ATATTTAGAA GATGTCGGAG CCCGTATTCC AACCAGTGTT
GAAGATGCCT GGACATTTAC TGAATTTATG GATATCCTGA AAAAGCTTAA AGAACATCCT
GATGTAAAAT ATCCACTCGA CTTTAAAATT AACTATGGTA AGGGTGAATG GTTCAGCTAC
GGTTTCTCTC CTATTTTCCA GGCTTTTGGA GCCGATTTAA TTAATCGTGA TAACTTCACT
ACTGCAGAAG GTGTTTTAAA CGGACCGGAA GCTATGGCTG CTGCCTGGTT CCAGGCCTTG
TTTGAACAGG GTTATGCTAA TCCTAACCCT CCGGGAGATA CTGAGTTTAC AAATGGTGAT
GCTGCATTAT CATGGTGTGG ACACTGGGGT TATAATCAGT ATAAGGATGC CCTCGGTGAT
GATGTTGTCC TGATTCCCAT GCCCAAATTC GCAACCCAGG TAACAGGAAT GGGTTCCTGG
GCCTGGAGTA TAACTCAAAA CTGTGAAAAT CCAGAAGCAG CCTGGAAGTT CATAGAGTTT
ATATTACAGC CAGAAGAAAT AGTAAAGATG ACCAATGCCA ATGGTGCTGT ACCATCCAGA
CTTTCTGCCG CCAAATTATC AGAACCCTAT AAGCCCGGTG GAGAATTAAG GATTTTTGTT
GAACAGCTGC AGAAAATAGC TGTAGAACGC CCTGTAACTC CAGCTTATCC GACTATTACT
GATGCCTTTG CTACTGCTAT TGATAACATT ATTAACGGTG GCGACATCAG GTATGAACTC
AACGAAGCCG TTAGAGCAAT TGACGAAGAA ATTGAGTTTA TGGGTCTTGC TCAATAA
 
Protein sequence
MKRSLILTLS VFLVLVLSVS AFAATEITVW YHSGRGGERE VIEDQVKRFN AMQDEVKIKL 
VQLPEGSYNE QVQAAAMSGD LPDVLDLDGP FIANYAWSGY LRPLEDYVSP ELKEDLLPSI
LAQGTYQGHL YALGTFDSGL AIWGNKEYLE DVGARIPTSV EDAWTFTEFM DILKKLKEHP
DVKYPLDFKI NYGKGEWFSY GFSPIFQAFG ADLINRDNFT TAEGVLNGPE AMAAAWFQAL
FEQGYANPNP PGDTEFTNGD AALSWCGHWG YNQYKDALGD DVVLIPMPKF ATQVTGMGSW
AWSITQNCEN PEAAWKFIEF ILQPEEIVKM TNANGAVPSR LSAAKLSEPY KPGGELRIFV
EQLQKIAVER PVTPAYPTIT DAFATAIDNI INGGDIRYEL NEAVRAIDEE IEFMGLAQ
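
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of how that might be done; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First row of the gene sequence listing, copied from the record.
first_row = "ATGAAAAGGA GTCTTATTTT AACACTATCA GTTTTCCTGG TACTGGTCTT GTCCGTATCC"
row_bases = clean_sequence(first_row)
row_length = len(row_bases)   # six blocks of ten bases
row_gc = gc_percent(row_bases)

print(row_length, round(row_gc, 1))
```

Pasting the full gene listing into `clean_sequence` should yield a string whose length can be compared with the reported 1257 bp, and `gc_percent` on that string can be compared with the reported 44% GC content. The same cleaning step applies to the protein listing and its reported 418 aa.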