Gene Hore_15040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_15040 
Symbol 
ID7313097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1606126 
End bp1607502 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content35% 
IMG OID643611947 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002509249 
Protein GI220932341 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.751229 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAACA AAAAACATTT ATCAACTATT TTAGTGGCTA TATTAGTATT AGGAGCTATC 
CTGTATGTTG GTCCTGTAAC TATGGCATGG GGAAATAAAG ATGATTCAGA TGAAGATTTT
GAATGGGTTA CTGCAAATAG AATTGGTAAT CCTGATGCAG AAATTGTCTT AACAGCTAAT
ATACAGAGAT TTGATACTTT AAAAACTCCC TTTGATACTA AAAAAGAATA TATTAAAAAG
GCAGCCACAG CCTGGGCTAA AGAACACCCT AATGTAAGAA TTGATATTGA AGTTGCACCT
GCCGGACAGA CATCAATGAC AATGTCTAAA ATAATGACTC AAGCCATTTC TGGAAATGCA
CCTGATTTTG CCCATATTGA TTCTTTCTGG ATTGGAAGAT TTATTGACCG GGGAATTTTA
CAGCCTTTAA ATGATTATTT ATCTCAAGAG ACTATAGATG ATTTCTATGG TTTTACAAAG
AAAGTGACTA TGAGAGACGG TAAAATGTAT GCTATTTGGG CTCAAACAGA TGCCAGGTTT
TTATATTATA GAAAAGACTG GATAAAAAAT CCTCCTAAAA CATGGGATGA ATTAATTGAA
ACAGCATTGG CAATGAAAGA AAAACACAAC GTTCATGGTT ATCTTGCCTG GTTGGGAACG
TGGGAAGGTG CTGTTAATGG TAATGTATGG CCATATTTCT GGGCTCAAGG TGGAAGAATA
TTTGATGAAT CTGGGAGACC AGTAATTGGT GAAGGTAAGA ACAGAGAGGC TTTAATTAAT
ACATTAGACT TTTTAAATAG ACTGGTTGAC ACTGGAGCAG CTCCCAGAAT GGTTGCTTCT
ATAACAAGTA TTGATCCAAT ACTGGCTGAA GCTAAAGCAA ACAGTGTAGC CATGGTAGCC
AATGGAAACT GGTTTTATGA TATGTTATTG GAGTCTGTTG ATAATGCAGA AGAAAAATGG
GATTTTGTTC CCTTGCCTCA AATGAAAGAA TCCCAGAGAG CTAATAGTAA TGGTGGATGG
ACTTATGCAG TCTTAACAGA TGATCCAGTA AAACAGGAAC TAGCAGTCAG CTATATTATG
GCTGTTCTTG GAAGCAAGGA AGCTATGGGT GAAAGATGTA AGGTATACAA TTATTTACCA
ACAAGAAAAA GTGTGTATAA AGAGTATCCT TATTTTGCAG ATAATCCTGT CCAACAAAGA
TTTGCTAAGG AACTTAAGTA TGGTCATGCT AGACCTTCTA ATTCTTTATA TGGTGAGGTA
TCAGATTTAG TTCAAAAAGA ACTTGGACGA ATTCTTACTG GTCAAACTAC AGTAGAAAAA
GCAGTTGATA AAATACAAAA AGTAGCTTTA CAAGCTTGGA AAGAAAATAA AAGATAG
 
Protein sequence
MINKKHLSTI LVAILVLGAI LYVGPVTMAW GNKDDSDEDF EWVTANRIGN PDAEIVLTAN 
IQRFDTLKTP FDTKKEYIKK AATAWAKEHP NVRIDIEVAP AGQTSMTMSK IMTQAISGNA
PDFAHIDSFW IGRFIDRGIL QPLNDYLSQE TIDDFYGFTK KVTMRDGKMY AIWAQTDARF
LYYRKDWIKN PPKTWDELIE TALAMKEKHN VHGYLAWLGT WEGAVNGNVW PYFWAQGGRI
FDESGRPVIG EGKNREALIN TLDFLNRLVD TGAAPRMVAS ITSIDPILAE AKANSVAMVA
NGNWFYDMLL ESVDNAEEKW DFVPLPQMKE SQRANSNGGW TYAVLTDDPV KQELAVSYIM
AVLGSKEAMG ERCKVYNYLP TRKSVYKEYP YFADNPVQQR FAKELKYGHA RPSNSLYGEV
SDLVQKELGR ILTGQTTVEK AVDKIQKVAL QAWKENKR