Gene Hore_20470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20470 
Symbol 
ID7314371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2208237 
End bp2209457 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content40% 
IMG OID643612491 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002509787 
Protein GI220932879 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000403753 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG TTGCTTTACT TGTTGTTACT TTAATGCTTA TTTTTACACT GCCGACCATG 
GCGGTAACCA AATTAACTAT CTGGGAAAAA TATGATGAGG CTACCCAGGA TCCATATTTC
CAGAGTTTAG TAGATGAATT TAACAAAACC CATGAAGACA TTGAAGTTGA AATGCTCCAC
TATAATACTG AAGATTTGAG GGAAAACTTC CAGAACGCTA TGATGGCCGG AGAAGGTCCT
GATGCTACCA TCAGTCCCTT TGACCATGTC GGTGTTTTTG CCGTTTTAGG TATTGCAGAG
CCAATTACTG ATATTTTACC ACAGGACATT AAAGATGCCC TGGTTGACAA TGCTTTACCT
GCCATGAGCC TTGATGGAGA AATGTACGGT GTTCCTATTG ATATGGGTAA CCACTTAATG
CTACTTTATA ATAAGAAATT TGTCAAAGAA CCACCCAAAA CCTGGGACGA GTTAATTAAA
ATTGCCAAAA AACATACTAA AGACCTGGAC GGTGACGGAG TAAATGACCA GTTTGGTCTG
GTTTATAACC TGACTGAACC TTTCTGGTTT ATTCCTTTCA TGAGTGGTCA CGGTGGCTGG
GTTATGGACG AAAACCGTCA GCCAACCTTA AATACTGATG GTGTAGTTAA TGCCTTTAAA
TTTATCAGGG ATCTTAAATT TGAACATAAA ATTGTCCCTG AAGAATGTGA CTATGATATT
GCAGATAGCC TCTTTAAAGA AAATAAAGCC GTATTTCTTA TCAATGGTGA CTGGTCCTTA
AATGGTTATA AAGCAGTAGA AGGCCTTGAT TTTGGTACAG CAGCCATACC TAAATTCAAA
GATTATGAGT GGCCTAAACC GATGATGAGT GGTAATGGTT TCATCATGGC CGAAGGTTTA
TCTGAAGAAA AGCAGGAGGC CCTCTTTGAA TTTATCCGCT TTGTACTGGA GAAAGAAAAC
CAGGTTCGTA TGGTAAAAGA ACTGAGTATC CTTCCCAGTA CCAAAGCTGC CAGAGAAGTA
GAGTTTGAAG ACCCGATATT AAGGGGTTCT ATTGAACAGC TTAACCATAC AAAACCAATG
CCTATTGTTC CTGAAATGAG GGCGATCTGG GATGCCCTGA GAGCACCAAT CCAGAATGTT
ATGAATGGTA GTGCTACTCC TGAAGATGCT GCCAAAGAAG CTCAGGATCT GGCTGAAGAA
GGCGTAGGTG CAATGCATTA A
 
Protein sequence
MKKVALLVVT LMLIFTLPTM AVTKLTIWEK YDEATQDPYF QSLVDEFNKT HEDIEVEMLH 
YNTEDLRENF QNAMMAGEGP DATISPFDHV GVFAVLGIAE PITDILPQDI KDALVDNALP
AMSLDGEMYG VPIDMGNHLM LLYNKKFVKE PPKTWDELIK IAKKHTKDLD GDGVNDQFGL
VYNLTEPFWF IPFMSGHGGW VMDENRQPTL NTDGVVNAFK FIRDLKFEHK IVPEECDYDI
ADSLFKENKA VFLINGDWSL NGYKAVEGLD FGTAAIPKFK DYEWPKPMMS GNGFIMAEGL
SEEKQEALFE FIRFVLEKEN QVRMVKELSI LPSTKAAREV EFEDPILRGS IEQLNHTKPM
PIVPEMRAIW DALRAPIQNV MNGSATPEDA AKEAQDLAEE GVGAMH