Gene Hore_20630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20630 
Symbol 
ID7314387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2231247 
End bp2232491 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content38% 
IMG OID643612507 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002509803 
Protein GI220932895 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAA ATAATTTAGT TAAAGTAATT TCGGTAATTG TTTTGATTTT ATCTCTCAGT 
ACAGTAGCTA TGGCAAAGCA GGTAGAATTA ACATTCTGGA ATGGATTTAC GGGACCGGAC
AGGGTTCAGG TTGAAGGCCT GGTAAAAGAA TTTAATGAAA CACACCCTGA TATAAACATA
AAAATGGAAA TCATGCCCTG GGATAGCTTT TTTCAGAAAT TATTGCCTTC CCTGGCAGTA
GGAAAAGGAC CTGATATTGC TGCTTTTGAT ACTTCTTATA TTCCGCGGTA TGCAGAGTCA
GGCGTTATTG CTCCTATTGA TGACTTATAT GAAGGGTATA TTGATAAAGA CACCCTGATA
CCGGCTATGT ATAATAACCT TAAATGGAAG GGTAAAACCT ACGGATCACC AATGAATTAC
ACAAGTTTAC TTCTTTATTA CAACAAGGAT ATGTTCAAAG AAGCCGGGCT TGATCCCAAT
AATCCTCCAA GAACCTGGAA AGAATTAAAA GAATATGCCC TGAAGCTTAC AAAAGATACT
AATAATGATG GTAAAGTAGA CCAGTATGGT TTTGTAATTG CTGCAAAGCA GACTATTCCC
ATGTGGCCTA TAGTTATCTG GGGGAATGGT GGTCGGATAA TCAAAGATGG AGAGGTTTTT
ATAAATAAAC CGAAAGCTGT GGAAGCGGTA GAAAGCATGG CCAGTCTTAT TAAAGAAGAC
GGTATTTCGC CCATTGGGTT GACGGGGGCT GAATGTGATA AATTATTTGA AACCCAGAGG
GCAGCTATGT ATTTCTGTGG TCCCTGGATG GTAAATGGTT TTAAAAATGC TGGCATAAAT
TTCGGTGTGG CCCAGGTACC TGCCAGGGAA GATGGCAGGA GAATAACCCT CGGTACCAGT
GTAGCCATGG TACTAAACAA AGCTAGCCTG GATAAGAAAG AAGCTGCCTA TGAATTCTTT
AAATTCTGGA ATTCTAAAAA GTCACAAATT TACTGGTCTC TAGGATCTGG TTTCCCCCCA
ACCAGGATAG ATATTACAGA AGAAAAATTG GCTCAAAATC CGTTTGTAGT TGAATTTTCT
AGAGCTGCCA GAGACTCAAG GTTTTATTTG CCTAAATTAG AGAATTTCAA CAAAATTAAT
TCAGATGTTA TTGTTCCTGC CCTTGAAAAG GTCCTATATG ATAAAGCTAC AGCTGAGGAA
GCCCTAGACG AAGCAGCATT TATAATTAAA AGAATTATAG ATTAA
 
Protein sequence
MTKNNLVKVI SVIVLILSLS TVAMAKQVEL TFWNGFTGPD RVQVEGLVKE FNETHPDINI 
KMEIMPWDSF FQKLLPSLAV GKGPDIAAFD TSYIPRYAES GVIAPIDDLY EGYIDKDTLI
PAMYNNLKWK GKTYGSPMNY TSLLLYYNKD MFKEAGLDPN NPPRTWKELK EYALKLTKDT
NNDGKVDQYG FVIAAKQTIP MWPIVIWGNG GRIIKDGEVF INKPKAVEAV ESMASLIKED
GISPIGLTGA ECDKLFETQR AAMYFCGPWM VNGFKNAGIN FGVAQVPARE DGRRITLGTS
VAMVLNKASL DKKEAAYEFF KFWNSKKSQI YWSLGSGFPP TRIDITEEKL AQNPFVVEFS
RAARDSRFYL PKLENFNKIN SDVIVPALEK VLYDKATAEE ALDEAAFIIK RIID