Gene Hore_19770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_19770 
Symbol 
ID7312792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2128418 
End bp2129689 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content42% 
IMG OID643612423 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002509719 
Protein GI220932811 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGT TTTTTCTGTT GTTAGTATTA ACAACTCTAT TTGTTGGTAC TCTGGCTGTT 
TCTGCAGGTG CTACCGAAAT TACTATGTGG GCAATGAATA ATGCTCCATC TGAATTAAAC
ATTGCCTGGT TTAATGAAAA GGCTGCAGAA TTTGAAGAAC TGACCGGTAT CAAGGTTAAC
TTTGAAGAGA TAGCCTGGTC CAGCTGCATG GAGGTTATTT CAACTGCACT GGCAACCGGT
GAGGGCGCGA ATGTAATGCA GGTTGGAACA ACCCAGACAC CTTTCTTTGC AGCTACCGGT
GGATTGGTCG AAATAGACAT TCGTGAATTT GGTGGAAAAG ATAATTTTAT GGAAGGCAAT
TTAAAGTCCA CAGTACTGGA TGGTAAGTAC TATGGTGTAC CGTGGATTGC AGAAACCAGA
GTCCTGTTTT ATAACACAGA AATGTTTGAA AAAGCAGGTG TTGAACCTCC CCAGACATGG
GAAGAACTTA TTGAAGTTGG TGAAAAGATA GTTGATGTAT ATGGAGAGGG AACAGCTATT
GCTATTGCTG GTACAAATGC ATGGGACCTG ATCCATAACT GGGCTCCGAT GCTATGGACC
AGGGGAGGAG ATTTCCTGAC ACCAGACTGG AAACGGGCTG CCTTTAACCT ATCTGAAGCA
GGGTATGAAG CAGTAGAATA TTATGTAGAC CTGGTAAGGA ATGGTTTAGC AAGTACAGCA
TGTGCTGAAT ATGACCAGTC CCAGGCTGAT TCAGCTTTTG CCAATGGTGA TGTGGCAATG
GCCTTCCAGG GACCATGGAA TATTTCAGGT ATAAAGAATG ATAACCCCGA TCTTCCATTT
GCAGCTGCTG AACTTCCAGC TGGACCTTAT GGTAGGGCTT CCTTTGCCGG TGGTAGTAAC
CTCGTAGTCC GGAAAAATGC TCCACAGGAT GAAATTGAAG CATCAATTAA GTGGATTAAA
TTCCTGTTAA GTGATACTAA CCTGACAGAG TATGTTAAAC TTTCTAACAT GTTACCAGCA
ACCAAGGATG CATTTTCTGA TCCATTCTTC CAGAGTGAGA TAATGCAGGT ATTTGAAAAA
TCATTGAGCT ATGCACATGC TTATCCATCT TTACCTGCAT GGGGTGAAAT TGAACTGGCT
ATGAGGACCA GTTTTCAGAA CATTCTTACC GATTATATTG ATGGTGTATA TGATGATAAT
ACCGCCAAAA AATACCTTGA TGCTGCTGCA TTAGAGGTAA ACAACATATT AAAGGAACAT
AGTGATAAGT AA
 
Protein sequence
MKKFFLLLVL TTLFVGTLAV SAGATEITMW AMNNAPSELN IAWFNEKAAE FEELTGIKVN 
FEEIAWSSCM EVISTALATG EGANVMQVGT TQTPFFAATG GLVEIDIREF GGKDNFMEGN
LKSTVLDGKY YGVPWIAETR VLFYNTEMFE KAGVEPPQTW EELIEVGEKI VDVYGEGTAI
AIAGTNAWDL IHNWAPMLWT RGGDFLTPDW KRAAFNLSEA GYEAVEYYVD LVRNGLASTA
CAEYDQSQAD SAFANGDVAM AFQGPWNISG IKNDNPDLPF AAAELPAGPY GRASFAGGSN
LVVRKNAPQD EIEASIKWIK FLLSDTNLTE YVKLSNMLPA TKDAFSDPFF QSEIMQVFEK
SLSYAHAYPS LPAWGEIELA MRTSFQNILT DYIDGVYDDN TAKKYLDAAA LEVNNILKEH
SDK