Gene Hore_04900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_04900 
Symbol 
ID7314469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp531514 
End bp533319 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content40% 
IMG OID643610913 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002508243 
Protein GI220931335 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000016301 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAGA GAGTAGGTGT TTTATTACTT ACACTTCTGT TAGTTTTTTC TGTTTTTGGT 
GTTGTTGATG CTGTAAATAA TCCAGATACT TATGTCCATG TAACTATCGG TGACCAGTCA
ACTCTTGACC CACATTATTC ATATGATACC GGTAGTAGTG AATTAATATA TCAGGTATAC
GAAACTTTAA TCGACTATAA AGGTTCCAGT GTAACTGAAT TTAAACCTTT ACTGTCTACC
AAAGTGCCTT CTGTTGAAAA TGGTTTAATT AAAGATGGTG GTAAAACCTA CATCTTCCCA
ATTCGTCAGG GTGTTAAATT TAGCAATGGT AACCCCTTAA CACCTGAAGA TGTTGAGTAC
AGTTTTGAAA GGGCTTTAAT TTTAGACCGT GCTTACGGTC CTATCTGGAT GTTCTATGAA
CCATTATTTG GTCTTGGTTC CCTTTCTGAT CTGACCAAGA AGGTTGTTGG TGTAGAAGAT
CCTAAAAAAT TAACTCCTGA ACAGTCAGCT AAAGTATATG CTGAAATTGA TAAGAAAATC
GAAGTCGATG GTAATAACGT TGTTTTCCAT CTGGAAAATC CCTATCCTCC ATTCTTAAAT
ATCCTGGCCA AAGGTGCTTC CTGGGCCAGT ATACTTGATA AAGAATGGTC TATTGAGCAG
GGAGCATGGG ATGGAAGTCC TGAAACCATT GCTAAATACC ATGACCCTGT AAAAGAAGAT
GACCCACTCT TTAATAAGAT GATGGGTACT GGTCCCTTTA TTCTCGTTGA ATGGGTGAAT
GGTGACCATG TTATCTTCAA ACGTAATGAT AATTACTGGC GTGAGCCTGC TAACTTCAAG
ACTGTAATTA TCAAGAACGT TGATGAGCCT ACCACCCGTA TCTTAATGCT GAAACGTGGT
GACGCTGATT CTATCTCCCT AGATTACCAG TACTTCAACC AGATCGAAGG TGTTGAAGGA
ATTAAGATTA CCAGAGGTAT TCCAGTTCTT CAGAACATGA CCATGTTATT TAACTGGGAT
ATCAATTCCA AGGGTAACGA ATATATCGGA AGCGGTAAGC TCGATGGTAA TGGTATACCT
CCTGATTTCT TCACAGATGT CCATGTAAGA AGAGCTTTCA GCTACTGCTT CAATTATGAA
GCATTCATTG AACAGGTAAG GGACGGCCAG TCCATGAAAT TACGTGGTCC AATTGTTAGC
CCCTTACTCG GTTATGACGA AAATTCACCT GTTTACAAAC TTGACCTTGA AAAGGCTGAA
GAAGAATTTA AGAAAGCCTG GGATGGTAAG GTCTGGGAAA AAGGATTTGA ACTTACCATT
ACCTATAATG CTGGTAACAT GGCCCGTAAG ACAGCTGCAG ATATATTCAA GACCTATATA
GAACAGATTA ATCCCAAGTT TAAGGTTAAT ACCCAGGTTG TTCAGTGGTC AACATTCCTG
GATCAGTCAC ACAGGGGTCT CCTGCCATTA CAGATCGGTG GCTGGTTAGC TGACTTCCCT
GATCCCCATA ACTTTGTACA GCCCTTTATC CACTCACAAG GTTATTATGC CGGTAAACGT
GGTGAAAATT ACCAGAAATG GGCTGTTGAA GTTGGAATCG ATGACCTCAT TGAAAAGGGT
ATAACTACTC AGGATAAAGA AGAACGTGAA AAGATCTATA AGAAGTTACA GCAGATGTCC
CACGACTATG CTATCGATGT CTGGATTGAC CAGCCATTAA GTGCCAGGAT TGAAAGAAGC
TGGGTTAAAG GCTGGTATCC TAACTCCATG CGTCCCGGAC AGGACTTCTA TATCCTGGAT
AAATAA
 
Protein sequence
MSKRVGVLLL TLLLVFSVFG VVDAVNNPDT YVHVTIGDQS TLDPHYSYDT GSSELIYQVY 
ETLIDYKGSS VTEFKPLLST KVPSVENGLI KDGGKTYIFP IRQGVKFSNG NPLTPEDVEY
SFERALILDR AYGPIWMFYE PLFGLGSLSD LTKKVVGVED PKKLTPEQSA KVYAEIDKKI
EVDGNNVVFH LENPYPPFLN ILAKGASWAS ILDKEWSIEQ GAWDGSPETI AKYHDPVKED
DPLFNKMMGT GPFILVEWVN GDHVIFKRND NYWREPANFK TVIIKNVDEP TTRILMLKRG
DADSISLDYQ YFNQIEGVEG IKITRGIPVL QNMTMLFNWD INSKGNEYIG SGKLDGNGIP
PDFFTDVHVR RAFSYCFNYE AFIEQVRDGQ SMKLRGPIVS PLLGYDENSP VYKLDLEKAE
EEFKKAWDGK VWEKGFELTI TYNAGNMARK TAADIFKTYI EQINPKFKVN TQVVQWSTFL
DQSHRGLLPL QIGGWLADFP DPHNFVQPFI HSQGYYAGKR GENYQKWAVE VGIDDLIEKG
ITTQDKEERE KIYKKLQQMS HDYAIDVWID QPLSARIERS WVKGWYPNSM RPGQDFYILD
K