Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_04900 |
Symbol | |
ID | 7314469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 531514 |
End bp | 533319 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643610913 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002508243 |
Protein GI | 220931335 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000016301 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAGA GAGTAGGTGT TTTATTACTT ACACTTCTGT TAGTTTTTTC TGTTTTTGGT GTTGTTGATG CTGTAAATAA TCCAGATACT TATGTCCATG TAACTATCGG TGACCAGTCA ACTCTTGACC CACATTATTC ATATGATACC GGTAGTAGTG AATTAATATA TCAGGTATAC GAAACTTTAA TCGACTATAA AGGTTCCAGT GTAACTGAAT TTAAACCTTT ACTGTCTACC AAAGTGCCTT CTGTTGAAAA TGGTTTAATT AAAGATGGTG GTAAAACCTA CATCTTCCCA ATTCGTCAGG GTGTTAAATT TAGCAATGGT AACCCCTTAA CACCTGAAGA TGTTGAGTAC AGTTTTGAAA GGGCTTTAAT TTTAGACCGT GCTTACGGTC CTATCTGGAT GTTCTATGAA CCATTATTTG GTCTTGGTTC CCTTTCTGAT CTGACCAAGA AGGTTGTTGG TGTAGAAGAT CCTAAAAAAT TAACTCCTGA ACAGTCAGCT AAAGTATATG CTGAAATTGA TAAGAAAATC GAAGTCGATG GTAATAACGT TGTTTTCCAT CTGGAAAATC CCTATCCTCC ATTCTTAAAT ATCCTGGCCA AAGGTGCTTC CTGGGCCAGT ATACTTGATA AAGAATGGTC TATTGAGCAG GGAGCATGGG ATGGAAGTCC TGAAACCATT GCTAAATACC ATGACCCTGT AAAAGAAGAT GACCCACTCT TTAATAAGAT GATGGGTACT GGTCCCTTTA TTCTCGTTGA ATGGGTGAAT GGTGACCATG TTATCTTCAA ACGTAATGAT AATTACTGGC GTGAGCCTGC TAACTTCAAG ACTGTAATTA TCAAGAACGT TGATGAGCCT ACCACCCGTA TCTTAATGCT GAAACGTGGT GACGCTGATT CTATCTCCCT AGATTACCAG TACTTCAACC AGATCGAAGG TGTTGAAGGA ATTAAGATTA CCAGAGGTAT TCCAGTTCTT CAGAACATGA CCATGTTATT TAACTGGGAT ATCAATTCCA AGGGTAACGA ATATATCGGA AGCGGTAAGC TCGATGGTAA TGGTATACCT CCTGATTTCT TCACAGATGT CCATGTAAGA AGAGCTTTCA GCTACTGCTT CAATTATGAA GCATTCATTG AACAGGTAAG GGACGGCCAG TCCATGAAAT TACGTGGTCC AATTGTTAGC CCCTTACTCG GTTATGACGA AAATTCACCT GTTTACAAAC TTGACCTTGA AAAGGCTGAA GAAGAATTTA AGAAAGCCTG GGATGGTAAG GTCTGGGAAA AAGGATTTGA ACTTACCATT ACCTATAATG CTGGTAACAT GGCCCGTAAG ACAGCTGCAG ATATATTCAA GACCTATATA GAACAGATTA ATCCCAAGTT TAAGGTTAAT ACCCAGGTTG TTCAGTGGTC AACATTCCTG GATCAGTCAC ACAGGGGTCT CCTGCCATTA CAGATCGGTG GCTGGTTAGC TGACTTCCCT GATCCCCATA ACTTTGTACA GCCCTTTATC CACTCACAAG GTTATTATGC CGGTAAACGT GGTGAAAATT ACCAGAAATG GGCTGTTGAA GTTGGAATCG ATGACCTCAT TGAAAAGGGT ATAACTACTC AGGATAAAGA AGAACGTGAA AAGATCTATA AGAAGTTACA GCAGATGTCC CACGACTATG CTATCGATGT CTGGATTGAC CAGCCATTAA GTGCCAGGAT TGAAAGAAGC TGGGTTAAAG GCTGGTATCC TAACTCCATG CGTCCCGGAC AGGACTTCTA TATCCTGGAT AAATAA
|
Protein sequence | MSKRVGVLLL TLLLVFSVFG VVDAVNNPDT YVHVTIGDQS TLDPHYSYDT GSSELIYQVY ETLIDYKGSS VTEFKPLLST KVPSVENGLI KDGGKTYIFP IRQGVKFSNG NPLTPEDVEY SFERALILDR AYGPIWMFYE PLFGLGSLSD LTKKVVGVED PKKLTPEQSA KVYAEIDKKI EVDGNNVVFH LENPYPPFLN ILAKGASWAS ILDKEWSIEQ GAWDGSPETI AKYHDPVKED DPLFNKMMGT GPFILVEWVN GDHVIFKRND NYWREPANFK TVIIKNVDEP TTRILMLKRG DADSISLDYQ YFNQIEGVEG IKITRGIPVL QNMTMLFNWD INSKGNEYIG SGKLDGNGIP PDFFTDVHVR RAFSYCFNYE AFIEQVRDGQ SMKLRGPIVS PLLGYDENSP VYKLDLEKAE EEFKKAWDGK VWEKGFELTI TYNAGNMARK TAADIFKTYI EQINPKFKVN TQVVQWSTFL DQSHRGLLPL QIGGWLADFP DPHNFVQPFI HSQGYYAGKR GENYQKWAVE VGIDDLIEKG ITTQDKEERE KIYKKLQQMS HDYAIDVWID QPLSARIERS WVKGWYPNSM RPGQDFYILD K
|
| |