Gene Information |
Locus tag | Hhal_0597 |
Symbol | |
ID | 4710860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 672913 |
End bp | 674814 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639855055 |
Product | extracellular solute-binding protein |
Protein accession | YP_001002185 |
Protein GI | 121997398 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
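The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed Gene Length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 672913
END_BP = 674814


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1902
print(protein_length(length_bp))  # 633
```

Both results match the Gene Length (1902 bp) and Protein Length (633 aa) fields in the record.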
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAGCA TGCTCCCATT CCCCCGCAGG CGCTGCGCTC CCCGGCATTC TCCGCCCTGT TGGGCGGCCG TGATGGCGGC GGTGTTCCTG ACGGTTCTGC TCCCGCTCTC CGCCGCGACC GGCAACGAGG AGGACGCCCC GGGCGTCCAC GGCCTGGCGT TGCACGGCGA GCCCAAATAC CCGCCGGACT TCTCCCACTT CGACTACGTC AACCCGAAGG CCCCCAAGGG GGGTACGGTC ATCCGCGAGG CGCGGGGGAG TTACGATAGC CTCAACGGCT ACATCCTGCG CGGGACCAAA CCGCCGGGTC TGGGGATGGT GATCGATACC CTGATGGTCC ACGCCGATGA TGAACCGTTT TCTGTCTACG GGCTGATCGC CGAACGCGTG GAGGTGGCCG AGGACAACGC CTGGGTCGAG TTCAAGCTGC GTGAGGAGGC CCGTTTCCAC GACGGTGAAC CGATCACCGC CGATGACGTG GTCTTCAGTT TCGAAGTGCT CCGTGAGCAC GGGCACCCCA GGTTGCGCTC CTACTACCGC CACGTCGAGT CCGCCGAGGC CGAGGGTCGT CATCGGGTGC GTTTCGAGTT CGCCCACGCC GGCAATCCCG AATTGCCGCT GATCATGGGC GAGCTGCCGG TGCTGCCTCA GCATTACTGG GAGGAACGCG ATTTCAGCCG GACCACCATG CAGCCGCCGT TGGGCAGTGG GCCCTACCGG ATCGCCGAGG CCCGTCCCGG GCGCAGCATC ACCTACGAGC GGGTGGAGGA CTACTGGGCG GAAGACCTGC CCGTGCGTCG CGGGCGGTTC AATTTCGACC GCCTGCGCTA CGACTTTTAC CGAGACGCCA CCGTTGCGCT GGAGGCCTTC CGCGCCGGGG AGTTCGATCT GCGCGAGGAG TATACTGCCC GCCACTGGGC CACCGGTTAC GAGACCTCGG CGCAGCGCGA AGGACGCATG GTTCTCGAGG AGATCGAGCA CAGCCGCCCG GCGGGGATGC AGGGGTTCGT CTTCAACACC CGTCGCCCGG TCTTCGAGGA CCGCGAGGTG CGTCGGGCAC TGAGCTACGC CTTCGACTTC GAGTGGACCA ACCGCCAGCT GTTCCACTCG GCGTATACGC GTACGGCGAG CTACTTCGAA AACTCGGAGT TGGCAGCTCG CGGTGCACCG GGCGAGGCCG AGCAGGCGAT CCTCGCCCCC TTCCGGGAGG AGCTCCCCGA GGCGGTCTTC GAGCCGTACC GGCCGCCGGT CACCGACGGC TCGGGCTGGA ACCGCGAGAA TCTGCTCAAG GCGCTCCGCA TCCTCAAGGA GGCCGGCTGG TCGGTGGGGG ACGACGGCAT CCTGCGTCAC CGGGACAGCG GTCGGCCGCT GGTCTTCGAG CTGCTCCTGG TCAACCCGAG CTTCGAGCGC GTGGCCCTGC CGTTCGTCCA GAACCTTCGG CGCATCGGAG TCCTGGCGCG GGTGCGCACG GTGGATACCA CCCAGTACCA GTACCGGCTG GATCACTTCG AGTTCGATAT GGCTGTGGTG GTGCTGCCGC AATCGCCCTC GCCGGGTCAC GAGCAGGCCA TGTACTGGAG TTCCGAGGCC GCTGACGAAC CGGGCAGCCG GAACTACGCC GGGGTGCGGG ACCCGGTGGT CGACGAACTG GTCGAACGCC TGGTCTCCGC TGAGGATCGC GATGAGCTGG TCCACCTGAC CCGCGCCCTG GACCGCGTCC TACTCGCCGG CCACTACGTG ATCCCCAACT GGCATACCCC GGTCCATCGG GTCGCCTACT GGGACAAGTT CGGGCGGCCG 
GAGACGGCCC CGAAGTACGG ACTGGGGTTC GATACCTGGT GGGTCGACCC GGACAAGGAG CAACGGCTGC GCGAGGCCAA CGGCCCGCGG TCGGTCCGAT AG
|
Protein sequence | MSSMLPFPRR RCAPRHSPPC WAAVMAAVFL TVLLPLSAAT GNEEDAPGVH GLALHGEPKY PPDFSHFDYV NPKAPKGGTV IREARGSYDS LNGYILRGTK PPGLGMVIDT LMVHADDEPF SVYGLIAERV EVAEDNAWVE FKLREEARFH DGEPITADDV VFSFEVLREH GHPRLRSYYR HVESAEAEGR HRVRFEFAHA GNPELPLIMG ELPVLPQHYW EERDFSRTTM QPPLGSGPYR IAEARPGRSI TYERVEDYWA EDLPVRRGRF NFDRLRYDFY RDATVALEAF RAGEFDLREE YTARHWATGY ETSAQREGRM VLEEIEHSRP AGMQGFVFNT RRPVFEDREV RRALSYAFDF EWTNRQLFHS AYTRTASYFE NSELAARGAP GEAEQAILAP FREELPEAVF EPYRPPVTDG SGWNRENLLK ALRILKEAGW SVGDDGILRH RDSGRPLVFE LLLVNPSFER VALPFVQNLR RIGVLARVRT VDTTQYQYRL DHFEFDMAVV VLPQSPSPGH EQAMYWSSEA ADEPGSRNYA GVRDPVVDEL VERLVSAEDR DELVHLTRAL DRVLLAGHYV IPNWHTPVHR VAYWDKFGRP ETAPKYGLGF DTWWVDPDKE QRLREANGPR SVR
|
| |
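As a rough illustration of how the Gene sequence, Protein sequence, and GC content fields relate, the sketch below translates a nucleotide string and computes its GC fraction. It assumes the sequence is given in the coding orientation, as shown in the record, and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, copied from the record.
first_30 = "ATGTCGAGCA TGCTCCCATT CCCCCGCAGG"
print(translate(first_30))  # MSSMLPFPRR
```

The ten residues printed match the start of the Protein sequence field. Passing the full gene sequence to `translate` and `gc_fraction` is one way to compare against the listed protein and the GC content value.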