Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0244 |
Symbol | |
ID | 7401170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 263745 |
End bp | 265472 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643707307 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002564919 |
Protein GI | 222478682 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.794885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.536815 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCCC GCGACGAGAG CGTTTCACGG AGAAAGTTCC TCGGTGCCGC CGGTGGCGCT GCAGTAACGG TCGGTCTCGC CGGCTGTTCC GACAACGACG GCGAGGATTC CGACGGTTCC GACGGCTCGG ACGGCTCCGA TGGTTCGAAC GGTTCGGACG GCTCCGACGG TTCGGACGGC GGCGACGACA CCAGCCTCCT CCGGTACGGC CGGGGGAGTC ACTCGGCGAC GCTGGACTTC CAGAACAGCA CGAGCGGCGA GGTTGCGAAG GTGACCGAGC AGATCTACGA CACGCTCATC AACTTCGAGC CGGGCGAGTC GACGCTCACC GACGGGCTCG CCTCGGACTA CTCGCTCGAC GGCGAGACGG CATCGCTCAC GCTGAAGGAG GGGGTCACCT TCCACAACGG CGAGGAGTTC ACCGCACAGG ACTTCGAGGC GACGTACCGC CGCTTCGTCG ACTCGGAGTA CGAGTACTAC GCTGGCGACG ACTACGTCTC CGCGTACGGT CCCTTCACGC TCGGCAACTG GATCGACGAG ATTCAGGTCG ACGGCGACTA CGAGATGACG ATCCAGCTCA CGCAGACGTA CGCGCCGTTC CTGCGTAACC TGGCGATGTT CGCGGCCGCT GTCCACTCCG AGGCCGCCAT CGAGGAGTAC GGCACCGACC TGTCCGAAAA CGCGGTCGGA ACGGGGCCGT TCGAGCTCAA CACCCTCGAC GACTCCAACG AGCAGATCCG ACTCGACGCG TACGACGACT ACTGGGGCGA CGGCCCGCAG GTCGACGAAG CCGTTTTCGT CACGGTCGGC GAGAACTCCA CCCGAGCGCA GTCGCTCGCG AGCGGAGAAC TCGACATTAT CGACGGGCTC GGCGCGCAGT CCTCCCAGCA GGTCGAAAGC GCCGACAGCG CCGAACTGGT CCGCACCGAG GGGATCAACA TCGGCTACAT GGCGTTCAAC ATCGCGGCGG TCGAGGAGTT CCAGGACCGC CGCGTCCGTC AGGCCGTCAG CCACGCGATC AACACCGAGG CGATCGTCAA CCAGATCTAC GCCGGCTTCG CGACGGAGGC CAGCCAGCCG CTGCCGCCGA ACGTGCTGGG CCACAACGAC GACATCGAGC CGTACCCGTA CGACCCCGAG CAGGCACAGA GCCTGCTGGA GGAAGCCGGC TACGGCGACG GGTTCTCCTT CGAACTGGCG ACGTTCCAGA ACCCCCGCGG ATACAACCCC TCGCCGCTCC AGACGGCCGA GACGGTCGCC TCCAACCTCG GCGAGGTCGG CATCGAGGTC GAGATCAACC AGCAGTCGTT CGCGCCGTTC CTTGAGTACA CGGCTCAGGG CCGCCACGAC GCCTGCTTCC TCGGCTGGTA CACCGACAAC GCGGACCCGG ACAACTTCGC GTACGTACTC TTACACCCGC AGGTTGAGGA GAGCGAACTC ACCGAGGGCC AGGACTGGGT GAGCTTCGAT ACCGAGGGGT ACAACACGAG TAACCGCTCG GCGTGGGCGA ACCAGGAATA CATGGACCTC GTCGAGGAAG GTCAGCAGAC GACCACAGAG AGCGACCGCG CGGAGCTCTA CAACGAGGCG ATGCAGATCG CCCACGACGA GGCGCCGTGG GTGTACCTGG ACTACGCCGA GGAGCTGCGG GGCGTCGCCA ACCGGGTCAA CGGGTTCCAG ATCGCCGCGA TCAGCGGCCC GTACCTGAAC CTGGTCTCGC TGGAGTAG
|
Protein sequence | MSSRDESVSR RKFLGAAGGA AVTVGLAGCS DNDGEDSDGS DGSDGSDGSN GSDGSDGSDG GDDTSLLRYG RGSHSATLDF QNSTSGEVAK VTEQIYDTLI NFEPGESTLT DGLASDYSLD GETASLTLKE GVTFHNGEEF TAQDFEATYR RFVDSEYEYY AGDDYVSAYG PFTLGNWIDE IQVDGDYEMT IQLTQTYAPF LRNLAMFAAA VHSEAAIEEY GTDLSENAVG TGPFELNTLD DSNEQIRLDA YDDYWGDGPQ VDEAVFVTVG ENSTRAQSLA SGELDIIDGL GAQSSQQVES ADSAELVRTE GINIGYMAFN IAAVEEFQDR RVRQAVSHAI NTEAIVNQIY AGFATEASQP LPPNVLGHND DIEPYPYDPE QAQSLLEEAG YGDGFSFELA TFQNPRGYNP SPLQTAETVA SNLGEVGIEV EINQQSFAPF LEYTAQGRHD ACFLGWYTDN ADPDNFAYVL LHPQVEESEL TEGQDWVSFD TEGYNTSNRS AWANQEYMDL VEEGQQTTTE SDRAELYNEA MQIAHDEAPW VYLDYAEELR GVANRVNGFQ IAAISGPYLN LVSLE
|
| |