Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_3666 |
Symbol | |
ID | 7402457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012030 |
Strand | + |
Start bp | 429644 |
End bp | 431335 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643710197 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002567763 |
Protein GI | 222481527 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACA CCACCGAGCC AACCGAAGAG GTAGCAGGCG AGGTTTCGGT CACGGACCTC CTCACGAACA AATCCCGGCG TCGGTTCCTC TCGGGTCTCG GTGCTGTCGG GGCGGCCGGT CTCGCCGGTT GTACTGGTTC AGGAGGCTCT GAAGACACGA CCACTAGCTC CGACGGCACA ACCAGCAGCT CCGACTCAAC GCTCACTGCA AACGTCTCAC AGCGGATCGG CACAATCGAC CCCGCGAAAG GGACCGATTA CGTGCAAGCG ATGGTGCTTG TGAACCTCTA CGATCCGCTC GTGTTTCCCA ACAGCGAGGG TGAAATTCAA CCGAATCTCG CCTCCGATTG GACCGTTTCG GAGGACAGCA CGACCTACAC GTTCACCCTG CGTGAGGATG TGACGTTCCA CAGCGGAAAC TCGTTTACTG CCGAGGACGT AAAATTCTCC ACGGAGCGTT TTATCGATCT CGACCAGGGG TACGCCTCGC TGTTGAGCGG CGTCCTCGAC AAAGAAAACA TTACTGTCGA AGACGAGCAG ACGGTCACCT TCGAGTTGAA CCGGTCGTAC GCGCCGTTCT TGCCCATCAT GGTTCTCGTG TTCATGGTCG ACAAGGCGAC GATCATGGAC AATTTAGAAG ACGGCGAGTA CGGCGACCGT GGTGACTACG GGCAAGCGTA CATCAACAAC AACGACGCCG GATCCGGTGC ATACCAACTC GAAGACTTCT CGCGGGGGAA TTCCATCACC TTCGCCGCCT TCGACGACTA CTTTGGCGAG TTCCCCGACG GCTCCTTCGA TACGGTCGAA GTCCAGATCA TTACTGAAAA TTCGACGGTT CGAACGCTGA TGCGAAACGG CGATCTGGAT ATGAGTGGTC AGTACCAGAA CTCCCAGACG TACCAAGCCA TCGACGAATC GGACAACGCA CGTGTTGAAG AAATACCGAC GTTCGGCCTA CTGTACAACA AAATCAACAC CCAGAAAGCT CCGACGGATG ACCGCGCCGT GCGCGAGGCG ATCGCGTGGG GATTCGACTA CGAGCAGGTT GTCAATACGA TTCGGCCGAA AATGAATCGC GCACAGGGAC CGCTGCCGCC GACGTGGGGC GAACACGACG AAGACGTTTT GCAACCGTCC TACGACCCGG ATCGAGCGAG ACGGGTCCTC GAAGACGCTG GCTACTCGGA GGGCGAACTC ACCATCACGA ACACGTACAC CGAGTCATAC GCTTTCCAAG AGCAGATTGC CCTGCTCTTC CAGGATAACA TGGCAGATAT CGGGATTAAC GTTGAGCTGA ATCCTCAGAC GTGGGGGACG ATCACAGAGT TGGCTACGTC ACCGGAAGAC ACCCCACATA CGAGCCAAGT GTTCTACGTT CCGACCTATC CCTCGCCGGA TTCGATGTTC TACAACCAGT TCCACTCGGA GGCCGCAAAC ACGTGGATGA GCATGGAACA TCTCGACAAC GACGAGGTCG ACGCCCTCAT CGATGAGGCA CGCCAGACGC CGGACCCGGA AGCCCGTGCC GAGATCTACC GGGAACTGCA GAATACCCTA GCTGATCTCT ACTGTGATAT GCACCTCTAC CACACGGTCA AAACAATCGG CTTCCAGAAC GACGTTGAGG GGCTCACCCT CCGGCCAGCA CAAGGCTTCG AATACACGTT CCGGGATCTC CACCAAGTCT AA
|
Protein sequence | MNDTTEPTEE VAGEVSVTDL LTNKSRRRFL SGLGAVGAAG LAGCTGSGGS EDTTTSSDGT TSSSDSTLTA NVSQRIGTID PAKGTDYVQA MVLVNLYDPL VFPNSEGEIQ PNLASDWTVS EDSTTYTFTL REDVTFHSGN SFTAEDVKFS TERFIDLDQG YASLLSGVLD KENITVEDEQ TVTFELNRSY APFLPIMVLV FMVDKATIMD NLEDGEYGDR GDYGQAYINN NDAGSGAYQL EDFSRGNSIT FAAFDDYFGE FPDGSFDTVE VQIITENSTV RTLMRNGDLD MSGQYQNSQT YQAIDESDNA RVEEIPTFGL LYNKINTQKA PTDDRAVREA IAWGFDYEQV VNTIRPKMNR AQGPLPPTWG EHDEDVLQPS YDPDRARRVL EDAGYSEGEL TITNTYTESY AFQEQIALLF QDNMADIGIN VELNPQTWGT ITELATSPED TPHTSQVFYV PTYPSPDSMF YNQFHSEAAN TWMSMEHLDN DEVDALIDEA RQTPDPEARA EIYRELQNTL ADLYCDMHLY HTVKTIGFQN DVEGLTLRPA QGFEYTFRDL HQV
|
| |