Gene Information |
Locus tag | Hlac_0133 |
Symbol | |
ID | 7401654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 140749 |
End bp | 142743 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643707197 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002564809 |
Protein GI | 222478572 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
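The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 140749
END_BP = 142743
GENE_LENGTH_BP = 1995
PROTEIN_LENGTH_AA = 664


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record: 142743 - 140749 + 1 = 1995 and 1995 / 3 - 1 = 664.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```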
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCGGC AGATCAGTCG GCGCGGCGCG CTCGCGGGAG GGTTCGCGGC CCTGAGCGCC GGCTGTCTCG GCCGCACACG GAACATCGCG GGACGCGACC GGTCCTCGCA GCTAACTCTT GAGATCAACG CCCCGCCCGC CGACAGGGAC CCCAACGCGA TTCGGATCGC CAGACACCTC GCGGAGAACC TGAACGCAGT GGGCATCGAC GCCCGGATCA GCACCCTCGG GCACACCGAC CTCCGGCGGA AGGTGCTCAT CAACCACAAC TTCGACGTGT ACGTCGGGCA GTTCCGGGAG GCGGAGCCGT TCGACCCGGA CGCGATGTAC GCGTTCACTC ACTCCCAGTT CGTCGCGGAG TCGGGGTGGC AGAACCCGTT CGGATTCACC GACATCAACG GCGTCGACGA GTTGCTCGCG ACCCAGCGCC GAGCGGACGG CGACGAACGA CGCGAGGCCG TGGCGGAGCT CCAGCGAACC CTCGGTGAGC TGCAGCCGTT CACCGTCGTT GCGTTCCCCG ACCCGCTCAT CGCGGTCCGC GAGGACCGCT TCGAGAACTG GACGAACCAT CAGCCGCTGT CAGTCGGTGG GTTGCTCGGC TTGGAGCGCT CCGCCGCGGC GGACGGGGAC GCCGACGGGA GCGGGGCCAA GATCGAGGAC GGCGGGACGG AAGCCGGGAA CGGGACTGCC GACGGCAACG AGACCGCCGA CGGCGACGAG ATCGATGGTA ATGAGACCGC TGTCAACGAG ACCGACGACG GGCTGATCGA CGACAACCCG CTCACGGACG ACGACGATGG TGCGGACGAT GCCGCCACCC TCCGGCTCGT GACGACCGAC GAGCGCATCA CGCAGAACTG GAACCCGATC GCCGCCGAGT ACCGGCGCTA CGGCACGTTC ACCTCCCTGC TGTACGACCG ACTCGTGCTG GTCGACGACG GGGAGGTGAT CCCGTGGCTC GCCGCCGACT GGGAACAGGT CGGCGACGCG GGGGTCAAGA TCTCGCTGCG AGAGGCCGAC TGGCACGACG GGGAACCGGT GACGGCCGAA GACGTCGCGT TCACCTACGA GTTCCTCCAA GACACCTCGC TGGGAACGAC CGAGTCGCCC GTGCCGACAC CGACCTTCCG CGGTCGGGTG TCCGCCGTCG AGACGGCGAC CGCCATCGAC GAGACGACGG TCCGGCTGAC GCTCGACGGC GTCAACGACG CGGTCGGCAT GCGCGCGCTC CAGGTACCGA TCCTCCCGAA GCACGTTTGG GGGGAGCGGA CCGATATGGC GACGATCGCC GGATTCGAGT TCGACGCGGA GACGACCGAG GCCGTGGTGA CGAACAACGA GAATCCGATC GGGAGCGGTC CGGTGCGCTT CGTCGAGGCC ACTCCCGAAG AGTCGGTCGT CTTCGAGCGC AACCCGGACC ACTTTCTCGT GCGTGCGGCA GACGGGGGGG AATCGGCCGG TGACGCGACG GACCCGCTCA CGGAGATCTC CGAGCGATTT CACGGGAAGC CGGCGTTCGG CCGCCTTGAG ATCGAGGTAA TGGGGTCAGA CATCGCCGCG GTGCAGGCGG TCGGAGACGG CTTCGCGGAC GCGACAGTCT CGAACCTCGG CCCGGACTCT GTCCCGCGGA TCGGGCGCGA AGCCGACGCT CGGCTCGTGA CAGGGCGATC CGGCGGGTTC TACCACATCG GGTACAACAC TCGCCGGGCA CCGCTGTCGA ACCCCCGATT CCGCAGGGTC CTCGCGTCGC TGATCGACAA GCAGACCCTC GTCGACGTTG CGTTCGACGG GTACGCCGAA 
CCCGCGGCTT CACCGCTCGC CGCCACCCCG GAGTGGGTGC CCTCGGACCT CCGCTGGGAG GACCGCGAGA CGGACCCAGT CCATCCGTTT GTCGGCGAGT CGGGATCGCT CGACTCCGAG ACGGCCCGTA ATCGACTCCG CGAGGTGGGG TACCGGTTCG ACGAGGAGGG ACGGCTGCTC GCACCGAACA CATGA
|
Protein sequence | MTRQISRRGA LAGGFAALSA GCLGRTRNIA GRDRSSQLTL EINAPPADRD PNAIRIARHL AENLNAVGID ARISTLGHTD LRRKVLINHN FDVYVGQFRE AEPFDPDAMY AFTHSQFVAE SGWQNPFGFT DINGVDELLA TQRRADGDER REAVAELQRT LGELQPFTVV AFPDPLIAVR EDRFENWTNH QPLSVGGLLG LERSAAADGD ADGSGAKIED GGTEAGNGTA DGNETADGDE IDGNETAVNE TDDGLIDDNP LTDDDDGADD AATLRLVTTD ERITQNWNPI AAEYRRYGTF TSLLYDRLVL VDDGEVIPWL AADWEQVGDA GVKISLREAD WHDGEPVTAE DVAFTYEFLQ DTSLGTTESP VPTPTFRGRV SAVETATAID ETTVRLTLDG VNDAVGMRAL QVPILPKHVW GERTDMATIA GFEFDAETTE AVVTNNENPI GSGPVRFVEA TPEESVVFER NPDHFLVRAA DGGESAGDAT DPLTEISERF HGKPAFGRLE IEVMGSDIAA VQAVGDGFAD ATVSNLGPDS VPRIGREADA RLVTGRSGGF YHIGYNTRRA PLSNPRFRRV LASLIDKQTL VDVAFDGYAE PAASPLAATP EWVPSDLRWE DRETDPVHPF VGESGSLDSE TARNRLREVG YRFDEEGRLL APNT
|
| |
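The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and the sample string is only the first two blocks of the gene sequence, so its result is not expected to match the 68% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence shown above.
sample = "ATGACTCGGC AGATCAGTCG"
print(len(clean_sequence(sample)))
print(gc_fraction(sample))
```

Pasting the full gene sequence into `gc_fraction` gives a value to compare against the GC content field of the record.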