Gene Information |
Locus tag | Hlac_0632 |
Symbol | |
ID | 7401767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 651759 |
End bp | 653063 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643707698 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002565304 |
Protein GI | 222479067 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
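The coordinate and length fields above are related by simple arithmetic. The short sketch below checks this, assuming that Start bp and End bp are inclusive 1-based positions and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 651759
end_bp = 653063

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends in one
# stop codon, so the protein has one residue per codon minus the stop.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1305 bp
print(protein_length, "aa")   # 434 aa
```

Both results agree with the Gene Length and Protein Length fields in the table.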
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.708501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAGA AACGCAGAAC GCTTCTGAAG ACGATGGGGG GATCGACCGC GCTCGCTGCG CTCGCGGGTT GTATCAGCAC CGGCGGCGAC GGCGGCGACG GCGGCGACGG CTCCGACGGG TCAGACGGTT CCAATGGTTC GGACGGTTCC GACGGCTCCG ACGGTTCCGA CAGCGGAACG ACGGGCACCA CGACGCTGTG GGCCGACCTC TCCGCCGCCG AGGACGAGGC CATGTCCGGC TACATCGACG AGTACGAGTC GGATTCGGGC GACACCATCA ACAAGGAGGC GCCCGGCGGA GAACTCGACC AGCAGCTCGA GACGGCGATT CCGGCCGGCG ACGGCCCCGA ATCGTGGATC TGGGCGCACG ACTGGGTCGG CCGGTTTGCA GTCCGCGAGG AACCGCCGTT CCTGTACGAC GCGAGTGACG ATGTCGACGT CTCGCTCGAC AGCTACACCG AGACCGCCCG GCAGGCCGCC CAGTTCGACG GCGCCCTCCA CGGGCTCCCG TTCGCCTCCG AGACCGTCGC GCTGTTCTAC AACGAGGACA TGGTCGACGA GCCGCCGGAG ACGATGGAAG AGATGGTCTC GATCATGGAC GACCACCACG ACCCGGCCAA CGGGCAGTAC GGGCTCTCGT ACCCCGTGAC GGACCCCTAC TTCGTCAGCG GGTTCATCCA GGCGTACGGC GGGGACATCT TCGATGAGGA GAACCTCAAG GTGACCGTCG ACAGCGACGC GTGTAAGCAG GGCATAGACG CCCTCGAGAC GCTGTCCGAC TACGTTCCGT CCGACCCCGG CTACGAGTCG CAGATCGTCG CGTTCGCGGA CGGGCTCGCG CCGTTCGCGA TCAACGGCCC GTGGGAACTC GGCAACCTTC AGGACGAAAT CGACAACCTC GGCGTCACGA CGCTGCCGAC CGTCGACGGG AACAACCCGC GTACGTACTC CGGGATCCAG CTGTTCTACT TCAGCTCGAT GCTGGCGGAC GCCGACCAAT CGACGGTCGA CGCCACGACC GGGCTCGCCG AGTGGTACAC GACCAACGAG GACATCGTCC TGAGCAACGC CGACGAACAG GGGTATATCC CCGTCCTCAC GAACGTCGTC GACAACGACG ACCTCTCCAG CGAGGTTCAG GCGTTCGCCC AACAGGTCGA TCACGGTGTC CCCATCCCGA CACACCCCGA CATGGACAGC GTCTGGACGC CCGTAACGGA CGCGTTAGAG CGCGTCTTCA ACGACGAGCA GGACAGCGAC GCGGCGCTCG ACCAGGCCGC CTCCGAGATC CGGGAGGCGC TGTAG
|
Protein sequence | MNEKRRTLLK TMGGSTALAA LAGCISTGGD GGDGGDGSDG SDGSNGSDGS DGSDGSDSGT TGTTTLWADL SAAEDEAMSG YIDEYESDSG DTINKEAPGG ELDQQLETAI PAGDGPESWI WAHDWVGRFA VREEPPFLYD ASDDVDVSLD SYTETARQAA QFDGALHGLP FASETVALFY NEDMVDEPPE TMEEMVSIMD DHHDPANGQY GLSYPVTDPY FVSGFIQAYG GDIFDEENLK VTVDSDACKQ GIDALETLSD YVPSDPGYES QIVAFADGLA PFAINGPWEL GNLQDEIDNL GVTTLPTVDG NNPRTYSGIQ LFYFSSMLAD ADQSTVDATT GLAEWYTTNE DIVLSNADEQ GYIPVLTNVV DNDDLSSEVQ AFAQQVDHGV PIPTHPDMDS VWTPVTDALE RVFNDEQDSD AALDQAASEI REAL
|
| |
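The gene sequence is listed in coding orientation (it begins with ATG and ends with TAG), so it can be translated directly even though the gene lies on the minus strand. The sketch below is a minimal illustration rather than the database's own method: it builds a codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, not in codon meanings), and translates the first three 10-base blocks of the gene sequence. The function name `translate` and the constants are hypothetical helpers introduced for this example.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments (it differs only in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks used in the record can be
    pasted in unchanged. Any trailing partial codon is skipped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the Gene sequence field above (30 bases, 10 codons).
first_blocks = "ATGAACGAGA AACGCAGAAC GCTTCTGAAG"
print(translate(first_blocks))  # MNEKRRTLLK
```

The output matches the first block of the Protein sequence field. Passing the full Gene sequence field to the same function would be the natural way to check the whole record.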