Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2520 |
Symbol | |
ID | 7401572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2498240 |
End bp | 2499598 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643709592 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002567163 |
Protein GI | 222480926 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.420378 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACACG ACAGCCGATC CGACGACAAC AACCGCCTTA CGCGTCGCCA CTACGTGGCC GGCGCCGGTA CCCTCGCAAC CGTGGGGATC GCCGGCTGTT CCGGCGGTGG CGACGGCAGT GACGGTTCCG ACGGCAGTGA CGGCAGTGAC GGTTCCGACG GCTCCGACGG GAGCGACGGC GGCTCCGGTG AGCCTGTCGA GGTACTCCAC GGCTGGACCG GCGGCGACGG CGCCGCCGCG GCCGAGGCAT TGGAGGCAGC GTTCAACGAG GTTCACCCCG ATGTCGGCCT CGAGATGAAC CCCATCGGCG GCGGCGGCAA CCAGAACCTC GACGCAGTCG TCGCGAACCG ACTCCAGAGT GACGATCCGC CGGGATCGTT CGCCGGCTGG CCCGGTCCGA ACCTCCTCCG CTATCAGGGG GTCCTCGGCA GCGTCGACGA CGTTTGGGAG GAGAACGGCT TCGAGGACGC GATGGTCGAG GAGGCGGTCG AGCTCCACAA GCAGAACGGC AGCTACCGCG CCGTGCCGCT CGGCTCCCAC CGACTGAACT GCCTCTTCTA CAACGTCTCC GTCGTCGAGG ACGCGGGGAT CGATGTCGAC TCGCTGAATA GCCCCTCGGC ACTCATCGAC GCCTTCGAGA CGGTTTCCAG CGAGACCGAC GCGATTCCGA TGACCCACGG GATGTCCGGG ACGTGGACGA CGACGCAGCT GTGGGGCGCC GTGATGCTCG GCGTCAACGG CTACCAGCCG TATATGGACT TCCTCGAGGG TAACGGCGAC GAGAGCGCCG TCCGGTCGGC GTTCGAGACG ACCGCAGAGA TGCTGGAGAA CCATATCAGT GACGACGCGG CTTCGATCGG TCTGACCCAG TCGAACCAGA ACATCATCAA CGGCGACGCC GCGTTTATCC ACCAAGGTAA CTGGGCGGCG GGCGCGTTCC GGAACGCCGA GAACTTCGAG TACGGAGACG ACTGGGGCTT CAAGACGTTC CCCGGGACCG AGGGAATGTA CACCCTCCAC TTCGACTCGT TCCTCTACCC GTCGGACAAC CCGACGCCCG AGGCCTCGAA GACCTGGGAG GCGTTCGCCG GCAGCCCGGA GGCCCAGATC GCGTTCAACC AGTACAAGGG CTCGATCCCG ACTCGGACCG ACGTGAGCAT GGAGGAGTTC GGTCCGTACC TCCAGGAGAC GGCCGAGGAC TTCGCGAACG CGGAGTACCG ACCGCCGAAC CTTCAGCACG GCCTCGGCGT CCCCTCCGAG ACGATGACGG CGCTCAACGA CGTCATCTCT TCGGAGTTCA CGGGACCGTA CAACGTCGAC GCCGCGACGC AGGGCTTCCT GAACGCGGTG TCGAACTGA
|
Protein sequence | MSHDSRSDDN NRLTRRHYVA GAGTLATVGI AGCSGGGDGS DGSDGSDGSD GSDGSDGSDG GSGEPVEVLH GWTGGDGAAA AEALEAAFNE VHPDVGLEMN PIGGGGNQNL DAVVANRLQS DDPPGSFAGW PGPNLLRYQG VLGSVDDVWE ENGFEDAMVE EAVELHKQNG SYRAVPLGSH RLNCLFYNVS VVEDAGIDVD SLNSPSALID AFETVSSETD AIPMTHGMSG TWTTTQLWGA VMLGVNGYQP YMDFLEGNGD ESAVRSAFET TAEMLENHIS DDAASIGLTQ SNQNIINGDA AFIHQGNWAA GAFRNAENFE YGDDWGFKTF PGTEGMYTLH FDSFLYPSDN PTPEASKTWE AFAGSPEAQI AFNQYKGSIP TRTDVSMEEF GPYLQETAED FANAEYRPPN LQHGLGVPSE TMTALNDVIS SEFTGPYNVD AATQGFLNAV SN
|
| |