Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0252 |
Symbol | |
ID | 7401178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 272588 |
End bp | 274228 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643707315 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002564927 |
Protein GI | 222478690 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACGCG ATCACTCGAA CGTGAACCGC CGGACGTACC TCACGTACGT CGGAGGAACT GCGGCCACCG TCGGGTTGGC CGGCTGTTCG GACAACGGCG GCTCGGGCGA GGACAACGAA AACGGCAGTG ACGGCAGCGA CGGCAGCGAC GGCAGCGATG GCGGGGACGA AGAGCAGCTT CCGGAGCCGG AGACCCGCGA AGAGCACCTC CAGCGCGCGA ACCTCCGGCT CAACCAGCGC GCCCCGTGGA TCTTCCTCAA CCGCCAGTAC AGCGTGTACG GCATCGCGAG CCGACTCGGC TGGGACGCTC GCCGCGACGA GCGGATCGAA GCGCAAGCCA TCTCGGTGAC GGAGGGCGAG CCGTCGGTCG CGATCACCCA GTCGTCGATG GATTCGGGGC TCGACCCCCA CGACCACCGT GAGACGCCGA CCGACAACAT CGTCGTGCAG GCCTACGACG GCGTGCTCGG TCGCAACGCC GACGGCGACA TCATCGACGC GCTCGCGACG GACTACGAGC GGCTCGAAGA CGGTCGCGTC CGATTCGAGA TCCGCGACGG CGTCACCTTC CACAACGGCG ACGAGCTCCA GCCGTCGGAC ATCGCCTACA GCGTCAACCG GGTCGTCGAC CCCGAGGTCG GAATCTCGAG CCCGCAAAAC GACCAGCTCG CCGGCGTCAC GGGCGCCGAG GTCGTCGACG GCGGGGTCGA GGTGACCTCC GACGGGATCA ACCCGATCGT GTTCTCGCTT TTCGCCTCGT ACTGTAAGGT CGTTCAGCAG GACTGGATCG AGTCGCGCGA CACCTCGGCG ATAAACTCCG ACATGAACGG GACCGGCCCG TTCCAAGTCG TCGAGTACGA GCAGGACGTC GAGATCGTCT ACGAGCCCTA CGAGGGGTAC TGGGGCGACG CGCCGGAGAT CGAAGAGCTG ACGATCCGGT CGGCCAGCGA GGCGAGCACG CGCGTCTCGC AGCTGCTGGC CGGCGAGACG GACCTCATCG TCAACGTGCC GCCGCAGGAA GTGAGCCGCG TCCGCGACGA GGACACCACG GAAGTCACAG CGGTGCCGAG CACTCGTGTC GTGTTCAACG CCATGCGGTA CGACGTAGAG CCGTTCTCCA GCGTGGAGTT CCGGCAGGCG ATGAACTACG CCATCGACTT AGACAGCATC ATCGAGAACA TCCTGCAAGG GTTCGCCGAC GCGACCGGCC AGCCGACCCT CGAAGGGTTC GTCGGCTACA ACGAGGAGAT CGATCCGTAC CCGCAAGACA TCGAGCAAGC CGAACAGCTC GTCGAGGACT CCGGTCACGC CGGCGCCGAG ATCACCCTCG AAACGCCCGT GGGACGGTAC CTCCGCGACG TGGAGATCGC ACAGGCGGTC GCTAGCCAGA TCGACGAGCT CTCGAACGTC TCCTGTGAGG TCGAGCAGCG CGACTTCGCC TCGCTCGCGG GCGAGGTGAC GAGTGGTGAT ATCGAAAACA TGCCCCACTT CTACCTGCTC GGCTGGGGGA ACACGACGTT CGACGCCAGT CAGACGATCA TCCCGCTCCT CACGTCCGAC GGGGCGCTCT CCAGCTATCA GGGCGACGAC GAGGTCGACG AGCTCATGTC CGAGTCCCAG AACCTGCCGG GCGGAAACTA A
|
Protein sequence | MSRDHSNVNR RTYLTYVGGT AATVGLAGCS DNGGSGEDNE NGSDGSDGSD GSDGGDEEQL PEPETREEHL QRANLRLNQR APWIFLNRQY SVYGIASRLG WDARRDERIE AQAISVTEGE PSVAITQSSM DSGLDPHDHR ETPTDNIVVQ AYDGVLGRNA DGDIIDALAT DYERLEDGRV RFEIRDGVTF HNGDELQPSD IAYSVNRVVD PEVGISSPQN DQLAGVTGAE VVDGGVEVTS DGINPIVFSL FASYCKVVQQ DWIESRDTSA INSDMNGTGP FQVVEYEQDV EIVYEPYEGY WGDAPEIEEL TIRSASEAST RVSQLLAGET DLIVNVPPQE VSRVRDEDTT EVTAVPSTRV VFNAMRYDVE PFSSVEFRQA MNYAIDLDSI IENILQGFAD ATGQPTLEGF VGYNEEIDPY PQDIEQAEQL VEDSGHAGAE ITLETPVGRY LRDVEIAQAV ASQIDELSNV SCEVEQRDFA SLAGEVTSGD IENMPHFYLL GWGNTTFDAS QTIIPLLTSD GALSSYQGDD EVDELMSESQ NLPGGN
|
| |