Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0069 |
Symbol | |
ID | 7401424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 72570 |
End bp | 74471 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643707130 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002564745 |
Protein GI | 222478508 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.884442 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATA AAGACGAACT CAGGCGGCGT CAGTTCCTCC GAGTGACCGG AGCGTCGGCA CTCACCGCCG GCATCGCAGG TTGTTCCGGA GACAGCGGCG GCGAACCCAG CGATGGGGGC GACGGCTCCG ACGGCGAGGA CGGATCCGAC AGCGAAGACG GATCCGACGG CGAGGACGGA TCCGACGGCG AGGACGGATC CGACCAGTTG ATGCCGTCGT CGTACCCGTA CGGGGCTAAC GAAAATCGGA TCAGTGAGGC TCGAGCGGTT ATGGAAGAGG CCGGATACGG CCCTGACAAC CGATTCAGCC TCGATTGGCT CCAGTACAAC TCCCCGGCGT GGGAGGAGAT TGCGAATACG CTTCGTGCCC GCCTCGAGTC GGTCCACGTC GATATGAACA TCAGCAAGGC CGACTTCGGC GCCCTCCTCG AGCGGACCGA AAAGGGCGAG ATGGACGCGT TCACGCTCGG CTGGATCGCG GATTACCCGG GTGTCAGGAA CTTCGCTCAG CTGGTCGATC CGGATAACAC GATCTACGAC GCCGAAGGGG CTTCGCCGAA CGGTGCGCGG CTGTTCTGGA GCGAAGATTC CTACACCGAC CCCGAAGTCC GCAGCGCGAT GTCAGAGGCG TTCGCGCAGC TTTCGGAGAA TCCGGGCAAC AAGGACGAGG CGGAGAGCGC ACGCGCGGAG GCGACGCTTC GGTTGGAGAA GCTCCTCTGG GAGTCCGCGG CGTTGCTCCC GGTGTATCAC AGCGTCGAAG ACGTGTTCTG GTACGACCGG GTCGACTACA ATCCGCCGGG CGGGATGGGA GTGTCCCGGG CGAAGACCAG CACGTCCGTC CAGGGACTCG AAGGCAGCGA CACGCTGAAG GGCACGTCCG CGACCTTCAA CGCACTCGAC CCGATCGCGT CGGGGAACAC GGCGAGTGGT TCGAAGGTCA TGGACATATT CGACGCACCG CTCAACTACG TCAACGGAAC GGTCGAGGTC GAGCCGCTTC TGATCGAGGA CTACACCACG AACGACGACC TCACCGAGTA CGAGTTCACG CTCAAGCAGG GCGTCCAGTT CCACGGGGAT TACGGCGAAC TGACGGCGGA CGATATGGTC TACTCGATCC GGCGTCTGGT CGAATCCTCG AACTCGACGA ACACGTACTT CCCGATCAGC GTCCTGAACA TCGACCGCGA GGAGGACGAG GATGGCAACG TCGTGCCCGG ATCGGTTGCC GTCGAGGCGA CCGGCGACTA CACCTTCAGC GTCACGCTCC GCAATTCGTT CGGCTACGCG CTCGAAGTGC TCAGTTACTC GGCGTTCTCG GCCGTCCCCG AGGGTATCGT GGGCGACGTC GAGGGATACG ACGGCGACAT GGATTACCAG CAGTTCTCGA CGAACCCGGT CGGCTGTGGT CCGTACGTCT TCGAGGAGTG GAACTCCGGT GTCGGCGGAG AGTTCCGAGC CTCGGCGTTC ACGGACTACC ACGGCGGAGA GCCGGCCGCC GCGAACATCC AGGACGCGAT CCTCAGCGAG CCGAACGCGA TATACAACCG GTTCCTCAAC GAGAACGCGG ACGTCAGTGC GATCCCGACC TCGCAGTTCG ATCCCGGACT GAGCGACCTG ACGAGTCAGG ACGGCGCCCA ACAGACCGGA ACGTACGGTC CGCTCGGGAA CGACCAGACG GTCAACATGT CGCGGACGCC GACGATCGAC ACGTTCTACA TCGCGTTCAA CATGGAGAAC GTCCCCAAGC CGGTCCGACA GGCGATGGCG TACGTGATGA CGGGCGACGA CTTCACCGAG AGCGTCTTCA AGGGTCGTGG CGAGTCCGCG TACCACCTCA CGCCGCCACA GATCTTCCCC GGCGGCGGTG AGGGGTACGC CGACCACTGG CAGGGCGAAT AA
|
Protein sequence | MSDKDELRRR QFLRVTGASA LTAGIAGCSG DSGGEPSDGG DGSDGEDGSD SEDGSDGEDG SDGEDGSDQL MPSSYPYGAN ENRISEARAV MEEAGYGPDN RFSLDWLQYN SPAWEEIANT LRARLESVHV DMNISKADFG ALLERTEKGE MDAFTLGWIA DYPGVRNFAQ LVDPDNTIYD AEGASPNGAR LFWSEDSYTD PEVRSAMSEA FAQLSENPGN KDEAESARAE ATLRLEKLLW ESAALLPVYH SVEDVFWYDR VDYNPPGGMG VSRAKTSTSV QGLEGSDTLK GTSATFNALD PIASGNTASG SKVMDIFDAP LNYVNGTVEV EPLLIEDYTT NDDLTEYEFT LKQGVQFHGD YGELTADDMV YSIRRLVESS NSTNTYFPIS VLNIDREEDE DGNVVPGSVA VEATGDYTFS VTLRNSFGYA LEVLSYSAFS AVPEGIVGDV EGYDGDMDYQ QFSTNPVGCG PYVFEEWNSG VGGEFRASAF TDYHGGEPAA ANIQDAILSE PNAIYNRFLN ENADVSAIPT SQFDPGLSDL TSQDGAQQTG TYGPLGNDQT VNMSRTPTID TFYIAFNMEN VPKPVRQAMA YVMTGDDFTE SVFKGRGESA YHLTPPQIFP GGGEGYADHW QGE
|
| |