Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1225 |
Symbol | |
ID | 7399493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1235253 |
End bp | 1236173 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643708290 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002565888 |
Protein GI | 222479651 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0725] ABC-type molybdate transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000686455 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAATCGC GTTCCCGACG GTCGGTCCTC GCGGCGCTCT CTGCCGGGAG TGGACTCGGG ATCGCCGGCT GTCTCGGCGA CGACGAATCC GTCTCGGTCC TCGCCGCCGG GAGCCTCGCG GTCGTCCTCG ACGATCACGT CGGGGGGCAG TTCGAAGCCG AGACGGGGAT CGCCTGCCAC GCGGAATACT ACGGCACAAA CGCGGTGATG CGGATGGTAA GCGACGGCCG GAAGTACCCC GATGTTGTCG TGAGCGCGGA CGCCGGCCTC TTGCGCGACC GGCTCTACGA CACGCATACC ACGTGGGATG TCTCGATCGC GTCCAACGCC GTTGGGATCG CCTACGCCTC CGACACGCGG TTCGGGGAAC GCCTCGAGGC GGGCGATCCG TGGTACGAAG TCGCCCGTGA TGCCGACCCC GGTGCCCTCG CGATAAGCGA CCCCGACCTC GACCCCCTCG GATACCGTGC CATCCACGCG TTTCGACTGG CAGAGCGGGA ACACGGACTC GACGGCTTCG CCGAGGCCGT CACCGACGCC GCCTATCGGG AACCGCAGGA GCCGCAGTTG CTCGCCGGCG TCGAAACGGG AAACCGCGCC GCCGCGGTCG TCTACCGGAA CATGGCCGCG GACCACGGGC TTCCGTTTCA CCCGTTCCCG GAGGCGTACG ATTTCTCGAA CCCGGAGTAC GCCGATCGCT ACGCCGAGGC CTCGTACACG ACGGACGGGG GGTACACGGC GACCGGTGCG CCGATCGTGT ACAACGCGAC GGCCCTCGAG AGCGCAGATT CACCGGACGC CGGACGCAAG TTCGTCCGGT TCCTCGCGAA CGCCAGCGAT CTCCTCCGCG AGAACGGGTT CGAGACGGCG GGATTCCCGA GGACTCACGG CGACGTTCCC GCCGAGGTGA CGGACGGATG A
|
Protein sequence | MQSRSRRSVL AALSAGSGLG IAGCLGDDES VSVLAAGSLA VVLDDHVGGQ FEAETGIACH AEYYGTNAVM RMVSDGRKYP DVVVSADAGL LRDRLYDTHT TWDVSIASNA VGIAYASDTR FGERLEAGDP WYEVARDADP GALAISDPDL DPLGYRAIHA FRLAEREHGL DGFAEAVTDA AYREPQEPQL LAGVETGNRA AAVVYRNMAA DHGLPFHPFP EAYDFSNPEY ADRYAEASYT TDGGYTATGA PIVYNATALE SADSPDAGRK FVRFLANASD LLRENGFETA GFPRTHGDVP AEVTDG
|
| |