Gene Information |
Locus tag | Hlac_1899 |
Symbol | |
ID | 7399770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1900541 |
End bp | 1901422 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643708970 |
Product | carotene biosynthesis associated membrane protein |
Protein accession | YP_002566547 |
Protein GI | 222480310 |
COG category | [S] Function unknown |
COG ID | [COG2324] Predicted membrane protein |
TIGRFAM ID | [TIGR03460] carotene biosynthesis associated membrane protein |
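The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the database record: the variable names are illustrative, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in Protein Length.

```python
# Values copied from the Gene Information fields above.
start_bp = 1900541
end_bp = 1901422

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length, remainder)  # prints: 882 293 0
```

Under these assumptions the result agrees with the Gene Length (882 bp) and Protein Length (293 aa) fields, and the zero remainder shows that the gene length is a whole number of codons.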
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGACC CCGCCGAATC CGCCGCAACG TCGGGACGAC TCGGCGCGCT CCGCGCCCGA CTTCCCGACA CGCGCCGCGA GGCGGAGCGT CTGCTCGACA GAATCGTCCG CGAGAACCGG TTTACGATTG CCGTGCTGTT CCCCCTCGTC GGTGCGATCG CACTCGTCGG CAGCGCGGAG GGGTGGGTGC CCGAGCCGCT CGCGTTTAAC CCGTGGTTCG TGCTGTTCGG TGTGCTGGTG ATGCGGTCGC CGCTCGTCGT CGGGGTGTTG CCCGCGCTCG ACCGCCGCGC GGTCGGGTGG CTCGGGGTGC TCGTCGCGTA CACCTACGCG ATCGAGCTGT TCGGCGTCGC CACCGGGTGG CCCTACGGCA CCTTCGAGTA CACCGTGAGC CTCGGGCCGA TGCTCGGCGG GGTGCCGCTG GCGCTCCCCG TCTTTTTCAT TCCGCTCGTG GTGAACGCCT ACCTGCTCTG TCTGCTGCTT CTCGGCCCGC GGGCGTCGAA CGGGTGGCTC CGGCTCGCGA CCGTGATCGC CGCGGTGGTC GCGATGGATG TGGTGCTCGA CCCCGGCGCC GTCGCGCTCG GCTTTTGGAG CTTCGGCGGC GGCGCCTTCT TCGGTGTTCC CCTCTCGAAC TACGCCGGCT GGGTGTTGTC GGCGACGGTC GCGGTGGTCA CGCTCGACCG CGCGTTCGAA CTTGGGGCGC TCGGCGATCG ACTGCGCGAC TGCGAATTCA TGCTCGACGA CATGGTGAGC TTCGTGATCC TCTGGGGCGG AATCAACCTC TGGTACGGGA ATCTACTCCC CGCCGCCGTC GCGGTCGCAT TCGGCGTCGG GCTCGTCCGC GCCGACCGAT TCGATGCGAC CCTGTTCACG CAGTGGCGGT GA
|
Protein sequence | MSDPAESAAT SGRLGALRAR LPDTRREAER LLDRIVRENR FTIAVLFPLV GAIALVGSAE GWVPEPLAFN PWFVLFGVLV MRSPLVVGVL PALDRRAVGW LGVLVAYTYA IELFGVATGW PYGTFEYTVS LGPMLGGVPL ALPVFFIPLV VNAYLLCLLL LGPRASNGWL RLATVIAAVV AMDVVLDPGA VALGFWSFGG GAFFGVPLSN YAGWVLSATV AVVTLDRAFE LGALGDRLRD CEFMLDDMVS FVILWGGINL WYGNLLPAAV AVAFGVGLVR ADRFDATLFT QWR
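The GC content field can in principle be recomputed from the gene sequence. The sketch below is illustrative only: `gc_percent` is a hypothetical helper, not a function of the database or of any library, and it assumes the sequence is pasted as printed above, in space-separated blocks of 10 bases.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace is ignored, because the record prints the sequence in
    space-separated blocks.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First three blocks of the gene sequence above, used only as a short demo.
# Paste the full sequence to compare against the GC content field.
demo = "GTGAGCGACC CCGCCGAATC CGCCGCAACG"
print(round(gc_percent(demo), 1))
```

The demo covers only the first 30 bases, so its value is not expected to match the 69% reported for the whole gene.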