Gene Information |
Locus tag | Hlac_1481 |
Symbol | |
ID | 7400309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1490434 |
End bp | 1491951 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643708543 |
Product | protein of unknown function DUF790 |
Protein accession | YP_002566139 |
Protein GI | 222479902 |
COG category | [S] Function unknown |
COG ID | [COG3372] Uncharacterized conserved protein |
TIGRFAM ID | |
| |
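The coordinate and length fields in the Gene Information table are related by simple arithmetic: with 1-based, inclusive coordinates, the gene length is End minus Start plus 1, and for an unspliced CDS the protein length is the codon count minus the stop codon. The sketch below checks that the listed values agree. The function names are illustrative and not part of the record, and the inclusive-coordinate convention is an assumption (one that the listed values satisfy).

```python
# Values copied from the Gene Information table.
START_BP = 1490434
END_BP = 1491951


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1518, matches "Gene Length"
print(protein_length(length_bp))  # 505, matches "Protein Length"
```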
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTACGCA AAGACCTCCT GCGCGTCTCG CGGGCCGGCG GCGGGTACCG GCCGCGCTTT GCGACCCGCG AGCACCGTCC GCTCGCGGCG CGGGTGCTCG GGACCTTCGA GTCCCACGTC GGGGAGCGGC GCGGTGACCT CGACGACGCG CTGGCGGCGC TGGAGGCCGA CGCCGCCGAG TCCGGCGGCG ACTTCAAGCT CGTCCGCGGG CTCGCGGCCT TGATCGAGCG CGAGTGCGTC TTCGAGACGC GCGCGCCGGT CCCGCCCCAG CGCGTTCGGC GCGCGGCGTT CGAGGCCGCG GAGGCCGTCG GCGTCGCGAG CGAGACCGAG CGCGAAACCG CGATCGACCG CGCCGCCGAC GCGCTCGGGA TCGAGCCCGG GGACGTGGAG GCGTCGCTGT ACGCCGACCG CGACGTCAAC GAGATCCTCG TCGACGCCGA CGTGCGGTGG GACCCGGACT CCCTCCTCGA ACAGTACGAC CTCTCGCTCG CACAGACCGC GCTGTTCGAC GCCACCGAGG TCCGGATCCG ATCGAACGAC CCGAAACGGC TCGTCTCGGC GGTCAAGCGG CTCCGGCTGA TGTACGAGGT GGAGACAACC CCCGAGGGCC GAGAACTGGT CGTCACCGGG CCGGACGCGC TGTTCTCGCG CACCCGGCGC TACGGGACCG CGTTCGCGCG ACTGCTCCGG ACCGTCGCGA AGTCCGCGGA GTGGGAGCTG TCGGCGACGA TCGACGACCG CGGCCGCGAG CGCACCATGC GCCTCTCCGA CGGCGACGTG ACGGTTCCGG GCGTCGAACC GGTCGCGGAG CCGGACTTCG ACAGCGGCGT CGAGGCCGAC TTCGCCGGGC GGTTTCGCGG GCTCGACCTC GACTGGACCT TGGTGCGCGA GCCGGAAACC CTCGAAACCG GAAACCGGGT GATGATCCCC GACTTCGCGT TCGACTACGA CCACGCCGAC TTCCGGCTGT TCTTCGAGGT GATGGGGTTT TGGACCCCCG AGTACGTCGC GAAGAAACTC GGCCAACTCG CTGACGTGGA AGATGTGGAT CTGCTGGTCG CCGTCGACGA GAGCCTCGGC GTCGGTGAGG AGATCGCCGC GCGGGACCAC CGCGTGGTGA GCTACTCGGG GACCGTCCGC GTGAAAGCGA TCGTCGACGT GCTCCGCGAG TACGAGGCCG AGTTCGTCGC CGATGCGGCC GCCGACCTCC CCGAGTCGCT CTCGCCGGAC GCGGACGCGA TCGGGCTCGA CGAACTCGCG GACGAGCACG GCGTGAGCGT CGAGGCGATC GAGGACAAGT CGTTTCCGGA CCACGAACTG GTGGGGCGGA CCCTCGTGCG GCCTGCGGTA CTGGAGGCGG CGGGCGAGAA GCTTGAAGCC GGAATGGCGT TCTCGGAGGT TGAGTCAATA CTCGACGACT ATGCGATCGA CGACGCGAGC GCCGCCCTCT CGCGGCTCGG GTTCCGGGTC GAGTGGGAGG GGCTCGGCGG CGGGACGGTC CGAGAAAAAG GCGAGTAG
|
Protein sequence | MLRKDLLRVS RAGGGYRPRF ATREHRPLAA RVLGTFESHV GERRGDLDDA LAALEADAAE SGGDFKLVRG LAALIERECV FETRAPVPPQ RVRRAAFEAA EAVGVASETE RETAIDRAAD ALGIEPGDVE ASLYADRDVN EILVDADVRW DPDSLLEQYD LSLAQTALFD ATEVRIRSND PKRLVSAVKR LRLMYEVETT PEGRELVVTG PDALFSRTRR YGTAFARLLR TVAKSAEWEL SATIDDRGRE RTMRLSDGDV TVPGVEPVAE PDFDSGVEAD FAGRFRGLDL DWTLVREPET LETGNRVMIP DFAFDYDHAD FRLFFEVMGF WTPEYVAKKL GQLADVEDVD LLVAVDESLG VGEEIAARDH RVVSYSGTVR VKAIVDVLRE YEAEFVADAA ADLPESLSPD ADAIGLDELA DEHGVSVEAI EDKSFPDHEL VGRTLVRPAV LEAAGEKLEA GMAFSEVESI LDDYAIDDAS AALSRLGFRV EWEGLGGGTV REKGE
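The gene sequence starts with GTG rather than ATG, yet the protein sequence starts with M. Under translation table 11 (the table listed for this gene), GTG is an accepted alternative start codon and is read as methionine when it initiates translation. The sketch below translates only the first six codons and checks that the final codon is a stop. The codon map is deliberately partial (just the codons in this fragment), the start-codon set is a subset of those allowed by table 11, and the helper name is hypothetical.

```python
# First 18 nt and last 3 nt of the gene sequence (block spacing removed).
GENE_START = "GTGTTACGCAAAGACCTC"
GENE_END = "TAG"

# Partial codon map: only the codons that occur in GENE_START.
CODONS = {"GTG": "V", "TTA": "L", "CGC": "R", "AAA": "K", "GAC": "D", "CTC": "L"}
# Subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def translate_fragment(dna: str) -> str:
    """Translate a 5' CDS fragment; an initiating start codon is read as M."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # start codon codes for Met at initiation
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


print(translate_fragment(GENE_START))  # MLRKDL, the first six residues listed
print(GENE_END in STOP_CODONS)         # True
```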