Gene Information |
Locus tag | Hlac_1538 |
Symbol | |
ID | 7401468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1560335 |
End bp | 1561555 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643708604 |
Product | protein of unknown function DUF405 |
Protein accession | YP_002566196 |
Protein GI | 222479959 |
COG category | [S] Function unknown |
COG ID | [COG2311] Predicted membrane protein |
TIGRFAM ID | |
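The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1560335, 1561555)   # Start bp, End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, "bp ->", length_aa, "aa")
```

With the values in this record, this gives 1221 bp and 406 aa, matching the Gene Length and Protein Length fields.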
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACCCGCG ACGCCGGCCC GACGCCGCCC TCCGAACGCA TCGTCGCCCT CGACGCCCTC CGCGGGGTCG CCCTGCTCGG CATCCTCCTG ATCAACGTCT GGGCGTTCGC GATGCCCGAG ACGACGCTGT TTAATCCGAC GGTGTACGCC GACACCACCG TTTACGGCGA CTTCACGGGG GCGAACTACT GGGCGTGGGC GTTCAGCCAC GTGTTCGCAC AGAACAAGTT CATCACGCTG TTCTCGGCGC TGTTCGGTGC GGGAATCCTA CTGTTCATCG AGAGCAAAGA GGAGAAGGGG CAAGACGCGG TGCGGCTCCA CTATCGCCGG ACCGCGATTC TCATCGCGAT CGGGCTCATG CACGCGTATC TGCTGTGGTA CGGCGACATC CTCGTCGCGT ACGGGGTGAC CGCGCTGGTC GTCGTCGCGT TCCGGAATCT CGAAGCCCGG AAGCTCGCTG GGGTCGGCGT GGTCTTCCTG CTGTTCCTCC CCGTGGTCGA ACTGTTCGCC GCGATCACTC TCGGCGGCGA CGCGATCGCA TCGCAGTGGG CACCGGCGGA AGCCGCGATC GAACAGCAGG TCGCGACGTA CCGCGGCGGC TGGCTCGAAC AGCTCGACCA CCGGGTCCCA TCCTCGTTCA GCCGACAGAC GACCGGCTAC ATCAACGGGC CATTCTGGCA GGTCGGTGGC ACCATGCTCC TCGGGATGGC GCTGTACCGC TGGGGCGTGT TGACCGGCGA GCGGTCGTCG GCCCTGTACC GCCGGCTCGT TGCGCTCGGT GTTGTCGGCC TCGCGATCAC CGTCGCCGGC GTCGTCTACA TCGAGGCCAA CGACTGGAGC GCCGGCGCCG CGCTGTACTG GCGGCAGTTC ATCTACGTCG GCAGCTTCCC CCTCGCCGGC GGCTACCTCG GGATCGTGAT GCTGTACGCC CGCCGGCGCC CGGACGGCCC CGTGACTCGC GGCCTCGCCG CGGTCGGGCG GACCGCGTTC ACGAACTACC TCCTGCAGAC GGTGATCGCG ACCACCGTCT TCTACGGCCA CGGCCTCGGG CTGTTCGGCT CCGTTACCCG CGTCGAGCAG CTCGGGTTCG TTCTCGTCGT TTGGGTGGTG CAGATCGTTT TGTCAGTCCT GTGGCTGCGA TCTTTCCGGT TCGGTCCCGT CGAGTGGATC TGGCGGACGC TTACGTACGG GGAGCGACAG CCGATACGAA ACCCGGAGTA G
Protein sequence | MTRDAGPTPP SERIVALDAL RGVALLGILL INVWAFAMPE TTLFNPTVYA DTTVYGDFTG ANYWAWAFSH VFAQNKFITL FSALFGAGIL LFIESKEEKG QDAVRLHYRR TAILIAIGLM HAYLLWYGDI LVAYGVTALV VVAFRNLEAR KLAGVGVVFL LFLPVVELFA AITLGGDAIA SQWAPAEAAI EQQVATYRGG WLEQLDHRVP SSFSRQTTGY INGPFWQVGG TMLLGMALYR WGVLTGERSS ALYRRLVALG VVGLAITVAG VVYIEANDWS AGAALYWRQF IYVGSFPLAG GYLGIVMLYA RRRPDGPVTR GLAAVGRTAF TNYLLQTVIA TTVFYGHGLG LFGSVTRVEQ LGFVLVVWVV QIVLSVLWLR SFRFGPVEWI WRTLTYGERQ PIRNPE
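The relationship between the gene sequence, the protein sequence and the GC content field can be illustrated with a short script. This is a minimal sketch, not the method used by the database: it assumes the amino-acid assignments of NCBI translation table 11 (the same as the standard code), treats an initial ATG, GTG or TTG as methionine (a simplification of the table's start-codon list), and stops at the first stop codon. All function names are hypothetical.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the most common alternative start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # A start codon at position 0 is read as methionine.
        protein.append("M" if i == 0 and codon in START_CODONS else aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGACCCGCG ACGCCGGCCC GACGCCGCCC"
print(translate(prefix))
```

Applied to the first 30 bases of the gene sequence, this yields MTRDAGPTPP, the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives the value that the GC content field reports as a rounded percentage.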