Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2004 |
Symbol | |
ID | 7402023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1997922 |
End bp | 1999013 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643709075 |
Product | hypothetical protein |
Protein accession | YP_002566652 |
Protein GI | 222480415 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.069614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.292066 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA GAACGAGCGG GACGGCGTCC GAAGAGGCGA ACGAGGGCGA GAAAGCGGAG GGAGGCGAAG ATACCGACGA CGGCGACACG GCCGACACGA TGCGCGTTCG CGCCGGTGAT AGCCGGGTGA AGCTCTGGCT GCTGTTGCGT GCGAACCGGC TCGTCGTCGC GGGAGTCTTA ACACTCGTCG TCTTCGTCGC GTTCGTCACC GTGGCGGCCG CGTTCTCCCC GTCTCTCGCT GAGAAGATCG GGTCGGGCGA TCCGATCGAT ACGCTGTTCT CGACGATGAT CGCGGCGATC GTCACCGGAA CGACGCTCGT CGTCACAATC GGTCAGGTCG TGCTCACGCA GGAGAACGGC CCGCTCGGAG ACCAGCAGGA GCGCATGAAC GACACGCTCA CCGTTCGAGA CTCTATCGCG GAACTGACCG GCTCCCCGGT GCCCACGGAC CCCGCCGCGT TCCTCGATGC GATCCTCGTC GCCGCGTCGG AGCGCAGCCG GGCCCTCCGC GAGTCGGTTC GGGAGCGCGA CGGCGACCGG TCGGACCGGA TCGCCATCCG AGAGGACGTC GACGACCTCG CCGCGAATAT CATCGAAAAC GCCGACGGCG TGCGCGACAG TCTGGACGGT GCGGAGTTCG GCTCCTTCGA CGTGGTGTTC GCGGCCATCG ACTTCGACTA CAGCCCGAAG ATCGGCCAGA TCGAGCGCGT CGACGACGAC CACGACGACG CGTTCACCGA CGACGAGCGC GCTCTGCTCA AGGAGCTGAA GGAGTCGCTG TCGCTGTTCG GTCCCGCCCG CGAACACATC AAGACGCTGT ACTTCCAGTG GACGCTGATC GACCTCTCGC GGCAGATCCT CTACGCCGCG GTGCCCGCGT TGGTCGTCGC GGGGCTCATG CTCGCGGTCG TCGACGCCGG GACGTTCCCC GGGAGCACCC GCGGGGTCGA CCACGTGACG CTCGTCGTCG GGGCTGCGTT CGCGGTCACG CTCCTCCCTT TCTTGCTTTT CGTCTCCTAC GTGCTCCGCG TACTCACCCT CGCGAAGCGC ACGCTCGCCA TCGGGCCGTT GGTGCTGCGG GACTCGAAAT GA
|
Protein sequence | MTERTSGTAS EEANEGEKAE GGEDTDDGDT ADTMRVRAGD SRVKLWLLLR ANRLVVAGVL TLVVFVAFVT VAAAFSPSLA EKIGSGDPID TLFSTMIAAI VTGTTLVVTI GQVVLTQENG PLGDQQERMN DTLTVRDSIA ELTGSPVPTD PAAFLDAILV AASERSRALR ESVRERDGDR SDRIAIREDV DDLAANIIEN ADGVRDSLDG AEFGSFDVVF AAIDFDYSPK IGQIERVDDD HDDAFTDDER ALLKELKESL SLFGPAREHI KTLYFQWTLI DLSRQILYAA VPALVVAGLM LAVVDAGTFP GSTRGVDHVT LVVGAAFAVT LLPFLLFVSY VLRVLTLAKR TLAIGPLVLR DSK
|
| |