Gene Information |
Locus tag | Hlac_0624 |
Symbol | |
ID | 7401759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 641922 |
End bp | 643574 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643707690 |
Product | hypothetical protein |
Protein accession | YP_002565296 |
Protein GI | 222479059 |
COG category | [S] Function unknown |
COG ID | [COG3390] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
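The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon of the CDS is a stop codon. The function name `check_gene_lengths` is hypothetical and not part of any database API.

```python
def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length, and protein length agree.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    (both are assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    codons = span // 3                    # whole codons in the span
    return (
        span == gene_length_bp            # reported gene length matches span
        and span % 3 == 0                 # CDS length is a multiple of three
        and codons - 1 == protein_length_aa  # stop codon is not translated
    )


# Values taken from the record: Start bp, End bp, Gene Length, Protein Length.
print(check_gene_lengths(641922, 643574, 1653, 550))
```

With the values listed in this record the span is 643574 - 641922 + 1 = 1653 bp, which is 551 codons, or 550 residues once the stop codon is excluded.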
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.802185 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG ACGGACCGGG CGCCCGCGAG GTCGCTCACC GCGTGTTCGC CGCCGAGTTC GACGACGCCT CCCTCTCGTA CTCCGAGAGC GACGAGGAGC GCGCCCCGAA CTACGTCGTC ACCCCGACCG GCGCGCGCGT GAACCGGCTG TTCGTCGCGG GCGTGCTCAC CGAGGTCGAG CGCGTGAACG ACGAGACGCG TCGCGGGCGG GTCGTCGACC CATCCGGCGC GTTCGTCACC TACGCCGGCC AGTACCAGCC CGAGGCGCAG ACGTTTCTCG AACGCGCCGA GCCGCCGGCG TTCGTCGCGC TCACGGGCAA GGCACGCACC TTCGAGCCGG AGGACTCAGA TCGGGTGTTC ACTTCGGTTC GCCCCGAGAG CCTCAACGCG GTCGACGCCG ACACCCGCGA TCGGTGGGTC GTCTCCGCCG CGGAGGCCAC TCTCGACCGA CTCGCCGTCT TCGCGAAGGC CCTCGACTCG GAGCTCCGCG GCGAGGAACT CCGCGTAGCC CTCGAAACGG GCGGCGCACC GGCGGCGCTG GCGGCCGGGA TCCCGAAAGC CATCGCGCAC TACGACACCT CGACCGCCTA CGTCGAGGCG CTCCGGCGGC TCGCCGTCGA CGCCCTGAAG CTGATCGCCG GCGACCGCGA CGAGGTGCGC TCACCCGATA TCGCCCCCGA TGCGGGCGGT GAGGCCGCCA TCGGCGCGCT CCCGGAAACG GACGTGACGA TCGAGGCGCC GGCGGAACCG GTCGCCGACG ATGGGGCGGT GGAATCGGAA TCCGAGTCTG TCGCCGCCGA ATCGGACGAG TCTGCCTCTG AGCCGGAACC CAACGCTGAG TCGGAACCTG CGGTGACAGG TGACGAGACG GCCGACTCAA CCGAGGCCGA CTCAACCGAG ACCGACTCAA CCGAGACCGA CTCAACCGAG ACCGACTCAA TCGAGACCGA TACTGAGGAC ACCGCCGTGT CCGTCGAAAC CGAACCGTCG GATTCCGGTT CCGATGCTGC TGGATCCGTC GACGCCGGCC CCGGCTCCGC CGACTCCGAG TCTGACGACG AGAGCGGTGG GCTCGGGGAC TTCGACGCCG GGACCGACGA TACCGAGACC GACGAATCCG GGACGGACGA CGCCGAGGCC GACCTCGACG AGCCGTCGTC CGAGCCCGGC TCCGACGAGA TGTACCAGCT CGACGACGAG GAACGCGAGG AGATCGAGTC GGAGTTCGGC ACCGACTTCT CGACCGGAAC CGAGGTCGAC GAGCCCGGCG AGGCCGACAT CGACGTACCC GATCCCGAGG AGATAGAGGA GTCGCCGACC TCGGTGGGCG CTTCGGAGTC CGCCGGGGCC GACGCACCCG CCGGAAGCGC TGACGAGGCC GCAGCGGAGC CCACCACTGA TTCGGAACCA GAGAGCGAAG AGCCGGCGGG CGACGAGACG AGCGATGCTG ACGCCGACAT CGACCTCGAA GCCGCCGTTA TCGACGCGAT GGGTGCGCTC GACGACGGCG CCGGCGCCGA TCGCGAGGCG GTCGTCGAGG CGGTCGTCGA CGACCACGGC GTCGCAGCCG ACGCGGTCGA GGACGCGATC CAGGACGCGC TGATGAGCGG GAAGTGCTAC GAGCCCGGCG ACGGCACGCT GAAGCCGATC TGA
|
Protein sequence | MSDDGPGARE VAHRVFAAEF DDASLSYSES DEERAPNYVV TPTGARVNRL FVAGVLTEVE RVNDETRRGR VVDPSGAFVT YAGQYQPEAQ TFLERAEPPA FVALTGKART FEPEDSDRVF TSVRPESLNA VDADTRDRWV VSAAEATLDR LAVFAKALDS ELRGEELRVA LETGGAPAAL AAGIPKAIAH YDTSTAYVEA LRRLAVDALK LIAGDRDEVR SPDIAPDAGG EAAIGALPET DVTIEAPAEP VADDGAVESE SESVAAESDE SASEPEPNAE SEPAVTGDET ADSTEADSTE TDSTETDSTE TDSIETDTED TAVSVETEPS DSGSDAAGSV DAGPGSADSE SDDESGGLGD FDAGTDDTET DESGTDDAEA DLDEPSSEPG SDEMYQLDDE EREEIESEFG TDFSTGTEVD EPGEADIDVP DPEEIEESPT SVGASESAGA DAPAGSADEA AAEPTTDSEP ESEEPAGDET SDADADIDLE AAVIDAMGAL DDGAGADREA VVEAVVDDHG VAADAVEDAI QDALMSGKCY EPGDGTLKPI
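The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and a basic completeness check. All function names are hypothetical, and the start-codon set is a deliberately reduced assumption (ATG, GTG, TTG) rather than the full initiator list of translation table 11; the stop codons TAA, TAG, and TGA are the ones that table uses.

```python
# Assumed subset of start codons; translation table 11 permits more.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons used by translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped):
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq):
    """Check length divisible by three plus plausible start and stop codons."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# The first two blocks of the gene sequence, used only as a short demo input.
fragment = clean_sequence("ATGAGCGACG ACGGACCGGG")
print(fragment, gc_fraction(fragment))
```

Pasting the full gene sequence from the record into `clean_sequence` and passing the result to `gc_fraction` and `looks_like_complete_cds` would allow a comparison against the reported GC content and gene length.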