Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2233 |
Symbol | |
ID | 7399942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 2218307 |
End bp | 2219407 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643709306 |
Product | hypothetical protein |
Protein accession | YP_002566880 |
Protein GI | 222480643 |
COG category | [S] Function unknown |
COG ID | [COG5282] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03624] putative hydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.442538 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATCC TCCGAAGCCT CCGGACCGTC TCCGAGGCCA GCGGGCCGGG CGTCGTCGAC TGGGACCGGG CCGCGGCGGC CGCGAAGGCG AGTACCGACT CCGGATCGAT CGCCCTCACC GAGGCGGAGC GAGCCGGGTA CGCGGCCGAC GTGCGCGACG CGCGCTCCCG TCTCCGCGAG GTCGCCGGTA TCGAGTTCGA CGTGCCCGAC CGCATCGAGG TGCAGAACCG GCACCACTGG ATCGACGCCA GCGTCGACAC GTTCCGGAAC GTGATGGCGC CGATCGAGGC GGCGACGACC GATTCCGACA ACGAGGGGGC GGTGATCGGG GGCGGCGAGG AGCCGATCGG CGGGATCGTC GAGCCGACCG GCGGGCCCGT GGGCTTCCCG ACCGGCGACC TGACGCGAGG GTTCGCGCAG GACCTCTCGC GGATCGCTAA CACTGGCTCG ATGGCGTTCA CGCTCGGGTT CTTAGCGCGC AACGTACTCG GCCAGTATGA CCCGCTCCTG TTGGCCGACG AGCCCGACGC CGACCACGGG CTCTACTTCG TCCACCCGAA CATCGTCGCG GTCGCGGCGT CGCTCGACGT CGAGTACCCT CGGTTCAGGC GCTGGATCGC TTTCCACGAG GTGACGCACG CGGCGGAGTT CGGCGCGGCG CCGTGGCTCC CCGAGTACCT CGAATCGCGG GTTGAGCGCG GGATCAAGGG GCTCACCGGC GGCGACAGAC TGACCGCGGG CGGGCTGCCG GTCGACGCGC TCGATACCGA GCCGTTTGCG GAGCTGCAGG CGGCGATGAC GGCGGTCGAG GGGTACGCCG AGGTGCTGAT GGACCGTGCT TTCGACGGCG AGTACGCCGA CCTCCGCCGG AAGCTTGACG AGCGTCGGGG CGGAGGCGGC CCGGTCCAGC GGCTCGCGCG CCGGCTGCTC GGGCTCGGAC TGAAGCGCCG GCAGTACGAG CGCGGCGCCA CCTTCTTCCG ACACGTCGCC GACGCCCGGG GGATCGAGGC GGCCGGCGCC GTCTGGGAAC GTCCCGAGAA CCTTCCGACG AGCGCCGAGC TTGAGGATCC CGACATGTGG CTGGTTCGAG TCGACCCCTG A
|
Protein sequence | MDILRSLRTV SEASGPGVVD WDRAAAAAKA STDSGSIALT EAERAGYAAD VRDARSRLRE VAGIEFDVPD RIEVQNRHHW IDASVDTFRN VMAPIEAATT DSDNEGAVIG GGEEPIGGIV EPTGGPVGFP TGDLTRGFAQ DLSRIANTGS MAFTLGFLAR NVLGQYDPLL LADEPDADHG LYFVHPNIVA VAASLDVEYP RFRRWIAFHE VTHAAEFGAA PWLPEYLESR VERGIKGLTG GDRLTAGGLP VDALDTEPFA ELQAAMTAVE GYAEVLMDRA FDGEYADLRR KLDERRGGGG PVQRLARRLL GLGLKRRQYE RGATFFRHVA DARGIEAAGA VWERPENLPT SAELEDPDMW LVRVDP
|
| |