Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0412 |
Symbol | |
ID | 7401029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 429354 |
End bp | 432545 |
Gene Length | 3192 bp |
Protein Length | 1063 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643707476 |
Product | hypothetical protein |
Protein accession | YP_002565085 |
Protein GI | 222478848 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAACG ACAATAATAC ACGTAAGAAG GCCAGTGCGG TCTTCTTCGC GGCGATCATG GTCGTCTCCA TGGTCGCAAT TGGTTTCGCT GCTGCTCCGG CAGCAGCCGA CCTTGATGGT ACAAACAGTA CCATCGACTC TGCGGCCGCC GAACCCGTGA CGGCTGGGTT CGACTCGTCA GAGCAGGACG TACAGGTAAA GGTTACAATT AGCGACAATG CCAGCGACAA CGTGACTGTC GACCTCTCTG AGGCGGCAGA CGTGGAAGCT GTTCCGTCGC TACAGGACGT AACCATCTCC CCCAGTACGA CTGGTAATGT TAACCTCCTC GGCTCGGAGT TTGACGAGAG CACGAACGTT CTCGAGTTCC AAGTCAAAGG TACCACCGCA GGAGACGACG TCGCAACAGT CACTGCCTCC TTCGAGCACG ACCTCAGCAA TGTTGAAGGT CCCGCCGGTA CTACAAACCG GCTGAGCTAC AGCGTTGGTG CAGGCACTGA AACGGATTCC ACGATTGCTA ACTTCGAGGT TTACTCCGCT GAGGATGTCA GCCTTGACGC ACCGCTCACC GAAGACAATG TGAACTCCGC GGATTTCACA GTCTCGCACC CGGCTAACAC TCCGGTCGCC GGTGATGAAG TTTCACTCTT CATCACTGGA AACGATAATT CGACTGTTGT CGATCCCACT TCGGCTACCA CAGACGCAGA CCCCACAACC TTCACGGGTG TCAATCTGGA CAGTCAGGGA ATCGCTTCCG GTGCTGGCGA CGAGATCCAG TTCCATGCTG TGTTGGCGGC TGACGCCGAC AACGGTACGA GCGGATTCGA GGCCAGCTTA AATAACGTTC AGGCTGATGG TACCGCCACT GTCGATGATT CTGATGCGGA TATCATCACC GATAGTGATT GGGACGGTTC GCAGCTCTGG GAAGGTCAGA CAGTTACTGT TGACCTTAGT GGCAGCAATG TTGCTCCCGG TGACGAGGTC ACCGTGCGAG AGGTCACTAA CCGCAATGGT AACGGGTACG CGACCAACAC GCGACTAGCC CGCTCCCTCA CCGTTAGCGG TGATCGGACG ATCTCCATCG AAACGGATCG TCTCCGTGGT GAAGCCGAAT ACGTCCTTCG GACGTCTAAC GGGCTCCTAG CAGCCGGAAC GGCCGGCGCT GGTACTGACG GAGAGTTCAG GATGGTTCTT GGTGTATCCG ACATCGCTGA GGCGCAAGTT CTCACGCAGG ACCTCTCGGC AGAGTTTGCT GAGGATGAAG CGAACAACGA CGAGAAGATC GACGTCGACG TTGCATCGCT GCGTAGCGAA TTCGACGTTG AAGTGAGCGG CGACCTTGAC GACAGCGAGC TCTCCGAAGA AGAACTCGAG AACATCTTCG ACGACGAGAG CGCAACCGGC CTCGACGACG GCGACGATGA CACGGTGCTC ATCGAGGACG TCCAGGACGG CGAAGCCTTC GTTGCGAACT TCACTGATGT TGACGGCGGC AACTACTCCT TCGACTTCGA CGTCGAGGAC ACTACGGCCT CCGACACCGA CTCGATCGAA GTGACCGAGC TCGGCGAGGG TGAACTCACC CTTGGTGGCG AGAGCATCGT CACCGAACAG CAGGGTGACG TCGCGAACAT CACCGTCACC TTTGACGGTA CCGCTGAAAC TGGTACGCTC CTCGTCGGCG ACGAAGACGA TGTTGGCTAC CAGGGTAACA TCACGATTGA CTCGAACGGC GAAGACGAAG TGAACGTCCT GTTCAACAGC TACGCCGCGG GTAGCTCCGG TAACGGGACG GTCTTTGAGC TCGCGAACCC GGACGACACG GACGCTGAGC TCGACAACTT CCAGCAGAAC CAGATTTCCG ACGTCCTCTC CGATGGCGAC TACACGCTCT CGGTGAGTAC GTCGAGCGAC TACGACGATA CGCTGGACGA TCCCGACACG ATCGGGACGC TCGTGCTCGA ACAGCGCGCG ACGACGAACC AGCAGATCTG GACGACCTCT GAAGGCACCG TCACTGACAT CGTCGACGCA GCAGATGCCG ACGATGAGGA CGGGCTCGAA GAGCTGAACT CCCAGATTGA GGGTGACAAC GTGACCCAGG CGAGCACGAT CGCCGAGGGC GACTACGTCA TCCACCAGAT CGAGGCGTCC GGCCTGTCCG GCCTCCTCGC GCAGTACGAC GACGACCCGA CCACCGCGCT TAACGATGCA GTGACCGACA CCGGCGCGGG TATCGGCACT GCAGTGACCG ACACCGGCGC GGGTATCGGC ACCAACGACG GTGCGCTCAG CCTGCGTGTC CGTCAGACTC AGGCCTCGAC GACCGCCAAC CAGGATCCCG CTGAACTGCA GAGCAACATC CTGGGTAACA TGACGGTCTT CGCAGACGAG GAGACGAACA ACTACTACGT CGTCTACGAC CTCGACGACA CGTCGGCTGA GGATGGCGAA GCGTTCGACG CACGCTTCCG CGTGCAGGAC GATCGCCTCC TCAACCCGTC TGACTCGGAC CGTGACGCTC TCTCGACGAA CGAGCTCACC AACGAGTACT ACCAGAGCGT GACCGCTTCC TTCGACGTGG CAGAGCGTGA ATTCGAGTTC GACCAGGACC CTTACAACGT GACGAACGCT GAGGGTCAGG CCGTCTCGGG CACCTCGAAC GTTGCGCCCG GTACTGAAGT GAACGTCCGC CTCCGCTCGG CGTCTGGTAC GAGTCCCTCG TTCATCGAGA CGAGCGAGGG CGTGCGTGTC AACGCTGACG GCACGTGGAT GACTGAATTC GACTTCAGTG ACACCTCGGT CGGTGACGAG TACACGTTCA CGGTCCGGCA GACGGGCCTT GACGAGAACC CGTCCGTCGA CGGTACGGTC ATCGAAGCAG TCGATGACGG TACCGACGAT GGTGACGACG GCAACGTGAC CGACGGTGAC GACGGTGACG ACGGTAACGT TACCGACGGT GACGATGGTG ACGACGGCAA CGTCACTGAC GGTGACGACG GCACCGACGA TGGTGACGAC GGAACCGACG ACGGTGACGA CGGCTCCGAC GATGGATCCG ACGGCTCCGA CGGCGGTGAC GACGGCGGCG ACTCCGAAGA CGGCACGCCC GGCTTCGGTG CGCTCGTCGC TCTCGTCGCC CTCATCGCGG CTGCGCTCCT CGCGACGCGG CGTAACGAGT AA
|
Protein sequence | MTNDNNTRKK ASAVFFAAIM VVSMVAIGFA AAPAAADLDG TNSTIDSAAA EPVTAGFDSS EQDVQVKVTI SDNASDNVTV DLSEAADVEA VPSLQDVTIS PSTTGNVNLL GSEFDESTNV LEFQVKGTTA GDDVATVTAS FEHDLSNVEG PAGTTNRLSY SVGAGTETDS TIANFEVYSA EDVSLDAPLT EDNVNSADFT VSHPANTPVA GDEVSLFITG NDNSTVVDPT SATTDADPTT FTGVNLDSQG IASGAGDEIQ FHAVLAADAD NGTSGFEASL NNVQADGTAT VDDSDADIIT DSDWDGSQLW EGQTVTVDLS GSNVAPGDEV TVREVTNRNG NGYATNTRLA RSLTVSGDRT ISIETDRLRG EAEYVLRTSN GLLAAGTAGA GTDGEFRMVL GVSDIAEAQV LTQDLSAEFA EDEANNDEKI DVDVASLRSE FDVEVSGDLD DSELSEEELE NIFDDESATG LDDGDDDTVL IEDVQDGEAF VANFTDVDGG NYSFDFDVED TTASDTDSIE VTELGEGELT LGGESIVTEQ QGDVANITVT FDGTAETGTL LVGDEDDVGY QGNITIDSNG EDEVNVLFNS YAAGSSGNGT VFELANPDDT DAELDNFQQN QISDVLSDGD YTLSVSTSSD YDDTLDDPDT IGTLVLEQRA TTNQQIWTTS EGTVTDIVDA ADADDEDGLE ELNSQIEGDN VTQASTIAEG DYVIHQIEAS GLSGLLAQYD DDPTTALNDA VTDTGAGIGT AVTDTGAGIG TNDGALSLRV RQTQASTTAN QDPAELQSNI LGNMTVFADE ETNNYYVVYD LDDTSAEDGE AFDARFRVQD DRLLNPSDSD RDALSTNELT NEYYQSVTAS FDVAEREFEF DQDPYNVTNA EGQAVSGTSN VAPGTEVNVR LRSASGTSPS FIETSEGVRV NADGTWMTEF DFSDTSVGDE YTFTVRQTGL DENPSVDGTV IEAVDDGTDD GDDGNVTDGD DGDDGNVTDG DDGDDGNVTD GDDGTDDGDD GTDDGDDGSD DGSDGSDGGD DGGDSEDGTP GFGALVALVA LIAAALLATR RNE
|
| |