Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_3219 |
Symbol | |
ID | 7399345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012028 |
Strand | - |
Start bp | 461000 |
End bp | 464896 |
Gene Length | 3897 bp |
Protein Length | 1298 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643707016 |
Product | hypothetical protein |
Protein accession | YP_002564638 |
Protein GI | 222476117 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCCTC AATCAGCCAC CTCAGCCACC GAATTTGTCG AGCAGGCATT CGAGACGACC GTCGACGATC ACGCACGGGC GGATCTTCTA CAGATCATCG ACGACGCCGT GTCGACGCTC CGTAAGAAGA TGAAGAAGGA CGAGACGGTA GAGCGGATGC TACGGAGCGG CGGTGGCGGC TCTTACGTCT TGAAAAGTAG CATGACCCGT GACGGGTTAC AACCGGAACC GTTCACTCAA AGTGCCGTCA TCGAGCCGCT GTTGGACAAA CTCGGCTACG ACTACGACAC GGAGGCGGGT GGCCTCTCAG GTGGCCGGAC GGAGGTGGCC GACTACACGA TCTCGCTACG CGACTACGAC GATATCGACT CTACCCGTCT ATTGATCGAG GCCGAGCCGA TCAACAAAGA CCTCGATTCT CGAAAGCACG GTGCGGGACA GGTTCGCAGC TGGCTCAGTC AGCGAGAGTT TGAGTCCGAC TTTGGCTTCG CCACGGACGG ACTTCGCTGG ATGTTCATTC GGTACGACCC GGATTCCTAC ACCCACAACG TCATTGAGGA GGTCGATCTA CAGCCAGTGT TTGTCGCTCT ATTTGAGAAC CAAGTTGGCC CGCGAGAGCC GCTCGAGGAG GCTGTTTTCG ATGCCGATCT TGAACGAGTT GACACGCTGC TACGGACCTT CGAGTTTGCT AACTTCCGGT CGATTGCAGG TGAGGCCCGG CAGGTCATCA AGCAGAAACA GGAGGAGATC ACCGACGAGT TTTACGACGA CTATATCCGG TACGTCTTCG GGATTGTCAA CGAGTCCGAG GAGACGCCCA GATCGCTCAC CCACGACGGC GTGATCGCAC CGGAGGGAGC AACCGCCGAC GACACGCGGC TGTTTTCGGT CGAGATGATG AACCGGCTTA TCTTCATCAA ATTCCTTGAG GACAAGGGAA TTGTCCGTCC GGACCTCCTA CAGAGCATCC TTGACACGTA TGAGGACGGC CTTTACACGG ATTCTCTCTA CCAGCAGTTC ATTCAGCCGT TGTTCTACGA CGTACTCAAC AAACGCCCGG ACAAACGCTC GCCACAGATA CAGGATATCG AGCTGTTCGC TGATATTCCG TACCTGAATG GTGGGTTGTT CCGGCCGTCG ATACAACACG ACGGCAGTGA CGACCGTGAG CAGTTCAAGG AGGCCGATTT TGACGTTCGG AATAGTATTC TCAGGTCGAT TCTTGAATTA CTTGAAAGCT ACAGCTTCTC CACGGATGGA TCAGTGACCG ACCTCGATCC GAGCGTGCTG GGTAACGTCT TCGAGAAGAC GATCAACTAC ATTACAGCAG ACAACGCCGA CCAGAACAAG GAACTCGGAG CCTACTACAC GCCGAAGGAG ATTACCCGGT TCAGTGCAGA GCGAACGGTT CGACCGGCTC TATTCGACCG ACTCAAACAG GTGGTAATTG AGGAACGCGG ATGGCCTGAA GCCGAGTTGG AGAACTACGA TACCGTCTAC GAGTTGATCG AATCGCTCCC GGCGTCGATG GACCTGATTA CCACATTACT CGGTGAGGTC GACAACTTTC GGGTGGTCGA CCCCGCCTGT GGGAGTGGGC ACTTTCTAAC CTCAGTGTTG GAGGAGATCG TCGGCGTTCG ACGAGCGTTG TGGGCACATA CCGATTCGTA CCCGCACGAA CAGGCACTCA AGAAGACGAC TGTCCAGCAC AACATATACG GCGTCGACAT CGTCGGCCCG GCCGTCGAGA TCGCCAAGCT CCGGTGTTGG CTGTCGGTGA TCGCTGAACT GCAACAAGAG GATCTAGAGT CGATGGATCA AGAGGAGTTG GCCCTACCTA ACATCGCGTT CAATCTCCGA CAGGGCAACA GCCTGATCGG GTACACGGGG TTCCCAGAGA CGACCGAAGA CGGCGACGGT TACACGCTCG ATAGCTTCAA CGAGGATACG GTCAGGACAC GCTACGAGAA CATCATCGAC GAGATCACAG CCTACGAAGA GGCGATTGAG AGCGAGCAGG CCGAACAACA CCGAAAAGAG GCCAATCGTC TACTCGAAAA TGCAAGAGAT GAGCTGGTCG ATGATGTGAA AGACGAATTC GTGGCGGCAG GTGTCGACGA TATCACGCCC GAAAAGGTCG AAACATTTGA TCCATTCCAC TGGGTGCTTG AGTACGCAGA AGTCTACTCT GATGGTGGTT TCGACGTGAT TGTAGGAAAT CCGCCGTGGG ATAGATTATC ACCACGTCGA GATGACTACT TCTCACGGTT TGATTCGGCG TTCAGAACGC TGATGCCGGA TGAAAAACAG GAACGACAAG AAGAATTACT GACTGATCCA GAGATCGCGG AAGGATGGGA GGAATACAAA CGAGAAACTG AAATCTTTGC TACTTACTTT AAAAATAGTG ACTCCTATGA ACTCCAGCAA CCGAAGGTAG CAGGAAGAAC GGCAGCAACT GAGAGCGATC TTTCTGCACT GTTTTTGGAG CGTGTTTTTC AGATAGCAAG AGACGACGGG TATCTCGCAC AAATTCTTCC AGGTGCGATT TTTAATGGCC TGTCGACCAA GGATCTCCGA CTACACCTTC TTGATGAAAC GAGCATCGAT TCGCTAGTCA CGTTTGAGAA TAATGGAATA TTCTCTGATA TTGATAATAG ATACAATTTT GGAGTGGTGA CTTTTGAGAA TCAAGGGGAA ACAACCGATG TAAGAGGGAT CTTCAAACAG ACTGATGTAG ATATACTTCA GAATTTTGAG GATCAAGCCC TGTCGCTCTC CCGGCGTGTT CTACGCAATT ATTCCCCAGA AGCCGCGATA TTCCCATATC TTCAGTCCCA GCAAGAAGTT GACGTTCTCG ACACAATCCT GCAACACCCA CCAATTTCAG AGGAAATTGG GTCATCGTGG TACGTAGAAC CTTACAGAGA GTTGGATCGA GGTAACGATG TCGACCGGTT TGTAGAGGAT GAGGAGGAAG GCGATTATCC TGTTCTCGGA GGAAGTAATA TATTCCAATT TGCATATAGT GACGCCTATT TCGGTGTTGA GTCACCGAAA TTCTGGAGTG TAGATGAAGA CAAAGATCCT GAACTAAGCG CGAAGAAACG AATACGTGGA AAGAATTTGC GGAAACTGAA ACGTGCGGTG TATGATGCCT TCGACGGCAC TGGTTCGCAA GTTGGGTTTG TAAACGACCT GCTTGAAAAA CGACGAAGCA AAGAACTCTC TGACGAGGAC GTTCTTCTTG ACTGTACAGA GTACCGTATC GTATACCGAG ATATTGCGAG GTCGACGGAC GAACGAACCA TGATTTCGAC TGTCATTCCG AAAGGTGTCG TCTGTCACGA CAAAGCCCCA CAACTACGTC CTTACAGTAT TGAACCGAGC GAGAAGGACC TCTCTGAAGA CACGCTACAC AGTGCCTACA AACGCATTTA TAGCGATGAA GAACTGTTTG TCGCCACTGG ATTACTAAAC AGTCTTCCGT TCGACTTTCT GATGAGGACT AAAATAGATT CGACTGTTGT ATTCTATAAA TTGAAGGAGT CACAAGCACC CCGACTCACC AAAGGTGATG AATGGTTTGA GTACATTTGG CGACGATCTG CTCGGCTCAA CTGCTATGGA GACGAATTTG CAGAAATGCG AGATCGACTG GACGGGATCG AGCCCGTTGT CGACGTTACC GAGCGTCGAC GGGTACAGGC TGAACTCGAT GCAGCGGCCT TCCACGCCTA TGGCCTCAAT CACGAGCAGA CGGCCTTCGT ACTGGGTGAC TTCCACCGAG TACAGAGCCC CCGGCTCATG GATGAAGACT ACTTCCAGCT GGTGCTTGAG AAGTACGAGC AGTTGGCCGA GGTAGAGGTC GAACAGGTCC AAGAGTCGAC GCAGTAA
|
Protein sequence | MPPQSATSAT EFVEQAFETT VDDHARADLL QIIDDAVSTL RKKMKKDETV ERMLRSGGGG SYVLKSSMTR DGLQPEPFTQ SAVIEPLLDK LGYDYDTEAG GLSGGRTEVA DYTISLRDYD DIDSTRLLIE AEPINKDLDS RKHGAGQVRS WLSQREFESD FGFATDGLRW MFIRYDPDSY THNVIEEVDL QPVFVALFEN QVGPREPLEE AVFDADLERV DTLLRTFEFA NFRSIAGEAR QVIKQKQEEI TDEFYDDYIR YVFGIVNESE ETPRSLTHDG VIAPEGATAD DTRLFSVEMM NRLIFIKFLE DKGIVRPDLL QSILDTYEDG LYTDSLYQQF IQPLFYDVLN KRPDKRSPQI QDIELFADIP YLNGGLFRPS IQHDGSDDRE QFKEADFDVR NSILRSILEL LESYSFSTDG SVTDLDPSVL GNVFEKTINY ITADNADQNK ELGAYYTPKE ITRFSAERTV RPALFDRLKQ VVIEERGWPE AELENYDTVY ELIESLPASM DLITTLLGEV DNFRVVDPAC GSGHFLTSVL EEIVGVRRAL WAHTDSYPHE QALKKTTVQH NIYGVDIVGP AVEIAKLRCW LSVIAELQQE DLESMDQEEL ALPNIAFNLR QGNSLIGYTG FPETTEDGDG YTLDSFNEDT VRTRYENIID EITAYEEAIE SEQAEQHRKE ANRLLENARD ELVDDVKDEF VAAGVDDITP EKVETFDPFH WVLEYAEVYS DGGFDVIVGN PPWDRLSPRR DDYFSRFDSA FRTLMPDEKQ ERQEELLTDP EIAEGWEEYK RETEIFATYF KNSDSYELQQ PKVAGRTAAT ESDLSALFLE RVFQIARDDG YLAQILPGAI FNGLSTKDLR LHLLDETSID SLVTFENNGI FSDIDNRYNF GVVTFENQGE TTDVRGIFKQ TDVDILQNFE DQALSLSRRV LRNYSPEAAI FPYLQSQQEV DVLDTILQHP PISEEIGSSW YVEPYRELDR GNDVDRFVED EEEGDYPVLG GSNIFQFAYS DAYFGVESPK FWSVDEDKDP ELSAKKRIRG KNLRKLKRAV YDAFDGTGSQ VGFVNDLLEK RRSKELSDED VLLDCTEYRI VYRDIARSTD ERTMISTVIP KGVVCHDKAP QLRPYSIEPS EKDLSEDTLH SAYKRIYSDE ELFVATGLLN SLPFDFLMRT KIDSTVVFYK LKESQAPRLT KGDEWFEYIW RRSARLNCYG DEFAEMRDRL DGIEPVVDVT ERRRVQAELD AAAFHAYGLN HEQTAFVLGD FHRVQSPRLM DEDYFQLVLE KYEQLAEVEV EQVQESTQ
|
| |