Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3548 |
Symbol | |
ID | 8744168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3649659 |
End bp | 3651959 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646514129 |
Product | hypothetical protein |
Protein accession | YP_003405083 |
Protein GI | 284166804 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCAGATC CCCCCGACGA TCCGACGGAC GACGGCCCCG ACTATCGACA GCTCTCGGTC GTCGCGCTCG CGGCGCTGGC GATTGTGCTC GCCGCGTTTC TCGCCCCGGC CGGCAACGGG ATCATGAGCC CCGATGCCAA CGTCAATCCC GACGCCGACC CGAACGTCGA CCCGAACGCG AACCCGAACG TCGATCCGGG ATCGACCCCG GCCGAGCCGC AGGGCGACGG TCCCGATAGC ATCCCGGACC TCGATATCAA CTGGGACGGA CTGCTCGAGT GGACCGAGTT CGACTGGCTC GAGGACGAAG GCCCCGACGC CGAACCCACC GACGACGCCG ATCGCGAGAT CGAGCGGGGC GACGGCGGCC TCGCTACGCC GGCGTGTACG ATCTGGCTGG ATCGCGAACC GACGCCGGGC AGCGAGGTGG CCGCGAGGAT CCAGTACGAG GGCGAACCCG TTACCGGCGC CGACGTCTGG TACAACGACG ACTACGTCGG CAAAACAGAC GAGCACGGAC AGGTGACGGG TACGGTTCCC TACGTCGAGA ACCTGAACGT CCGCATCGAG TCGGACGAGT ATCCGGCGTG TTCGGGGACC GCGGAGACGG CGAGCGTGAC GGCGTCCGCG ACCGGGCTAC CCGGCGCCGG TGCGGGGACG ATCACGACCG CAACCGCGAG TGCGAGCGCG ATGCCAACGG ACTCGACAGC GAACGCGCCG GCACTCGCGA GCGCCCAGGA AGTCGACGAG GTCGACAACG GCAGCGTCGA GTACGAGGTC GAGGGAGAGG TCGAAATCAC CGTGCGCGGG CCGCCGGATC CAGGCGAGTC CGTGACCGTC GAGGCGTCCA TCGAGGGCGT CCCGATGGCT AACGCGAGCG TGACGATCGA CGGCGAGGTG GTCGCCGAGA CGGACGCGAA CGGCACCGCG ACCGTCGACG TCCCCGACGA CGGCACCGAA TCGTTCGAAC TCGGCGTCGC CCGCGGCGAT TTCGCCGGGA CGACGACCGT CGAGGTCCGA CTGCTCGAGG TGTCGCTCTC CCCCGAGGGG CTGGTGCCGA TCCCCGGCAG CCCCGGCGTC CTCGAGGCGA CCGTCGACGA CGAGCCGGTT GCGGACGCCG AGGTGCGGAT CGGCGGCGAG GCGATCGGCA CAACGGATGA TGACGGTCGC CTCGCCGTGG CTTTACCCGT CGATCCGACG ACGGCGATCA CGGTCAGCAC GCCCGATCGG ACCGAGACCG TGAGCCTCGC CGGCCAGTAC GGCGGTATCG CGGCCGTTCT CGGCGTCGCC GTCGTCGGGC TGACCGCCGT CGCCGCTCGG ACCCACGGCA GGCGCGGCGC GGCTGCCGTC CTCGGCGCGA CAGCAGCGGC GCTGGTCGTC CTCACCGTCG AGGCGTTCTA CGGACGAACT GCCGGGACTC TCGCACTCCT CGCCCTCGCG GCGCTGGCGC TCGCCGTCAC CTACGTCCGA AGCGACCGAA CGATCGTCGA CGAGCGCCCA TCTCCGGGCG ACGCCGCGGA GGACGCCCTC GAGTGGCTCG TCGACCGACT CCTCGCGCTG GTCGCCCTCC TCGAGCGCGT CGTCGACTGG GTCCGCACGC GGCTCGCGGC CGCTCGAACG TGGGCCTCGT CGCTGCCCCG ATCCGGCAGG GCCCTGGCCG CGGCGTTCGG CGGTTGGCTG GCCGCGCTCC CGGCGCGAGC GGTGACGATC GGGCGTCGGT GGCTCGAGGC GCTTCGAGGG GTCCCGTCGG CGGTGATTGT CGGCTCCCTC GCCGCCGTTC CCGCCGTTTC CGCCGGCTAC ATCGTCGACG GGACGCGGGG AGCCACTGTC GTCGCGGCCG CGCTCGGCGT CGCGGCGCTG CTCTACCGAG GACGCGACGA AGACGACGAC CACGAGGTGC GCGAAGACGA GGGCGACGAG GACGAACGCG TCGAACGAAC GGACACCGTG CCCGACACCG CGCCGGAGAC CGACGATCGT CTCACCTTCC GCGAACTGTG GCGGGCCTTC GCCCGACGGG TCGCCCCCCG GCGGTGGCGC CACTACACGC CCGGTGAGGT CCAGCGCGCC GCCCGCCAGC AGGGCTATCC CGCCCGGCCG GTCGACGAAC TGACGACGCT GTTTCGGGAG GTCGAGTACG GCCGCCGTCC ACTCTCGAGG GGTGTCCGTG ATCGGGCCGA CACGGCCTAC GCGGCGCTCC TCGAGACGGA GTCGGACGGC GACGATAACG ACGCGGACGA TTCGGACGGA TCGACGGCCG ACGACACGAC CGATCGCGAC CCGGGAGGTG AGCGACCGTG A
|
Protein sequence | MPDPPDDPTD DGPDYRQLSV VALAALAIVL AAFLAPAGNG IMSPDANVNP DADPNVDPNA NPNVDPGSTP AEPQGDGPDS IPDLDINWDG LLEWTEFDWL EDEGPDAEPT DDADREIERG DGGLATPACT IWLDREPTPG SEVAARIQYE GEPVTGADVW YNDDYVGKTD EHGQVTGTVP YVENLNVRIE SDEYPACSGT AETASVTASA TGLPGAGAGT ITTATASASA MPTDSTANAP ALASAQEVDE VDNGSVEYEV EGEVEITVRG PPDPGESVTV EASIEGVPMA NASVTIDGEV VAETDANGTA TVDVPDDGTE SFELGVARGD FAGTTTVEVR LLEVSLSPEG LVPIPGSPGV LEATVDDEPV ADAEVRIGGE AIGTTDDDGR LAVALPVDPT TAITVSTPDR TETVSLAGQY GGIAAVLGVA VVGLTAVAAR THGRRGAAAV LGATAAALVV LTVEAFYGRT AGTLALLALA ALALAVTYVR SDRTIVDERP SPGDAAEDAL EWLVDRLLAL VALLERVVDW VRTRLAAART WASSLPRSGR ALAAAFGGWL AALPARAVTI GRRWLEALRG VPSAVIVGSL AAVPAVSAGY IVDGTRGATV VAAALGVAAL LYRGRDEDDD HEVREDEGDE DERVERTDTV PDTAPETDDR LTFRELWRAF ARRVAPRRWR HYTPGEVQRA ARQQGYPARP VDELTTLFRE VEYGRRPLSR GVRDRADTAY AALLETESDG DDNDADDSDG STADDTTDRD PGGERP
|
| |