Gene Information |
Locus tag | Htur_4272 |
Symbol | |
ID | 8744900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 537327 |
End bp | 540086 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646514817 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003405764 |
Protein GI | 284167486 |
COG category | [R] General function prediction only |
COG ID | [COG3413] Predicted DNA binding protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
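The coordinate and length fields in the Gene Information rows can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information rows of this record.
START_BP = 537327
END_BP = 540086


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints: 2760 919
```

Both results agree with the Gene Length (2760 bp) and Protein Length (919 aa) fields of this record.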
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.312579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTCTC GATCGCTGAC GGAGGCCCTT CGGGAGACGC TCGCGCTCTT CGACGAGGCC GGGATCCCGC GGACGACGAC CGAACTGGCG GACGATCTCG AACTCGGTCG ACGGAGCACG TACGAGCGAC TGGAGCGACT CGTCGAACGC GGCGAACTCG AGACCAAGCG CGTCGGGGCG AGCGCCCGGG TGTGGTGGCG GCCCCCGTCG GCGAGAACCG ATCGACCGCT CGGTTCCGGA GACTGGCCGG TCGAAGCCGA GTCCCTGCTC GCCGCCGCTC TCGACAGGAC CGACATCGGC GTGTTCGTCC TCGACGAGTC GTTCAACGTC GCGTGGCTCA ACGAGACGGC CGAGCGATAC TTCGATCTCG ACCGCGAGCG CGTCCTCGGG CGGAACAAAC GCCGACTCGT CGAGACCGAC ATCGCGTCCG CGGTCGACGA CGCGACCGCG TTCGTCGACA CCGTTCTGGC GACCTACGAC GACAACACGT ACGCCGAACG ATTCGAGTGT CGCGTGACGG CGGAGGAGGA GCGCGAACCG CGCTGGCTCG AGCACCGCAG CGAGCCGATC GAGACCGGGG CGTACGCGGG CGGACGGATC GAACTCTACC ACGACGTCAC CGATCGGAAG CAGCGCGCGT CGCGCGAACG GGAACTCGAA CAGTACGAGC GGCTCGTCGA GACCGTCCCC GACGGCGTCT ACGCGGTCGA CGAGGACGCC CGGTTCGTCG CGGTCAACGA CGGCTTCTGT GAACTGACGG GATACGACCG GGACGAACTC CTCGGGGCGC ACGCGACGAC CGTCCACGAC GCCGAGATCA CGACGCGGGC CGAGTCGCTG GCCGAGGAAA TCGCGACGGG CGAACGAGGT TCCGCGACCA TCGAGCTCGA TCTACAGACC GACGGCGGAG CGACCGTTCC GTGTGAGAGC AGGCTGGCCC CGTTCCCGCT GGGCGAAACC GACGGCCGCT GCGGCGTGGT TCGCGACGTC TCCGATCGCA TCGAACGGGA ACGGACGCTT AGACGACGGA TCCGGCAACA GGAAGTCGTT GCGGACCTCG GTCAGCGGGC GCTCGAGAGC CGGGACCTGG ACGAACTGCT GGCCGACGCG GCCGACCTGC TCACCACGAC GCTCGAGACG GACTACTCCA AGGCGCTCGT TCTCGATCGG ACCGCCGACG AACTCCGGTT GCGACAGGGT TCCGGCTGGG ATCCGAACGT CGTCGGAACG ACGACCGTCT CGGCCATCGA AGACGAGTCA CAGGCCGCCT ATACGCTGGC GTCTGGGGAT CCGGTCCTGG TAGACGACCT CGACTCGGAG TCGCGCTTCA GCGGGCCCGA CCTGTTGACC GACCACGACG TCACGAGCGG TATCAGCGTC ATTATCGGTT CCAGTGAGGA TCCCTGGGGG GTCCTCGGAG TCCACGACAC CGATCACCGG GCGTTCTCGG AACACGACGT CGCCTTCGTT CAGTCGGTCG CGAACGTACT CGCGACCGCC ATCGACCGGA ACGATCGCGA GCGGGAACTG AAGCGCCAGC ACGAACAACT CGCCGCGCTC GACTCCCTCA ACGGCGTCGT CCGCGACATC ATCGATGAGG CCATCGACCG GTCGACGCGC GAGGAGATCG AGCGGAGCGT CTGCGAGCAC CTCGCCGCGA CGGACTCGTA TCTGTTCGCG TGGATCGGCG ACGTCGACGC CGCCGATCAG ACGGTGAACG TTCGAACCGA GGCCGGCGGC GTGGCGGAGT ACCTCGAGGG GATCGCCATC TCCGTCGATC CGGACGACGA GCGAAGCGGC 
GGAGCGACGG GCAGGGCCGT TCGGAAACGC GAGATCCAGA CCACGCAGGA TATCCGTGAC GACTCCAGAT ACGAACCCTG GCGGGGCCAC CTCGAGGAGT ACGAGATCCG GTCGTCGGCC GCGATCCCGA TCGTCCACGA AGAGACCATC TACGGCGTGT TGAGCGTCTA CGCCGATCGG CCGAACGCGT TCGATGGTCG AGAGCGCGCC GTGCTCGGCC AACTGGGGGA TGTGATCGGT CACGCCATCG CTTCGACAGA GCGCAAGCGT GCCCTGATGA GCGACGACGT CGTCGAACTC GACTTCCGAA TTAGAGATCT GTTGGACGAA CTCGACCTCG ACGTTCCGTC GGCCGGACGG ATCACGCTCG ACCACACGGT CCCGATCGAG GACGACGAGT TCCTCGTTTA CGGGAGCGCG ACTCCGGACG CGATCGACGG CCTGGAGGCG ATCGTCGAAA CGCTTCCCCA CTGGAAAGCC GTCACGTATC GAGGCGGCGA CGCAGCGACG CGGTTCGAAC TCCGACTCTC CGAGCCGCCG GTGCTGTCGA CGGTCGCTTC GCTCGGCGGG TCGGTCGAGA GCGCCGTCAT CGAAGGCGGC GATTATCGCA TGACGATCCA CGTGGCGTCG GGTGCGAAAG TCCGACAGGT TGTCAACGTC GTGCAGGACG CCTACCCCAC GGCGGAACTA CTGAAACACC GTCAGCTCGA ACGACGGCAG GACACGACCG ACCGCGCTCG CCACGCGCTG ACGGCGGTCC TCACCGATCG CCAGCGATCG GCCCTCGAGG CGGCATACCA CGCGGGCTAC TACGAGTGGC CGCGCGACGC GTCCGGCGAG GAGGTCGCCG AATCGCTGGA AATCGCACCG CCGACGTTCA ATCAGCACCT TCGGAAGGCG CAGCGAAAGG TGTTCGACTC GCTGTTGTCG AGGTCCGGAC AGAACCGATC GACGGAATAG
|
Protein sequence | MTSRSLTEAL RETLALFDEA GIPRTTTELA DDLELGRRST YERLERLVER GELETKRVGA SARVWWRPPS ARTDRPLGSG DWPVEAESLL AAALDRTDIG VFVLDESFNV AWLNETAERY FDLDRERVLG RNKRRLVETD IASAVDDATA FVDTVLATYD DNTYAERFEC RVTAEEEREP RWLEHRSEPI ETGAYAGGRI ELYHDVTDRK QRASRERELE QYERLVETVP DGVYAVDEDA RFVAVNDGFC ELTGYDRDEL LGAHATTVHD AEITTRAESL AEEIATGERG SATIELDLQT DGGATVPCES RLAPFPLGET DGRCGVVRDV SDRIERERTL RRRIRQQEVV ADLGQRALES RDLDELLADA ADLLTTTLET DYSKALVLDR TADELRLRQG SGWDPNVVGT TTVSAIEDES QAAYTLASGD PVLVDDLDSE SRFSGPDLLT DHDVTSGISV IIGSSEDPWG VLGVHDTDHR AFSEHDVAFV QSVANVLATA IDRNDREREL KRQHEQLAAL DSLNGVVRDI IDEAIDRSTR EEIERSVCEH LAATDSYLFA WIGDVDAADQ TVNVRTEAGG VAEYLEGIAI SVDPDDERSG GATGRAVRKR EIQTTQDIRD DSRYEPWRGH LEEYEIRSSA AIPIVHEETI YGVLSVYADR PNAFDGRERA VLGQLGDVIG HAIASTERKR ALMSDDVVEL DFRIRDLLDE LDLDVPSAGR ITLDHTVPIE DDEFLVYGSA TPDAIDGLEA IVETLPHWKA VTYRGGDAAT RFELRLSEPP VLSTVASLGG SVESAVIEGG DYRMTIHVAS GAKVRQVVNV VQDAYPTAEL LKHRQLERRQ DTTDRARHAL TAVLTDRQRS ALEAAYHAGY YEWPRDASGE EVAESLEIAP PTFNQHLRKA QRKVFDSLLS RSGQNRSTE
|
| |
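The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is a minimal illustration, not the pipeline used to produce the record: it assumes the gene sequence is already listed in coding orientation (it starts with ATG and ends with TAG, even though the gene lies on the minus strand), and it uses the standard amino-acid assignments, which translation table 11 shares with the standard code. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence in this record.
    print(translate("ATGACGTCTC GATCGCTGAC"))  # prints: MTSRSL
    # Last nine bases of the gene sequence in this record.
    print(translate("ACGGAATAG"))  # prints: TE*
```

The two fragments reproduce the first six residues (MTSRSL) and the last two residues (TE) of the protein sequence, followed by the stop codon.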