Gene Information |
Locus tag | Htur_4080 |
Symbol | |
ID | 8744708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 332436 |
End bp | 335681 |
Gene Length | 3246 bp |
Protein Length | 1081 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646514641 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003405588 |
Protein GI | 284167310 |
COG category | [R] General function prediction only |
COG ID | [COG3413] Predicted DNA binding protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| |
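The gene and protein lengths in the table follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon codes for one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(332436, 335681)
    print(length)                  # 3246
    print(protein_length(length))  # 1081
```

With the values from this record, the result matches the listed Gene Length (3246 bp) and Protein Length (1081 aa).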
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTCGT CCGTACCATC GCCTGCCACG GTCCAAGCGG TCTTCGACCA GCTCGGTCCC CCTGGGACCC CCTTCACGAC GCCGGAGATC GCCGCGGAGT TCGACTGTTC CGACCGAACG ATCTACAATC GACTCGACGC GCTCGTCGAC GAGGGCGTCA TTGAGACGAA GAAAGTCGGC GCTCGCGGAC GGGTGTGGTG GAGGCCCGTC GACGGCGACA TTCGACGGAA CGGCGGCGCT TTCAACGAAC GGAACCCCGT CTCATTCCGA GATGAGCAGG CACTATCCTT TCTCTCTGAT AGCGAGATGG CCGAACGCAT CCGCAAGTTC GAGTGGGCCA AAACGCCGCT TGGCCCGATG GACGGGTGGC CCCTAGAGTT GCGGGTCGCG GCCGACATCA TGCTGGGGGC GGACGAGGCC ATCGGTCTCT ACTGGGGAGA GGACCTGACA CTGCTGTACA ATGACGCCTG GCGGGAGTTG ATCGGTGACA AGCATCCGGA GGCGCTGGGG CGGCCTGCCC AAGAGGTGTT TCCCGAGATC TGGGAGACGA TCGAGCCGAT GTTCGCTGAC GTGCTGGATG GGAATGGAGT TGGGTTCGAA CGGGAACAGC GACTGTCGCT GGAACGTGAC GGCCAGATAG AGGACGCGTG GTTCGACTAC AGCGCCAATC CGATTCTGAT GGCCGACGGC TCCGTCGGTG GCGTCTTCAA CATCGCGAAC GAAATCACCG AGAGAAAAGA CGCCCAACAA ACCCTGCGCG ACAGAGAGGG GCGACTCAAC GCATTCGTCA CAAGTACCTC GGAAATCGTC TATCGAATGA GTCCGGACTG GTCAGAGATG TACTACTTGG ACGGCAAGGA TTTCATCGCC GACACGGAGG ATCCTCGGGA AACGTGGTTG AAGGAGTACA TTCCACCGGA CGAGCAAGAG CGGGTGATGG CCGCCATTGA GGAGGCCATC GAGACGAAAA GCATGTTCGA ACTGGAGCAC CAGGTTCACC AGGTCGATGG CACCCGAGGC TGGACGCACT CACGGGCAGT GCCGATACTG GACGACGACG GCGACATCGT CGAGTGGTTC GGGACGGCCA GTGACGTCAC CGAGCGAAGG CGCGCTGAAC AGGCTCTTCG TGAGTCAGAG CAACGCTATC AAAGGCTGTT CGACTCCGTC AACGAGTCCA TCGAAGAGGC CTTTTTCATC ATCGAGCGGG TGTCCGGCGA GGGCGAGACT GCCGCAAACG GGGACGACCC GGCAGCGTAT CGCTTCGTGG AAACGAACCC GGTGTTCGAG AAATCCATCG GTGTGACCGA CGTAGTCGGG AAACAAGTGA GCGATCTCGA TTTCGACGGC GACCTCCCCG GATCGGACGT GTGGGGAGAG GTGGTCCGGA CAGGCGAATC GAGACGCTTC GAGATCGAGA CCACTGGTGG TCCGCTCGCC GATGGCTGGT ACGATGTTCG CGTCTTCCCG TATGGCGGAG CGGACAGTCG AGCCGCTGCG TGTCTCGCCG ACGATATCAC AGACCAAAAG GAAGTCCAAC GGTCTCTTGA ACGGCTCACC GAGGCAAGCC GGGAACTGAT AGAGGCCGAT CCAGAGATGG TCCACGACCG CGTGGCTGAG CTCACGATAG CGGTGCTTGA CGTCGAGTAC GCTGCGCTCT GGCGGTATGA CGAGGCAAGC GGAGACTTGA TCGACGCCAT CAGCACTCTC GACACGGGGA TCGATGCCGA AACCGTCCGA CACCCGGATG ATGCCTCTGA GCACGTCTGG CAGGCGTTTA TCGACGATGA GACCGCCGTC 
ACAAACAACC TCCACGTCGA CGAGACCGTG CCGGACGCAG CGACCCTGCG GAGCCGCCTG CTCATTCCAC TGGGCAGACA TGGCGTTGTC TGTATCGGAT CCTTCCAGCC GAATGGCTTC GATGAGCGGA TAGTCGACCT TGCCGAAACT CTCGGGGCGA CGCTCGAAAC GGCGTGGGAC CGTGCCGACA GCGAACGCCA ATTGCAAGTG CAGAACGAAG AACTGCAACG CCTCGACCGT CTCAACACCC TCATACGAGA GATCGATCAG GCACTTGTGG GAGCCGATAC CCGCGAGGAG ATCGATGAGG CAGTCTGCGA ACGACTGGCC AGCTCCGACC TCTACGAGTT CGCGTGGCTC GGAGAGTACG ATCCGGGGAC CAACCGGATA GAACCGCGTG CGTGGGCTGG CGTCGACAGC GGCTACGTAG AGGAGCTCAC GATCACGGTC GAGGACACAC CGACCAACCG AGACCCAATC GCCCGTGCCC TCCGCACGGG CGACTTGCAG GTGGTTGCAG ACATCGCTAC GGATAGCGGC TTCGCCCCCT GGCGGGAAGC GACGCTCGAA CGCGGCGCCC GGTCGCTGGT CTGCATTCCA CTCGTTTACG ACGACGCGGC GTACGGCGTG TTGACAGTCT ACGCCGACCG TCCCCAGTCC GACGAGGACA AACGGAATCG GGACGTGCTG TCGGAACTCG GTGATACGAC CGCCCATGCG CTCAACGCGA GGGAGACGCG GGCGACGTTG CAGACCGACA GCGTCGTCGA ACTCACGCTC CGATTCGAGG ACGCTGACAC GCCGCTGTAT CGTCTCTCCC GGGAGACGGA GTATACCATC GAGCATCAGG GGTTCGTTCC CCGATCGAGC GGGCAGACCG ACGTCTTCTT CATCGTTCGT GAAATCTCGC CGGAAGATCT CCGGGCCACA GCAGAACGCT CGCTCGCGTT CGAGGACCTA GACTGTCTCA CCGAGAGGGC CGATGGAGCA CTGTTCAGGG CACGGGTGTC TGAGCCGACG CTCGCCGCAC GGGTCACTGA CGAGGGGGCT GTCGTGCGTT CGATTACCAT CGATTCCGGG GTTGCAACCG TTGTTCTCGA TATCCCCCAC ACGGCAGCGG TCCGCGAGTT CCTGAACCGA CTCCGCCAGT GGCATCCGGA ATTGGAGCTA CGCGCCCGCC AGTCGCGCGA ACGGCCACTG AAGACCCGGC AAACCTTCGT GACGGCGCTC GAGGACCGCC TGACGGATCG ACAGCGGGAA GTCCTGCAGA CGGCCTACCT GAGCGGCTTC TTCGAGATGC CACGGGTCAG TAACGGACAG GAGGTCACAC ACCTGGTCGG CGTCTCACAG CCGACGTTCT CTGAGCACCT GCGTGCCGCT GAACGTACCC TGTGTGAGGT CCTATTCGAG ACCGAACCGT ATGCCGAGGA TATCGTCTCG ACGTAG
|
Protein sequence | MGSSVPSPAT VQAVFDQLGP PGTPFTTPEI AAEFDCSDRT IYNRLDALVD EGVIETKKVG ARGRVWWRPV DGDIRRNGGA FNERNPVSFR DEQALSFLSD SEMAERIRKF EWAKTPLGPM DGWPLELRVA ADIMLGADEA IGLYWGEDLT LLYNDAWREL IGDKHPEALG RPAQEVFPEI WETIEPMFAD VLDGNGVGFE REQRLSLERD GQIEDAWFDY SANPILMADG SVGGVFNIAN EITERKDAQQ TLRDREGRLN AFVTSTSEIV YRMSPDWSEM YYLDGKDFIA DTEDPRETWL KEYIPPDEQE RVMAAIEEAI ETKSMFELEH QVHQVDGTRG WTHSRAVPIL DDDGDIVEWF GTASDVTERR RAEQALRESE QRYQRLFDSV NESIEEAFFI IERVSGEGET AANGDDPAAY RFVETNPVFE KSIGVTDVVG KQVSDLDFDG DLPGSDVWGE VVRTGESRRF EIETTGGPLA DGWYDVRVFP YGGADSRAAA CLADDITDQK EVQRSLERLT EASRELIEAD PEMVHDRVAE LTIAVLDVEY AALWRYDEAS GDLIDAISTL DTGIDAETVR HPDDASEHVW QAFIDDETAV TNNLHVDETV PDAATLRSRL LIPLGRHGVV CIGSFQPNGF DERIVDLAET LGATLETAWD RADSERQLQV QNEELQRLDR LNTLIREIDQ ALVGADTREE IDEAVCERLA SSDLYEFAWL GEYDPGTNRI EPRAWAGVDS GYVEELTITV EDTPTNRDPI ARALRTGDLQ VVADIATDSG FAPWREATLE RGARSLVCIP LVYDDAAYGV LTVYADRPQS DEDKRNRDVL SELGDTTAHA LNARETRATL QTDSVVELTL RFEDADTPLY RLSRETEYTI EHQGFVPRSS GQTDVFFIVR EISPEDLRAT AERSLAFEDL DCLTERADGA LFRARVSEPT LAARVTDEGA VVRSITIDSG VATVVLDIPH TAAVREFLNR LRQWHPELEL RARQSRERPL KTRQTFVTAL EDRLTDRQRE VLQTAYLSGF FEMPRVSNGQ EVTHLVGVSQ PTFSEHLRAA ERTLCEVLFE TEPYAEDIVS T
|
| |
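The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that layout, compute GC content, and translate a gene sequence into protein. It is a simplified illustration, not the pipeline that produced this record: the helper names are hypothetical, and the codon table is built by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its set of permitted start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    head = "ATGGGTTCGT CCGTACCATC GCCTGCCACG"
    print(translate(head))  # MGSSVPSPAT
```

The translated prefix agrees with the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the same check against the reported GC content and the complete protein.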