Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2813 |
Symbol | |
ID | 8743429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2885018 |
End bp | 2887993 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646513400 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003404358 |
Protein GI | 284166079 |
COG category | [R] General function prediction only |
COG ID | [COG3413] Predicted DNA binding protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTACCG CAGGAATCGA CGACGGCGGA ACCGTACTCG TCGTCGAACC GTCCGGAGCG GACCCGGCGT CGCTCCTCGA CTCGTTCGAC TCGCGACGGA GCGAGGAGAG GGTCTCGAGC GCGACGGCGG CCCTCGAGGC GCTCGAGACG GAGTCGGTCG ACTGCGTCCT GACGCCCCAG TCGGTGGCCG ACGCCACCGG CCTCGAACTC GTCGAACGGA TCCGCGACCG CGATCCGGAC GTTCCGATCG TCCTCGCGCC CCGCGAGGGC AATGGCGGCG AGAGCCTCGC CGGCGACGCG ATCGCCGCGG GCGTCACGGA GTACGTCCCG GCCGATCGCG ACGCCTCGAC CCTCGAGACG GCCGTCGAAC GGGCGATCGA GGCGGGCCGG GGTCAACGCG AGCGCCGCGA GCGTGCCCGC CAGTTCGGCG CGGTGTTCGA CGACCCCGAG ACGTACGCGT GGGTGCTGGA GATCGACGGG CGAGTGCGTC GGGCTAACGA GCGCGCGCTC GAGGCGCGCG GAACGCTCGA GGCGGCCGGG ACCGACGTTC GCGGGGAGTC GTTCACGGAT CTCCCGCTGT GGAACCGCCT CGAGGAGGAC CGATCGACGA TCCGGACCGC AGTCGACCGA GCGGCGAACG GGACGGTCGT CCACCGCGAG GTGACCCTCG AACTCGACCG CCGGGCGCGG ACGAACGGTG CGGGTGCGGA CGCCGGCGTC GACACCGAAA CCGGGTCCGG CGACGGTGCG ATCGACGACC CGCACACGCT CGAGGTGACG GTCCGACCCG TTCGCGACGA GTCGGGGACC GTCGTCTCCC TGCTCGCGCG GGCGAACGAC GTCACCGAAC GGGCCCGACT CGAGCGGGAG CTGCGCGAGT CGGAGGAGCT CCACCGGGTG ACGCTGAACA ACATGACCGA CACCGTCCTC ATCACGGACG ACGAGGGCGC GTTCACCTAC GTCTGTCCGA ACGTCCACTT CATCTTCGGC TACACCGACG AGGAGATCTA CGAGATGGGG ACGATCGACG AGCTCCTGGG GTCGGACCTC TTCGACCGCG AGGAGCTGGA GCGGGAGGGC GTGCTCACCA ACATCGAGTG TACGGCCACT GACCGGGCCG GCCGCGAGCA CGCGTTGCTG GTCAATGCCC GCGAAGTATC GATCCAGGGC GGGACGACCC TCTACAGCTG TCGCGACGTC ACGACGCGCA AACGTCGCGA GGAGGCCCTG ACCGCGCTCC ACCGGACCGC ACGCGAACTG CTTTACGCCG AGAGCGACCG CGAGATCGCC GACCTCGTGG CGGCCGACGC CGCCGACGTG CTCGGCCTCG AGGCCAGTGG CGTCTACCTC TTCGACGACG AGGAAAACGT CCTCGAGCCG GTCGCCGCCT CGCCCGGGAT GGATCGCCTC CACGGGCCGC TCTCGCGGCA TTCGGTCGGC GAGGACAGCG TCTCGGGACG GGCCTTCGTT GACGGCGAGT CCCGATTCTT CGCGGACGTC CGCGACGCCG AGGCGCTGGC GGATCCGGCG ACCGATATCA GGGGCGCAGC GGTCGTCCCG CTGGGCGATC ACGGCGTCTT CCTCGCGGGC TCGTCGGAGC GCGACGCGTT CGACGACGTC GACCGCGAAC TGACCGATCT GCTGGCGGCG ACGGCCGAGG CCGCCCTCGA CCGCGTCGAA CGAGAGCGGT CCCTCCGCGA GCGCGACCGG GAACTCAAGC GTCAGAACCG ACGGCTCACG CGGCTCGATC GGATCAACGA GATCATCCGC GAGATCGACG CGGCGCTGGT CCGGGCCGAG ACCCGCGAGG AGATCGAGAC CGCCGTCTGC GAGCGACTGA CCGCCGCCGA CCGGTTCGCG TTCGCCTGGA TCGGGACGAC CGACGCGCCG GGCGAGCGAC TCGAGCCGAG CACCCACGAC GGGACCGGCC GCGGCCGGGA CTACCTCGAC GGCGTCTCGC TCTCGCTTTC GGAGGCGACC GAACCCGCGG CCCGCGCGGC GGCCGACGGC GAGGTGACGG TCGTCTCCAA CGTCGCCGAC CGACTCCGCG AGGAGTCGTG GCGCTCGGCG GCGCTCTCGC GGGAGTACCA GTCCGTCGCG GGCGTGCCGC TGGCCTACGA CGAGTTCACC TACGGCGTGC TGGCCGTCTA CGCGGACCGC CCCGACGCCT TCGACGAGGT CACGCGGAGC GTCCTCGCGG AACTGGGCGA GACGATCGCC TCCGCCATCG CCGCCGTCGA GCGCAAACGG GCGCTGCTGA CGGACTCCCG GACCCGCCTC GAGTTCGACG TCGGCGACGA GAGCTTCGTC TTCTCCCGGT TGGCCCGGCG AGCCGACTGC GTCCTCTCGT TCGACGGCGG CGTCCGCCTC CACGAGGACG GCGCCGCGGT GTTCGCGACC GTCGAGGGCG CGTCGGGCGA GGCCGTCGCG GACGCGGCCG CCGATCTCGT CGCCGTCGAG AACGCTCGAG CGGTCGGCGC CGGCGACGAC GAGGCCGAGG CCACGGACGG TCGCGGCGGG ACGGTCCTGC TCGAACTGGC GCCGCCGTTT CTGGCGCTGC GCCTCGCGGA TCACGGCGTC GTGCTCCGCA GCGTGGAGGC GTCCCCCGAC GGTGCGCGCA TCGTCGTCGA CGTGCCGCCG ACCGTCGACG CGGGCAACAG CGTCGACGTC GTCTCGAACG CGCTCGACGA CGTCGAACTC CGCGCCAAGC GGACCGTCGA CAGAACGACC GCGCGCGACC TCCGAAGCGA ACTGCGCGAG CGACTGACCG AGCGACAGCT CGAGGTCGTC CAACTGGCCT ACTACGGGGG CTACTTCGAG TCGCCCCGGG AGCAGTCAGG CGAGGAAATG GCCGACGCGC TCGGCATCTC CTCGGCCGCG TTCTATCGCC ACGTGCGGGC GGTCCAGCGG AAACTCTTCG TCCTCCTGTT CGACGAGCTC GGACTTCCGG CAAACGCTGC ACTGGGGGTT GAATAG
|
Protein sequence | MVTAGIDDGG TVLVVEPSGA DPASLLDSFD SRRSEERVSS ATAALEALET ESVDCVLTPQ SVADATGLEL VERIRDRDPD VPIVLAPREG NGGESLAGDA IAAGVTEYVP ADRDASTLET AVERAIEAGR GQRERRERAR QFGAVFDDPE TYAWVLEIDG RVRRANERAL EARGTLEAAG TDVRGESFTD LPLWNRLEED RSTIRTAVDR AANGTVVHRE VTLELDRRAR TNGAGADAGV DTETGSGDGA IDDPHTLEVT VRPVRDESGT VVSLLARAND VTERARLERE LRESEELHRV TLNNMTDTVL ITDDEGAFTY VCPNVHFIFG YTDEEIYEMG TIDELLGSDL FDREELEREG VLTNIECTAT DRAGREHALL VNAREVSIQG GTTLYSCRDV TTRKRREEAL TALHRTAREL LYAESDREIA DLVAADAADV LGLEASGVYL FDDEENVLEP VAASPGMDRL HGPLSRHSVG EDSVSGRAFV DGESRFFADV RDAEALADPA TDIRGAAVVP LGDHGVFLAG SSERDAFDDV DRELTDLLAA TAEAALDRVE RERSLRERDR ELKRQNRRLT RLDRINEIIR EIDAALVRAE TREEIETAVC ERLTAADRFA FAWIGTTDAP GERLEPSTHD GTGRGRDYLD GVSLSLSEAT EPAARAAADG EVTVVSNVAD RLREESWRSA ALSREYQSVA GVPLAYDEFT YGVLAVYADR PDAFDEVTRS VLAELGETIA SAIAAVERKR ALLTDSRTRL EFDVGDESFV FSRLARRADC VLSFDGGVRL HEDGAAVFAT VEGASGEAVA DAAADLVAVE NARAVGAGDD EAEATDGRGG TVLLELAPPF LALRLADHGV VLRSVEASPD GARIVVDVPP TVDAGNSVDV VSNALDDVEL RAKRTVDRTT ARDLRSELRE RLTERQLEVV QLAYYGGYFE SPREQSGEEM ADALGISSAA FYRHVRAVQR KLFVLLFDEL GLPANAALGV E
|
| |