Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0662 |
Symbol | |
ID | 8741244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 689154 |
End bp | 691088 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646511240 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003402232 |
Protein GI | 284163953 |
COG category | [R] General function prediction only |
COG ID | [COG3413] Predicted DNA binding protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAACG CCGATGCGGC GGAGAACGAG GGTCAGGTCG CGCCCGCAGT CGCTGCGCTC GAGGCCGTCG TTGACCCCGT CGTCGCCGTC GTCGACGGGA CGATCACGTA CGCGAACGAC GCGGCGCTGA CGGCTTTCGA TCTGGCAGCA TCGACCGAGA GCGAGGCCGA CGACGAGGCC GGCGAGTGGG ACGCGGCGAG CGCGCTCGAC TCGTGGCCGC GACTCGAGAC GGCCGTCGAC GAGACGACCG TCGGGACGGT CCGCCGGGTG CCCCTCGAGG ACGAGACGTA CGACGCGCGC GTCCACCGGG ACGCGGCGAT GGCGACGATC ACGTTCGACC GCGAGCGCAC CGCGAGCAGC GAGTCGGCGG CGGAGACCGA CAGCGCGGCG CTCGGCGAGG GCGACCGGAC GGTCAAGGAT CGCGCGATCA ACGAGGCGCC GGTCGGGATC ACTATCTCCG ATCCAGATCT CGAGGACAAC CCGCTGGTCT ACGTCAACGA CGCCTACCAG GAGATCACCG GCTACGGCTA CGACGAGGTC GTCGGCCGGA ACTGCCGGTT CCTGCAGGGA GAGGACTCCC AGGAGGTGGC CATCGCCGAG ATGGCCGCGG CCATCGACGA GGAGCGGCCG GTCACCGTCG AACTGAAGAA CTACCGCAAG GACGGCACCG AGTTCTGGAA CGAAGTGACG ATCGCGCCCG TCCGCGACGA AGACGGGACG GTCACCCACT ACGTCGGCTT CCAGAACGAC GTGACGGCGC GCAAGGAGGC CGAACTCGCC CTCGAGCGCC GTACCGAGGA GCTCGACGAT CTCTTAGAGC GCGTGGAGGG GCTGATCCAG GATGTCACGG ACGTCGTCGC GGGCTCGACG GACCGGTCGG AACTCGAGGC CGCGGTCTGC GAGCGGATCG CCGCGGAGGC GGGCTACGAC GGCGCGTGGA TCGGCGAGCG AAACCCCGCG ACGGGGTCGA TCGACGTCCG AGCGAGCGCC GGCGCGTGCG ACGATCCGGA GGGTGAGCCG ATCGACGCTG ACCACCCCGC CGCCGCGGCG CTCGAGGAGC GCGCTGCTAC GACCGAGGCG GTCGAGGAGG GGACTCACGC TGCGTTCCCG CTGTCGTACA ACGGCATCGA GTACGGCGTG CTCACTGTCC GCACCGACCG GGACCGCGAA ATCGACGAGC GCGAGCGGGT GATTCTCTCA GCGCTGGCCC GCGCGGTCGC CAGCGGCGTC AACGCCCGCG AGACCAGCCG CGTGCTCGAG ACCGACGCCG TCGTCGCCGT CGAACTCACG CTGACCGATC GCTCGGTCGC GCCCGTCGCG CTCACCGCGG GAGCCGACTG CCGACTCGAG TACCGCCGCT CGGTCCACCG CACCGACGAC GAGACCGCGT CGCTGTGTAC CGTTACGGGC TCCGAGGCCA CCGCGGCCGA CCTCGTCGCA GCGGCCGACG CCGCCGAACT GGACTGCCGG GTCGTCCTCG AGCGCGAGGG GGAGTGTCTG GTCGAACTCG CCGGCGGCGA CGACCTCGTC GGCTGGCTCT CCGAGCGGGG CGTTCGCCTC CAGTCGATCG AGAGCGAGGA CGGGCGGGCC CGCGTCACGC TCGAGATTCC GCGCTCGGCC AACGTCCGTT CGATCGTGGA GGCCCTCGAG GACCGGTACG CCGGGACCGA CGTCATCTCG TTCCAGCAGC GCGAGCGCGA GGGCGAGACC CGCCAGGAGT TCGCGGCCCG CCTCGAGCGG GACCTGACCG AGCGCCAGTT CGCCGCGCTC CAGCGGGCGT ACCTGAGCGG CTACTTCGAG TGGCCGCGTC CGACGACGGG CGAGGATCTC GCCCAGTCGA TGGGTGTCTC CCGGCCGACG TTCCACGAAC ACCTTCGAAC CGCGGAAGCG AAGCTGTGTG GCGCGTTCTT CGGAGACACT GAGTCTTCGG GCTGA
|
Protein sequence | MENADAAENE GQVAPAVAAL EAVVDPVVAV VDGTITYAND AALTAFDLAA STESEADDEA GEWDAASALD SWPRLETAVD ETTVGTVRRV PLEDETYDAR VHRDAAMATI TFDRERTASS ESAAETDSAA LGEGDRTVKD RAINEAPVGI TISDPDLEDN PLVYVNDAYQ EITGYGYDEV VGRNCRFLQG EDSQEVAIAE MAAAIDEERP VTVELKNYRK DGTEFWNEVT IAPVRDEDGT VTHYVGFQND VTARKEAELA LERRTEELDD LLERVEGLIQ DVTDVVAGST DRSELEAAVC ERIAAEAGYD GAWIGERNPA TGSIDVRASA GACDDPEGEP IDADHPAAAA LEERAATTEA VEEGTHAAFP LSYNGIEYGV LTVRTDRDRE IDERERVILS ALARAVASGV NARETSRVLE TDAVVAVELT LTDRSVAPVA LTAGADCRLE YRRSVHRTDD ETASLCTVTG SEATAADLVA AADAAELDCR VVLEREGECL VELAGGDDLV GWLSERGVRL QSIESEDGRA RVTLEIPRSA NVRSIVEALE DRYAGTDVIS FQQREREGET RQEFAARLER DLTERQFAAL QRAYLSGYFE WPRPTTGEDL AQSMGVSRPT FHEHLRTAEA KLCGAFFGDT ESSG
|
| |