Gene Information |
Locus tag | Htur_5231 |
Symbol | |
ID | 8745779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | + |
Start bp | 126395 |
End bp | 129235 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646515588 |
Product | DEAD/H associated domain protein |
Protein accession | YP_003406535 |
Protein GI | 284176258 |
COG category | [R] General function prediction only |
COG ID | [COG1201] Lhr-like helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGG GCGACGTCGC GGCGTTTACG CACCTCGGAC CGACCGTTCG CGGGGCGCTG TCCGAACGCG GCTTCTCGAC GCCGACGGCA CCGCAGCGAC TGGCGATTCC GCCGCTGTCG GCCGGCCAGC ACACGCTCGT GATCGCACCG ACCGGGAGCG GCAAGACCGA GACGGCGATG TTGCCCGTTT TCGACCATCT GGTTGCGGAT GACGGACCGC CAGAGGGGTT CGGAGCGCTC TATATCACCC CGCTGCGGGC GCTCAACCGC GACATGCGCG AACGCCTCGA GTGGTGGGGC GAGTACTTAG ATCTCGAGGT GGACGTCCGC CACGGCGACA CCACGCAGTA TCAGCGGGGC AAACAGGCCG AGGATCCGCC GGACGTCCTG GTCACGACGC CGGAAACCGT CCAAGCGATG CTGACCGGCG AGCGTCTGCG CGAGGCGTTG CAGGACGTCT CCCACGTCGT GATCGACGAG GTGCACGAAC TCGCGGCCTC CAAGCGAGGG GCACAGCTGG CGATCGGCCT CGAGCGCCTG CACGATCTGG CGGGCGACAT GCAGCGGATC GGCCTCTCGG CCACGGTGGG CGATCCGGGG GAGGTCGGGC AGTTCCTGAC CGGCGGCCGG CCCTGCGAGA TCCGGGAGAT CGACGTCGGC AGCAACGTCG ACGTCGCCGT CCGTAATCCG GAGATCACTG CGGAGGACGA GCGGCTCGCG GGCGAACTGA TGACCGAGCC GGACACGGCC AGCCACGTCC GGCTGATCCG CGATATCGTC GCCGAGAACG AGTCGACCCT GATCTTCGTC AACACGCGCC AAACTGCGGA GGCGCTGGGA TCGCGGTTCA ACGAACTCGA CCTCCCGATC GGCGTCCACC ACGGCTCGCT CTCGAAGGAG GCCCGCATCG AGGTCGAGGA CCGGTTCAAA GCGGGCGAGC TCGACGGTCT GCTCTGTACG TCTTCGATGG AGCTGGGGAT CGACGTCGGC CGGGTCGACC ACGTCGTCCA GTACAAGAGC CCGCGGCAAG TGACGCGATT GCTGCAGCGA ATCGGACGAG CGGGGCACCG GCAGGACGAG GTCTCGAGCG GCACCATCGT CACGACCCGG CCGGACGACA CGTTCGAGGC GCTTTCGATC GCTCGCCGCG CCCGCGACGG CGAGGTCGAA CCGGCCGCGA TCCACGAGGG GAGCCTGGAC GTCGTGGCCA ACCAGCTTCC GGGGATCGTC CAGAGCCGCG GCGACACCCG CTTTCGGGAG GCTTACGAGA CGATCACGCG CGCGTATCCG TTCCGCGACG TGCCCGAGGA GACGGTCCGC GAGGTCGCCT CGGAGCTCCA CCGCAACCGA ATCATCTGGT TCGACGAGGG AGAGGACTGC CTCGAGACCA CCGGCGGCAC CTGGCAGTAC GTCTACTCGA ACCTCTCGAT GATCCCCGAC GAGGAGACCT ACGAGGTCCA CGACATCGCC TCGAGCCAGC AGATCGGGAC CCTAGACGAG CGGTTCGTGG TCAACTTCGC CCAGCCGGGC GAGGTCTTCA TCCAGCGCGG CGAGATGTGG CGCATCGCCG AGATCGACGA CGAAGAGGCC CGCGTGAAGG TGAGTCCGAT CGAGGATCCC GCCGGCGAGG TGCCGTCGTG GATCGGTCAG GAAATCCCCG TCCCCGCTGC GGTGGCGGGC GAAGTGGGGG AGATTCGGGC CGTCGCGGAG CCGCAGCTGT CGGCGGGGGC GGACGCCGCC GCGGTCGGCC GCGAGCTGGC CCATCGGTAT CCGGGCGACG AGTACACGCT GACCGAGGCC 
TGCGAGCAAC TCGAGCGTCA GGTCGACGCG GAGACGCCGA TGCCGACGGC GGATCGGCTC GTCCTCGAGC GACGGGGTCG GACCGTCGTC CTGAACGCGC CCTTCGGGCA CACGGCCAAC GAGACGCTCG GTCGCGTGCT CTCCTCGCTG CTCGGCCAGC AGGCGGGCTC TTCCGTGGGG CTGGAGACCG ACCCCTACCG GATCGAACTC GAGGTGCCGA ACTCGGTCGC GACCAGCGAC ATTCTGGCGG TGCTCGAGGA GACCGATCCC GGCCACGTCG AGGCGATCGT CGAACTCGGA CTGAAGAACT CTGACGCGCT GGCCTTCCGG CTGGCGCAGG TCTCGGCGAA GTTCGGCGCG CTCAAGCGCT GGCAGAGCAA CGGTTCCGGT CGGCTCTCGA ACGAACGACT GCTCGCGGCC TTGGAGGACA CGCCGATGTA CCAGGAGTCG ATCCGCGAGG TGTTCCACGA AGATCTGGAC GTCGAGCGGG CGAGCGCGGT CCTCGAGGGG ATCCAGTCGG GCGAGATCGA ACTGGTGACC CACCGCGGTC GGACGCCGGT CGGGCAGGGC GGCCGCTCGT CGGGCGGGAA GGAGCTGCTG GCGCCGGAGA ACGCCGACGC GAGCGTCATC AACACGGTCC GCGAGCGCCT GCAGAACGAC CGGATCATCC TGCTGTGTAC CCACTGCAAG GAGTGGAAGG CGAAGACGAA GGTCCGCCGC GTGGCCGACC AGCCCGAGTG CGGCGAGTGC GGCTCGACCC GGATCGCGGC GCTGAACCCG TGGGCCGACG AGGTCGTCGA CGCGGTTCGC GCCGCGGAGA AAGACGAGGA ACAGGAGAAG ATGACCGAGC GCGCCTATCG GAGCGCGAGT CTCGTCCAGA GCCACGGGAG GCAGGCCGTC ATCGCGATGG CGGCCCGCGG CGTCGGTCCG CACAACGCCG CCCAGATCAT CAACAAACTC CGAGAGGACG AGGACGAGTT CTACCGCGAC ATCCTCTCGA AGGAACGCGA GTACGCGCGC ACGCAGTCGT TCTGGGACTG A
|
Protein sequence | MSEGDVAAFT HLGPTVRGAL SERGFSTPTA PQRLAIPPLS AGQHTLVIAP TGSGKTETAM LPVFDHLVAD DGPPEGFGAL YITPLRALNR DMRERLEWWG EYLDLEVDVR HGDTTQYQRG KQAEDPPDVL VTTPETVQAM LTGERLREAL QDVSHVVIDE VHELAASKRG AQLAIGLERL HDLAGDMQRI GLSATVGDPG EVGQFLTGGR PCEIREIDVG SNVDVAVRNP EITAEDERLA GELMTEPDTA SHVRLIRDIV AENESTLIFV NTRQTAEALG SRFNELDLPI GVHHGSLSKE ARIEVEDRFK AGELDGLLCT SSMELGIDVG RVDHVVQYKS PRQVTRLLQR IGRAGHRQDE VSSGTIVTTR PDDTFEALSI ARRARDGEVE PAAIHEGSLD VVANQLPGIV QSRGDTRFRE AYETITRAYP FRDVPEETVR EVASELHRNR IIWFDEGEDC LETTGGTWQY VYSNLSMIPD EETYEVHDIA SSQQIGTLDE RFVVNFAQPG EVFIQRGEMW RIAEIDDEEA RVKVSPIEDP AGEVPSWIGQ EIPVPAAVAG EVGEIRAVAE PQLSAGADAA AVGRELAHRY PGDEYTLTEA CEQLERQVDA ETPMPTADRL VLERRGRTVV LNAPFGHTAN ETLGRVLSSL LGQQAGSSVG LETDPYRIEL EVPNSVATSD ILAVLEETDP GHVEAIVELG LKNSDALAFR LAQVSAKFGA LKRWQSNGSG RLSNERLLAA LEDTPMYQES IREVFHEDLD VERASAVLEG IQSGEIELVT HRGRTPVGQG GRSSGGKELL APENADASVI NTVRERLQND RIILLCTHCK EWKAKTKVRR VADQPECGEC GSTRIAALNP WADEVVDAVR AAEKDEEQEK MTERAYRSAS LVQSHGRQAV IAMAARGVGP HNAAQIINKL REDEDEFYRD ILSKEREYAR TQSFWD
|
| |
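The coordinate and length fields under Gene Information can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene ends in a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the final codon is a stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record: Start bp 126395, End bp 129235.
gene_len, prot_len = cds_lengths(126395, 129235)
print(gene_len, prot_len)  # 2841 946
```

Under those assumptions the result agrees with the listed Gene Length (2841 bp) and Protein Length (946 aa), and the listed gene sequence does end in the stop codon TGA.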