Gene Information |
Locus tag | Huta_0549 |
Symbol | |
ID | 8382816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 553839 |
End bp | 556670 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644971611 |
Product | DEAD/H associated domain protein |
Protein accession | YP_003129469 |
Protein GI | 257051636 |
COG category | [R] General function prediction only |
COG ID | [COG1201] Lhr-like helicases |
TIGRFAM ID | |
| |
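The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one stop codon at the end.
    """
    gene_len = end_bp - start_bp + 1          # inclusive interval
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above (Start bp, End bp).
gene_len, protein_len = feature_lengths(553839, 556670)
print(gene_len, protein_len)  # 2832 943
```

The result matches the reported Gene Length (2832 bp) and Protein Length (943 aa).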
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0909316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGG CGACGCGCGA GGGTGACGCC GTCTTCGCCC ACCTCGGCGA CGCCGTCCGG TCGGCACTGT CCGACCGCGG CTTTACGACG CCCACCGAGC CACAGCGTCG CGCGATCCCG CCGATCGCCG ACGGTGAGGA CGCGCTGGTG ATCGCCCCGA CCGGCAGCGG CAAGACCGAG ACTGCGATGC TCCCGGTCTT CGACGCGATC GAAGGCGATC CACCCGACGG CATCGCCGCG CTGTACGTCA CGCCGCTGCG GGCACTCAAT CGCGACATGC GGGATCGCCT GGAGTGGTGG GGTGAGACAC TTGACCTCGA GATCGACGTC CGCCACGGCG ACACGACCCA GTATCGTCGC CAGCAACAGG CCGAGGATCC GCCGGACGTT CTGGTCACCA CGCCCGAGAC CGTGCAGGCG ATCCTGACCG GTGAGAAGCT TCGGGAGGGA CTCGCAGACG TCGCCCACGT CGTCGTCGAC GAGGTCCACG AACTCGCGGC CGCAAAACGC GGCGCGCAGT TGACCGTCGG CCTCGAACGG CTCCGGGACC TCGCCGGCCC GTTCCAGCGG ATCGGGCTTT CGGCGACCGT CGGCGATCCC GCGGAAGTCG GGCGGTTTCT CACGGGCGAT CGGGGATGTT CGATCGTCGA GATCGACGCC GGCAGCGACA TCGAGATCGA CGTCGTCCGC CCTTCGATCG AAGAGCGCGA CGAGAACGTC GCCGGCGAGA TCGTCACCGA TGCCGAGGTC GCCAGTCACG TCAGGCAGAT CGACGAACTG ATCGAAACCC ACGAGTCGAC GCTGGTGTTT GTCAACACCC GACAGACGGC CGAAGCGCTG GGCTCGCGAC TCAAGGAATA CGGCACGGAC GTTGGCATCC ACCACGGGTC GCTGGCCTCG GACACTCGCG TCGAGGTCGA GGACGCGTTC AAGGCGGGCG ACCTGGACGC GTTGCTGTGT ACCTCCTCGA TGGAGCTCGG GATCGACGTC GGGCGTGTCG ATCACGTCGT CCAGTACAAC AGCCCCCGGC AGGTCTCCCG GTTGCTCCAG CGCATCGGGC GGGCGGGCCA TCGCCGGGAT CGAACGTCCG CAGGGACGAT CGTCACGACG AGCCCCGACG AGACCTTCGA GGCGCTCGCC ATTGCGCGAA TGGCCCGGGC CGGCGAGGTC GAACCGGCAG CGATCCACCA CGGGAGTCGG GACACCGTGG CCAATCAGAT CGCGGGCGTC GTCATGGACC TGGGAGAGGT CGACGCCCGC CGCGCCTACG AGATCGTCAC GCGGGCGTAC CCCTTTGGCG ATATTTCCGA AGCTGTCTTC CGGGAGATCG TGCGTGAACT CGCCGACAAT CGCGTCATTT GGCTCGAGGA AGACGACGAC GAGCTCTCGA AACGCCGGGG GACCTGGCAA TACTTCTATC ACAACCTCTC GATGATCCCC GACGAGGCGA CCTACGACGT CGCCGACGGC ACGAGCGGCC GGCAGGTCGG GACGCTGGAC GAACGCTTCG TCGTCAACTT CGCCGCGCCG GGGGAGGTGT TCATCCAGGC CGGCGAGATG TGGCGGATCA CCGAGATCGA CGAGGACGAG GAGACGGTCC ACGTCACGCC GGTCGGCGAC CCGGCCGGCG AAGTTCCCTC CTGGACCGGC CAGGAGATCC CGGTGCCCTA CGACGTAGCG CAGGAAGTCG CCGAGCTCCG GGGGGTCGCG GCCGAGCAAC TCCGGGCTGG GGGCGAGCCC GAAAGCGTCG CCCGGCACCT TGCTGGCAGA TATGATGCCG CCGAGGCGAC GATCCGCTCC 
GCGCTCGAAC AGATCGAATA TCACGAGGGA CCGGTGCCCG GCCCGAATCG GATCGTCGTC GAGTTCGAGG GGCGGGACGT CGTGCTTAAT GCTGCCTTCG GCCACAAGGT CAACGAGACA CTCGGTCGAC TCCTCTCGGC ACTGCTCGGC CAGCGTACGG GCTCGTCGGT CGGCCTCGAC GTCGACCCGT ACCGGATTTC CCTGGAGGTG CCACGCAACG TCACCGCCGG CGACGTCGTC GACGTACTCG AATCGACCGA TCCCGCTCAC GTCGAGGGGT TGATCGAACT CAGTCTGAAG AACGCCGGCG CGCTGAAGTT CCGACTCGCG CAGGTCGCCA CCAAGTTCGG CGCGCTCAAG AACTGGCGTG GGCGAGGGAG CAACCGCTTC GGACGGGATC GGCTGCTCGA AGCACTCGAG GGGACGCCGA TCTACGACGA AGCTCTCCGG GAGTTGATCC ACGAGAAACT CGACATCGAC CGGGCGAGCG AGTTACTTCG GGAGATCCAG TCGGGCGACA TCGTTGTCGA GACTGTCGGC GGCCGGACGG CGATCGGTCG CGGCGGCCGC TCGGGCAGCA AGGAACTCCT CGCCCCCGAG AACGCCGACG CGAGCGTGAT CCAGACGCTG AAAGACCGGA TCCAGGAGGA CCGCGTCATC CTGTTCTGCG TTCACTGTGA GTCCTACAAA ACCACCAAAC CGGTCAAGCG AGTGCGCGAC CAGCCGAAAT GCCCGGAGTG TGGCTCGACC CGGATCGCGG CATTGAACCC CTGGGACGAG GAGACCGTCA CGGCAGTCAA GACCGACGAG AAAGATGACG AGCAGGAACG ACGGACGAAA CGCGGCTACC AGGCCGCGGA TCTCGTCCAG AGCCACGGCA AACAGGCCAT CATCGCGCTG GCAGGGCGCG GCGTCGGGCC GACGAACGCG GCCCGCATCA TCAACAAACT CCGGGAGAAC GAGGACGATT TCTACCGGGA TATCCTCGCC CGGGAACGCG AGTACGCCCG GACGCGGTCG TTCTGGGAGT GA
|
Protein sequence | MSEATREGDA VFAHLGDAVR SALSDRGFTT PTEPQRRAIP PIADGEDALV IAPTGSGKTE TAMLPVFDAI EGDPPDGIAA LYVTPLRALN RDMRDRLEWW GETLDLEIDV RHGDTTQYRR QQQAEDPPDV LVTTPETVQA ILTGEKLREG LADVAHVVVD EVHELAAAKR GAQLTVGLER LRDLAGPFQR IGLSATVGDP AEVGRFLTGD RGCSIVEIDA GSDIEIDVVR PSIEERDENV AGEIVTDAEV ASHVRQIDEL IETHESTLVF VNTRQTAEAL GSRLKEYGTD VGIHHGSLAS DTRVEVEDAF KAGDLDALLC TSSMELGIDV GRVDHVVQYN SPRQVSRLLQ RIGRAGHRRD RTSAGTIVTT SPDETFEALA IARMARAGEV EPAAIHHGSR DTVANQIAGV VMDLGEVDAR RAYEIVTRAY PFGDISEAVF REIVRELADN RVIWLEEDDD ELSKRRGTWQ YFYHNLSMIP DEATYDVADG TSGRQVGTLD ERFVVNFAAP GEVFIQAGEM WRITEIDEDE ETVHVTPVGD PAGEVPSWTG QEIPVPYDVA QEVAELRGVA AEQLRAGGEP ESVARHLAGR YDAAEATIRS ALEQIEYHEG PVPGPNRIVV EFEGRDVVLN AAFGHKVNET LGRLLSALLG QRTGSSVGLD VDPYRISLEV PRNVTAGDVV DVLESTDPAH VEGLIELSLK NAGALKFRLA QVATKFGALK NWRGRGSNRF GRDRLLEALE GTPIYDEALR ELIHEKLDID RASELLREIQ SGDIVVETVG GRTAIGRGGR SGSKELLAPE NADASVIQTL KDRIQEDRVI LFCVHCESYK TTKPVKRVRD QPKCPECGST RIAALNPWDE ETVTAVKTDE KDDEQERRTK RGYQAADLVQ SHGKQAIIAL AGRGVGPTNA ARIINKLREN EDDFYRDILA REREYARTRS FWE
|
| |
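The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that grouping, compute GC content, and translate the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard NCBI amino acid string, which translation table 11 shares with table 1 for amino acid assignments; the sketch does not handle alternative start codons, which is harmless here because the gene begins with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with the standard amino acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
fragment = clean("ATGAGCGAGG CGACGCGCGA GGGTGACGCC")
print(translate(fragment))  # MSEATREGDA
print(round(gc_fraction(fragment), 3))
```

Running `translate` on the full cleaned gene sequence should reproduce the protein sequence listed in the record once its spacing is removed with `clean` as well.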