Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2129 |
Symbol | |
ID | 8384423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2167117 |
End bp | 2170083 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644973198 |
Product | DEAD/DEAH box helicase domain protein |
Protein accession | YP_003131029 |
Protein GI | 257053196 |
COG category | [R] General function prediction only |
COG ID | [COG1204] Superfamily II helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGTTG CGGACATTCC GGGGGTACCC GCGTGGTTGC CCGAGCACCT GCAGGCGGAG GGCATCGAGG AGTTGTATCC ACCACAGGCC GAGGCCGTCG AAGGTGGCGT CACCGACGGG GCGAACCTCG TCGCGAGCGT GCCCACGGCC AGCGGCAAGA CACTCATCGC CGAACTCGCG ATGTTCTCGG CGATCACCGA CGGGGAAGGC GATGAGGCCG GCGACGACGA AGGTGTAGAC CACGCGGCGG AAACTGACGG CACCGCGCTC TACATCGTGC CGCTCCGGGC GCTCGCCAGC GAGAAACGCA CCGAGTTCGA ACAGTTTGAG GCCTACGGCC TGGAGGTCGG CGTCTCGACA GGTAACTACG AGAGCGACGG CGGATGGCTC GGTGACAAGG ATGTCGTGGT CGCCACCAGC GAGAAGGTCG ATTCCCTCGT GCGAAACGAT GCGCCATGGA TCGACGACCT CGATTGCGTC GTGGCCGACG AGGTCCACCT CGTCGACGAC GGCGAGCGCG GCCCGACCCT GGAGGTCACG CTCGCCAAAC TTCGAAAGCG CAATCCGGAT CTCCAGACGG TCGCGCTGTC GGCGACGATC GGCAACGCCG ACGCGCTCGC CGAGTGGCTG GACGCCGAAC TCGTCGACTC GACCTGGCGG CCCATCGACC TCAAAAAGGG CGTCCACTAC GGGCAGGCGT TGCAATTTGA AGACGGCAGC CAGCAAGAAC TCCGAGTCCA CGATAATGAG AAGGCAACTG CTGCCATCCT CAGGGACACT TTATCTAGTG AAATTCAGGA TCTAGCACAG GTACCCAAAA ATAAGAGAAA TCTGATAAAA GAAACTCTTG ATGGAACTTT TGACCCCCAA GTGGAATATC AACAGTCAAT AAAGGAAGAG ATACAAAATC TGTGTATCGA TCCTCTTCAA TATGATATTG GTGAAGAGTT TGGAATATTT GGCATCGCGA TAACGCACGA TATTGGCTCT GCAGAAGATA TTGCTGTTCG ACTTTCGGAA GATACGACGG CTTTCGTCTC ATCCTCTGAG AGAGACACAC TCAACGAGGT TGCAAAAGAA GTCCAGGCCG CAGGCAATTC ACGCTCTGTA CAAAAGTTAT CTGAACTAAT TCCGACTGGG GTTGGATTCT ATCACGGAAA ATTACCGGAA AAGTGTCGTC GTCGAGTCAA AGAATTAGCT AACGACGGGT TGTTAAAATC TATAGTCTCG ACCACTTCGC TTGATCTTGA TAACGTTGAT AACACCAACA CATTCCATGA GGGTGGCTCT TTGCTTGTAT TTGTCAACTC TCGGAAAAAC GCCGAAGCCG CGGCTCGACG GCTCGCATCA ACGACCGATC CCCTCCTTTC CGGCGACGAA CGCGACCGTC TCGCCGAGAT CGCGGCGGAG ATCCGCGATG TCAGCGACAC CGAGACCAGC GACGATCTGG CCGACGCTGT CGAGGGTGGC GCGGCCTTCC ATCACGCCGG GCTCTCCCGG GAACACCGCT CGCTCGTCGA GGAGGCGTTC CAGGAGCGTC TCGTCAAAGT GATTTCGGCG ACCCCGACCC TGGCGGCGGG CGTCAACACC CCCTCCCGGC GGGTCGTCGT CCGGGACTGG CGGCGCTACG ACGGGACGGC GGGCGGGATG CAGCCCCTCT CGGTGCTGGA GGTCCATCAG ATGATGGGGC GGGCGGGCCG ACCCGGGCTG GACCCCTACG GCGAGGCGCT GCTGCTGGCG AGCAGCCACG ACGAACTCGA CGAACTGTTC GAGCGCTACG TCTGGGCCGA CCCCGAACCG GTCCAGTCGA AACTCGCCGC CGAACCCGCG CTCCGGACGC ACATCCTCTC GACGGTCGCC TCCGGGTTCG CCAACTCCCG GGCGGGCTTG CTCGACTTTT TGGAGGCCAC GCTCTATGCC AGCCAGACGA CCGAGGGCGG CCGCCTGGAG ACGGTCGTCG ACGAGGTGAT CGCCTACCTC GAGGCCAACG ACTTCCTCAC GCGCGAAGAC GATCCGGACG GCACCCTTCG GGCCACATCC ATCGGCCAGA CTGTCTCGCG ACTCTACCTC GACCCGATGA GCGCGGCGGA GATGCTCGAC GGCTTGCGCG AGTTCGAGCG GACGGCCGGG GAGCGATCGA CTCGATCACC CCACGACGGG GAGCGGGACG ACGAGCCGCC GGGCTTCGAG CCGGCCAGCG AACTGGTGTC GGACGCGGGC GACGACATCG GTGAGTCCGA CGACACTGAC GACACCCCCC AGCCGACCGC GATGGGGCTG TACCACCTCG TCTCCCGGAC GCCGGACATG TACGAACTCT ACCTCCGGTC GGGTGACGAG GAGGAGTACT CCATGGAAGC CTACGAGCGC GAAGCGGAGT TCCTCGGCGC GATGCCCAGC GAGTTCGAGG AGGGTCGCTT CGAGGACTGG CTGTCGGCGC TGAAGACCGC CCGACTCCTC GAAGACTGGG CCGACGAGGT CGAGGAGGGG ACGATCACCG AGCGCTACGG CGTCGGGCCG GGCGACATCC GCGGGAAGGT CGAGACGGCG TCCTGGCTGC TCAACGCCGC CGAGCGACTC GCCGGTGAAG TTGGCCTGGA CGTGACGCCG GCGATCAGGG AGGCTCGCGT GCGCGTCGAA CACGGCGTCC GGGCGGAACT GGTCGACCTC GCGGGCGTCC GCGGCGTCGG TCGGAAACGT GCCCGCCGAC TGTTCGCCGC CGGCATCGAG TCTCGCGAGG ACCTCCGAGA GGCCGACAAG GCGGTCGTGT TAGGGGCGCT TCGTGGCCGG GAGAAGACCG CTGAAAACAT CCTGACGAAC GCCGGCCATC GCGACCCCTC GATGGATGGG GTAACGCCGG CGGCGGGGAG TGACGAACTC ACAGCCGCGA ACGGCGCTGG ATCCGGAGAC CGGGACGGTG CCGAAGACGA GCAACCGGCC GATCAGTCCA GCCTGGGTGA TTTCTGA
|
Protein sequence | MDVADIPGVP AWLPEHLQAE GIEELYPPQA EAVEGGVTDG ANLVASVPTA SGKTLIAELA MFSAITDGEG DEAGDDEGVD HAAETDGTAL YIVPLRALAS EKRTEFEQFE AYGLEVGVST GNYESDGGWL GDKDVVVATS EKVDSLVRND APWIDDLDCV VADEVHLVDD GERGPTLEVT LAKLRKRNPD LQTVALSATI GNADALAEWL DAELVDSTWR PIDLKKGVHY GQALQFEDGS QQELRVHDNE KATAAILRDT LSSEIQDLAQ VPKNKRNLIK ETLDGTFDPQ VEYQQSIKEE IQNLCIDPLQ YDIGEEFGIF GIAITHDIGS AEDIAVRLSE DTTAFVSSSE RDTLNEVAKE VQAAGNSRSV QKLSELIPTG VGFYHGKLPE KCRRRVKELA NDGLLKSIVS TTSLDLDNVD NTNTFHEGGS LLVFVNSRKN AEAAARRLAS TTDPLLSGDE RDRLAEIAAE IRDVSDTETS DDLADAVEGG AAFHHAGLSR EHRSLVEEAF QERLVKVISA TPTLAAGVNT PSRRVVVRDW RRYDGTAGGM QPLSVLEVHQ MMGRAGRPGL DPYGEALLLA SSHDELDELF ERYVWADPEP VQSKLAAEPA LRTHILSTVA SGFANSRAGL LDFLEATLYA SQTTEGGRLE TVVDEVIAYL EANDFLTRED DPDGTLRATS IGQTVSRLYL DPMSAAEMLD GLREFERTAG ERSTRSPHDG ERDDEPPGFE PASELVSDAG DDIGESDDTD DTPQPTAMGL YHLVSRTPDM YELYLRSGDE EEYSMEAYER EAEFLGAMPS EFEEGRFEDW LSALKTARLL EDWADEVEEG TITERYGVGP GDIRGKVETA SWLLNAAERL AGEVGLDVTP AIREARVRVE HGVRAELVDL AGVRGVGRKR ARRLFAAGIE SREDLREADK AVVLGALRGR EKTAENILTN AGHRDPSMDG VTPAAGSDEL TAANGAGSGD RDGAEDEQPA DQSSLGDF
|
| |