Gene Information |
Locus tag | Huta_1920 |
Symbol | |
ID | 8384211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1940774 |
End bp | 1943689 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644972988 |
Product | helicase domain protein |
Protein accession | YP_003130822 |
Protein GI | 257052989 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACA CGACATTCTC TTCCGGCCAA CAAGTGATTC TCAACGGCAC GTCCGCGGAG GTCATCCAGA CGCGGACAGT TGGTGACATC GAGTATCTTC GAGCATACAT CGACGGCGAG GGCGTCAAGA CGGTCTGTCT GGACGACGTC GATATCCAGC CACACCAAAC CGGGCTGGAG ACCCTCTCCG GGCAGCAACT CGATGATCTT CATCCGGATC ACGAGGCCGT ATCTGCCCAG TGGTTCGACC TCCACACGCA GGCGACGAAG CTGAAGCTCG CCCACGAGCA GGGACAGTTG CTGAGCATTT CCAACTCGCT CGTCCGGCTG GAACCCTACC AGCTGGCGTG CGTCAATTGG GTGATGCAGA AACTCCGACA GCGTGCGCTC ATCGGGGACG ACGTCGGTCT CGGGAAGACC ATCGAGGCGG GGCTCATCCT CAAAGAATTG AGTGCCCGAA ACCGGGCCGA CCGCGTCCTC TTCGTCGTCC CTGCCCACCT CCAGAAGAAG TGGATCCGGG ATATGGATCG CTTCTTCGAC GTGGATCTCA CCGTCGCTGA CCGGGCGTGG GTCGAAGGCG AGCGTCGGCG TCTCGGAGAG GAGGCCAATA TCTGGAATCA AGACCAGCAG CAACTCGTCA CCAGTATGGC CTTCCTGCGG CAAGAGGAGT TCCGGTCGGC GCTCCGAGAC GCGTTCTGGG ATGTCGTCGT GGTCGACGAG GCCCACAAGG CCGCGAAGCG GGGCGACTCC CCGAGCAAGA CGGCGAACAT GGTCGAGACT GTTGCGGGCA ACTCCGACTC ACTCCTCCTC CTCAGTGCGA CGCCTCACGA CGGGAAGGGA GAGGCATTCC GGTCGCTCGT CGAGTACATC GACCCCTTCC TCGTCGCCGA GAACCGCGAG CTCTCGCAGG AGACAGTCGA CCGTGTGATG ATCCGACGCG GCAAGACGGA CATCTACGAC GACGACGGCG AGCGCATCTT CCCCGACCGG GAGGTCAACT CTGTCTCCGT CTCGATGACC CACGACGAAC GGCAGTTCTA CCGCGCCGTC ACGGACTACG TCAAGAACGT CTACAATCGC TCTGAGAAGC TGAACGAGCC CGCGGTCGGG TTCGCGATGG CGCTCATGCA GAAGCGGCTG GTCAGCAGCG TCGGCGCGAT TCACGCCACG CTCCGCCGGC GGCTGAACGA TTTGCTCGAT GAACAGACGG CGACGGACGA ACTCTCCGAG GAGGCCGAGG CCTACCTCGA CGGCGAGGAT CTGGACGAGG ACGACAAGCA GCGAGCGGAA GACGAGATTG CCGGGCTGAC CGTCGCCAGC AACGATGAAC AACTCCAAGA GGAGATCGAC ACGCTCCGCG ATCTCGTCTC ACTCGCCGAA GACCTGCCCG TCGACTCGAA GGCCCAGAAG GTACGGCGGT TCATCAGCCA ACTCCTCGAA GAGCAACCAG ACGAGAAGCT CCTGCTGTTC ACCGAGTACC GGGATACGCT CGACTACCTC CTCGACTTCG TACAGGACGA GCCGTGGGCC GAAGAGATTC TGGTCATCCA CGGTGACGTC GACAAAGAGG AGCGGGCCCG CATCGAAGAC GAGTTCAATC ACGGCCAGTC GCGACTGCTG TTCGCGACCG ACGCGGCGAG CGAGGGGATC GACCTCCAGC ATAGCTGCCA CATTATGGCG AACTACGAGT TGCCGTGGAA TCCCAACCGG CTCGAACAGC GCATCGGCCG GATCCACCGC TACGGGCAGG ACAGGGAGGT CAAGGTCTGG AACTTCCTCT TTGATGACAC TCGTGAGAGT 
GAAATCTTCG AGATGCTCCA GACCAAAGTC GAAGAGATTC GCTCGAAGCT CGGCAACACG GCGGACGTGC TCGGAATCCT CGACGACATC GACGTGGACT CGCTCATTAT GGAGTCCATC CAGAACGACG AGCCGCCGAG CGCAACCAAG GAGGAACTCG AGGACCTGAT CGAGGAGCGC CAGCGCACGC TCGAAGAGTG GTACGAACGG AGTCTCGTCG ACACCAGCAC GTTCGACGCG GAGAGCCGCC GGCAGATACA GGAGGTCGTC GACGAGTCCG AAGACGTCTA CGGGAGTGCC GGTGACATTC GGGAGTTCTT CGAGCAGGCA GTCGAGGCCT TCGGCGGCGA GTTCGAGAAG CGCGGCACCA ATCTCTATCA GGCCGAATTA CCCGAGGACA TCCGACCCCC GGATGAGGAT GCGACCTTCG GCCCGTTCAC GTTCGACCGC GAATTTGCGA TGGAGCACGA GGAAATCACG TTCGTAGCGC CCGACACGGA CGTCCTCCAG CGACTCATGG CACGCGTGCT GGAGTTCGAT CGGGGAGATG TCGGCCTCAA ACTCCTCCCG TTCGTCGACA CGCCGGGAGT TACCTACAAT TATCGCGTAG CGTTCGAGGA TGGCACCGGA GAGGCAATCC GGGAGGAGAC AATCCCCGTC TTCGTCGACG CGGAGCAACG GGATGCCCAA CAAGGGCTCG GTGAGCGCGT CGTCGAGGGC GAGACGGTGT CTGCGAAACC AGGGGTGGAT GACATCCGGA CAGTAGTGGA CGCCGAAGAC GAACTTCGTG AAGCCGCCGA CCGCTATGTG AGTGCTCGGG TGAACGAGAT CAAGTCCGAC CTCAGTTCCA AACGCCACGA AGAGACTGCC CGGGAACTCG AAAACCTCAA CGAGTACGCA CAGTCCGAAC GCGAGCGCAT CGAGTCCTTT ATCGAGGAGT ACGAACGCAA GTCCGAGGCC GGCTCGGATA TGGACATCGC GATCCGGGGC CAGCGTGAGC GTCTCGAAAA ACTCGAAGAG CGAATCGAGA CGCGTCGCCA AGAGCTGAAA CGTCGGGAGC AGGTCATCTC GCTGGCGCCC GAGGTCGAGA ACTACTGTTT GACACTACCA CTCTGA
|
Protein sequence | MTDTTFSSGQ QVILNGTSAE VIQTRTVGDI EYLRAYIDGE GVKTVCLDDV DIQPHQTGLE TLSGQQLDDL HPDHEAVSAQ WFDLHTQATK LKLAHEQGQL LSISNSLVRL EPYQLACVNW VMQKLRQRAL IGDDVGLGKT IEAGLILKEL SARNRADRVL FVVPAHLQKK WIRDMDRFFD VDLTVADRAW VEGERRRLGE EANIWNQDQQ QLVTSMAFLR QEEFRSALRD AFWDVVVVDE AHKAAKRGDS PSKTANMVET VAGNSDSLLL LSATPHDGKG EAFRSLVEYI DPFLVAENRE LSQETVDRVM IRRGKTDIYD DDGERIFPDR EVNSVSVSMT HDERQFYRAV TDYVKNVYNR SEKLNEPAVG FAMALMQKRL VSSVGAIHAT LRRRLNDLLD EQTATDELSE EAEAYLDGED LDEDDKQRAE DEIAGLTVAS NDEQLQEEID TLRDLVSLAE DLPVDSKAQK VRRFISQLLE EQPDEKLLLF TEYRDTLDYL LDFVQDEPWA EEILVIHGDV DKEERARIED EFNHGQSRLL FATDAASEGI DLQHSCHIMA NYELPWNPNR LEQRIGRIHR YGQDREVKVW NFLFDDTRES EIFEMLQTKV EEIRSKLGNT ADVLGILDDI DVDSLIMESI QNDEPPSATK EELEDLIEER QRTLEEWYER SLVDTSTFDA ESRRQIQEVV DESEDVYGSA GDIREFFEQA VEAFGGEFEK RGTNLYQAEL PEDIRPPDED ATFGPFTFDR EFAMEHEEIT FVAPDTDVLQ RLMARVLEFD RGDVGLKLLP FVDTPGVTYN YRVAFEDGTG EAIREETIPV FVDAEQRDAQ QGLGERVVEG ETVSAKPGVD DIRTVVDAED ELREAADRYV SARVNEIKSD LSSKRHEETA RELENLNEYA QSERERIESF IEEYERKSEA GSDMDIAIRG QRERLEKLEE RIETRRQELK RREQVISLAP EVENYCLTLP L
|
| |
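
The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names (`gene_length`, `expected_protein_length`, `gc_fraction`) are hypothetical helpers and not part of any database API. `gc_fraction` shows one way the GC content field could be recomputed from a sequence printed in 10-base groups, as it is in the Sequence section.

```python
# Values copied from the Gene Information table above.
START_BP = 1940774
END_BP = 1943689


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 2916
print(expected_protein_length(length_bp))  # 971
```

Both printed values agree with the Gene Length and Protein Length rows. To check the GC content row, pass the full gene sequence string to `gc_fraction` and compare the rounded percentage.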