Gene Huta_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1920 
Symbol 
ID8384211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1940774 
End bp1943689 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content61% 
IMG OID644972988 
Producthelicase domain protein 
Protein accessionYP_003130822 
Protein GI257052989 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGACA CGACATTCTC TTCCGGCCAA CAAGTGATTC TCAACGGCAC GTCCGCGGAG 
GTCATCCAGA CGCGGACAGT TGGTGACATC GAGTATCTTC GAGCATACAT CGACGGCGAG
GGCGTCAAGA CGGTCTGTCT GGACGACGTC GATATCCAGC CACACCAAAC CGGGCTGGAG
ACCCTCTCCG GGCAGCAACT CGATGATCTT CATCCGGATC ACGAGGCCGT ATCTGCCCAG
TGGTTCGACC TCCACACGCA GGCGACGAAG CTGAAGCTCG CCCACGAGCA GGGACAGTTG
CTGAGCATTT CCAACTCGCT CGTCCGGCTG GAACCCTACC AGCTGGCGTG CGTCAATTGG
GTGATGCAGA AACTCCGACA GCGTGCGCTC ATCGGGGACG ACGTCGGTCT CGGGAAGACC
ATCGAGGCGG GGCTCATCCT CAAAGAATTG AGTGCCCGAA ACCGGGCCGA CCGCGTCCTC
TTCGTCGTCC CTGCCCACCT CCAGAAGAAG TGGATCCGGG ATATGGATCG CTTCTTCGAC
GTGGATCTCA CCGTCGCTGA CCGGGCGTGG GTCGAAGGCG AGCGTCGGCG TCTCGGAGAG
GAGGCCAATA TCTGGAATCA AGACCAGCAG CAACTCGTCA CCAGTATGGC CTTCCTGCGG
CAAGAGGAGT TCCGGTCGGC GCTCCGAGAC GCGTTCTGGG ATGTCGTCGT GGTCGACGAG
GCCCACAAGG CCGCGAAGCG GGGCGACTCC CCGAGCAAGA CGGCGAACAT GGTCGAGACT
GTTGCGGGCA ACTCCGACTC ACTCCTCCTC CTCAGTGCGA CGCCTCACGA CGGGAAGGGA
GAGGCATTCC GGTCGCTCGT CGAGTACATC GACCCCTTCC TCGTCGCCGA GAACCGCGAG
CTCTCGCAGG AGACAGTCGA CCGTGTGATG ATCCGACGCG GCAAGACGGA CATCTACGAC
GACGACGGCG AGCGCATCTT CCCCGACCGG GAGGTCAACT CTGTCTCCGT CTCGATGACC
CACGACGAAC GGCAGTTCTA CCGCGCCGTC ACGGACTACG TCAAGAACGT CTACAATCGC
TCTGAGAAGC TGAACGAGCC CGCGGTCGGG TTCGCGATGG CGCTCATGCA GAAGCGGCTG
GTCAGCAGCG TCGGCGCGAT TCACGCCACG CTCCGCCGGC GGCTGAACGA TTTGCTCGAT
GAACAGACGG CGACGGACGA ACTCTCCGAG GAGGCCGAGG CCTACCTCGA CGGCGAGGAT
CTGGACGAGG ACGACAAGCA GCGAGCGGAA GACGAGATTG CCGGGCTGAC CGTCGCCAGC
AACGATGAAC AACTCCAAGA GGAGATCGAC ACGCTCCGCG ATCTCGTCTC ACTCGCCGAA
GACCTGCCCG TCGACTCGAA GGCCCAGAAG GTACGGCGGT TCATCAGCCA ACTCCTCGAA
GAGCAACCAG ACGAGAAGCT CCTGCTGTTC ACCGAGTACC GGGATACGCT CGACTACCTC
CTCGACTTCG TACAGGACGA GCCGTGGGCC GAAGAGATTC TGGTCATCCA CGGTGACGTC
GACAAAGAGG AGCGGGCCCG CATCGAAGAC GAGTTCAATC ACGGCCAGTC GCGACTGCTG
TTCGCGACCG ACGCGGCGAG CGAGGGGATC GACCTCCAGC ATAGCTGCCA CATTATGGCG
AACTACGAGT TGCCGTGGAA TCCCAACCGG CTCGAACAGC GCATCGGCCG GATCCACCGC
TACGGGCAGG ACAGGGAGGT CAAGGTCTGG AACTTCCTCT TTGATGACAC TCGTGAGAGT
GAAATCTTCG AGATGCTCCA GACCAAAGTC GAAGAGATTC GCTCGAAGCT CGGCAACACG
GCGGACGTGC TCGGAATCCT CGACGACATC GACGTGGACT CGCTCATTAT GGAGTCCATC
CAGAACGACG AGCCGCCGAG CGCAACCAAG GAGGAACTCG AGGACCTGAT CGAGGAGCGC
CAGCGCACGC TCGAAGAGTG GTACGAACGG AGTCTCGTCG ACACCAGCAC GTTCGACGCG
GAGAGCCGCC GGCAGATACA GGAGGTCGTC GACGAGTCCG AAGACGTCTA CGGGAGTGCC
GGTGACATTC GGGAGTTCTT CGAGCAGGCA GTCGAGGCCT TCGGCGGCGA GTTCGAGAAG
CGCGGCACCA ATCTCTATCA GGCCGAATTA CCCGAGGACA TCCGACCCCC GGATGAGGAT
GCGACCTTCG GCCCGTTCAC GTTCGACCGC GAATTTGCGA TGGAGCACGA GGAAATCACG
TTCGTAGCGC CCGACACGGA CGTCCTCCAG CGACTCATGG CACGCGTGCT GGAGTTCGAT
CGGGGAGATG TCGGCCTCAA ACTCCTCCCG TTCGTCGACA CGCCGGGAGT TACCTACAAT
TATCGCGTAG CGTTCGAGGA TGGCACCGGA GAGGCAATCC GGGAGGAGAC AATCCCCGTC
TTCGTCGACG CGGAGCAACG GGATGCCCAA CAAGGGCTCG GTGAGCGCGT CGTCGAGGGC
GAGACGGTGT CTGCGAAACC AGGGGTGGAT GACATCCGGA CAGTAGTGGA CGCCGAAGAC
GAACTTCGTG AAGCCGCCGA CCGCTATGTG AGTGCTCGGG TGAACGAGAT CAAGTCCGAC
CTCAGTTCCA AACGCCACGA AGAGACTGCC CGGGAACTCG AAAACCTCAA CGAGTACGCA
CAGTCCGAAC GCGAGCGCAT CGAGTCCTTT ATCGAGGAGT ACGAACGCAA GTCCGAGGCC
GGCTCGGATA TGGACATCGC GATCCGGGGC CAGCGTGAGC GTCTCGAAAA ACTCGAAGAG
CGAATCGAGA CGCGTCGCCA AGAGCTGAAA CGTCGGGAGC AGGTCATCTC GCTGGCGCCC
GAGGTCGAGA ACTACTGTTT GACACTACCA CTCTGA
 
Protein sequence
MTDTTFSSGQ QVILNGTSAE VIQTRTVGDI EYLRAYIDGE GVKTVCLDDV DIQPHQTGLE 
TLSGQQLDDL HPDHEAVSAQ WFDLHTQATK LKLAHEQGQL LSISNSLVRL EPYQLACVNW
VMQKLRQRAL IGDDVGLGKT IEAGLILKEL SARNRADRVL FVVPAHLQKK WIRDMDRFFD
VDLTVADRAW VEGERRRLGE EANIWNQDQQ QLVTSMAFLR QEEFRSALRD AFWDVVVVDE
AHKAAKRGDS PSKTANMVET VAGNSDSLLL LSATPHDGKG EAFRSLVEYI DPFLVAENRE
LSQETVDRVM IRRGKTDIYD DDGERIFPDR EVNSVSVSMT HDERQFYRAV TDYVKNVYNR
SEKLNEPAVG FAMALMQKRL VSSVGAIHAT LRRRLNDLLD EQTATDELSE EAEAYLDGED
LDEDDKQRAE DEIAGLTVAS NDEQLQEEID TLRDLVSLAE DLPVDSKAQK VRRFISQLLE
EQPDEKLLLF TEYRDTLDYL LDFVQDEPWA EEILVIHGDV DKEERARIED EFNHGQSRLL
FATDAASEGI DLQHSCHIMA NYELPWNPNR LEQRIGRIHR YGQDREVKVW NFLFDDTRES
EIFEMLQTKV EEIRSKLGNT ADVLGILDDI DVDSLIMESI QNDEPPSATK EELEDLIEER
QRTLEEWYER SLVDTSTFDA ESRRQIQEVV DESEDVYGSA GDIREFFEQA VEAFGGEFEK
RGTNLYQAEL PEDIRPPDED ATFGPFTFDR EFAMEHEEIT FVAPDTDVLQ RLMARVLEFD
RGDVGLKLLP FVDTPGVTYN YRVAFEDGTG EAIREETIPV FVDAEQRDAQ QGLGERVVEG
ETVSAKPGVD DIRTVVDAED ELREAADRYV SARVNEIKSD LSSKRHEETA RELENLNEYA
QSERERIESF IEEYERKSEA GSDMDIAIRG QRERLEKLEE RIETRRQELK RREQVISLAP
EVENYCLTLP L