Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1640 |
Symbol | |
ID | 8383920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1620761 |
End bp | 1623631 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644972702 |
Product | protein of unknown function DUF1508 |
Protein accession | YP_003130547 |
Protein GI | 257052714 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATCG GCGACATTCG CGACCAAAAA CTCGTCGAAC TGTACCACCG GTATATCGGA GAACCGGAGT CGAAACGTGA CGTATACGGC TACTGGCTGT TGCTTCTCGG GAGTATCACG GGGTTGCTGG GCGTGTCCGT CTTCCAGATC GAACAGCTCT TCTTCCCGGG TAATTTCGAG ATCCGGGAGA TCGCCATCGT CCTGTCGGCG ATCGGGCTTG CGATCGGGCT GTTTGCCGTT ATTGTGCTCT TGCCGGTGCG TCGACGTGGG ATGCAAGCGA GCGTCGTCGG GCTCGCGATC GCCTTTCTCA GTATCTTCGC CTTCACGCAG GTCTATCCGG GATCCTGGAC GGTCGGGCCA GCCTACAGCG CCGAGATCAT CGCCCTCTAC ACGCTCGGGA TCGGGATCCT CGTCGCCGTC GCGATCCTCG TCCCGATCGT CACCGGCGAG AAGGGGCTGC TGGTCGAACC GGAACTCGGA CTCGGGACGG AGGAGGCACC GATCCTCGTC GGTGACGCCA CGCGGGACGC ATTTTTCACC GTCTACGAGA CGCCGACGAA CGACTGGACC TGGCGAACCA TTCGGCGTGA CGCGATCGGG CAAAGTGCCA CGGCCGTCTC CACGGACACC GACGCCCGGA TGGAGGTCGA GACAGTCCGG GAGAAGATCG CGGGTGCCGG ATTGCTGGAC ATCACGACAG CGGCGTTCCG ACTGTATCAG ACCGCGGACG AAGCCTGGGA GTGGTCGCTG GTTACCGCAG ACGGCAGCAT CATCGCCGCA AGCGACGATT CCTACGGGGA CCGCGACGGT ATCGAGTCCG CCGTCAATTT CCTCAAAGAG GAGACCCCGG CGGCCAGCCT CGTCGAGATC CAGGGGGCCG CCTTCGATAT CACCCAGGAC GAGGGCGACC GCTGGCACTG GCGACTCATC GACGAACGTC ACCGACCACT GGCCGCGGGA CCGGACGATT ACGCCCAGGA GTCGGCCGTC GAGTCGGGTA TCGACCGCTT CGTCGACTGG GTCGAGGATC CGCGCGTCCT CGCCGTCGAG GCGGTCGCCA TCGAACTGTT CGATGACGAG GACGGCTGGC GGTGGCGAGT CATCGATGCC ACCGACACCC GGCTGCTCAC CAGCGACGCG ACCTTCGACG CGCGAGGCGA CGCCGAAGTC GCGGCGACGA CCATCGCCGA CCATCTCTCG GAGGCGGCCG TCATCGAACA TGGATCGCCC GGCTTCGAGG TCCACGAAAC GAACAGCTGG GCCGGGGGGG ATTCCGACTT CGAGGGGGCG GGCTGGACCT GGCGGCTCCG GGACCAGGCC GACGAGATCG TCGCCACGAT GGACGGCCAG ACCGCCACCG AGGGCGACGC CACCGACGCC GCCGAGCGCA CCCGCTCGGC ACTCGATCGG ACGGAAACGA TCGAGTTTGA GGGGGCTGAC TACGAGGTGT ATCCCGCCGA GGGCGACTGG CACTGGCGAC TGGTCTCGGC CGAGCGGGAC GTCCTGGCCG ACAGTACCGT CGCCTTCGAT GGGAGGGAGG CGGCCGAGAC GGCTGCCGAT CGGGTTCGCG AGCAGGCCCT CGCCGCCGAC CTCATCGAGT TCGAACAGGC CGCCTTCCAG CAGTACGAGT CCGACGGCGA GTGGCGCTGG CGGCTTATCG ACGAGGATGG CAAGGTGATG GCCGACAGCG GCGAATCCTA CGAGGACAAA AGCGAGGTCA TGGAGGGGAT GCGGACGCTG AAGGAAAACG CGCCCGACGC GGAGGTTCTG GAGATCGACA CCGCTGCGTT CGAGATTCAC CTGACGGCGG CCGGCGAGTA CGCGTGGCGA CTCATCGACG AGGGTGGGAA ACTCATCGCC GAGAGCGCCA GATCCTATCC GTCGCGGACG ATGGCCCGGG AGGCCGTGGA CTTCCTGATC GAACACATCG ATGACGCGGC CGTGCGGGCG ATGGAACACG CCACCTTCCA GCTCCTCAGC GACGAGGAGA CCTGGACCTT CCAGCTGATC GATACAGACG GGACGATCCT CGCCGAGTCC GTCGAGGACT ATCCCACCTA CGACGACGTG ACGACGGCCA TCGCGGACGT CCGCGAGGCC GGAGCGGGCG CACCGATCGA CACGATGCGA GAGGTGACGG TGCAACTGCG ACAGAATGCC GGGTATCACT GGCGACTCAT CGACCGGGAT CGGCGTCCCG TCGCCAGCGG CGAGCGAACC TACGAGACCC GGAGCGCCGC CGAGGCGGAC GTCGACCGCC TCCTGGAACA TGCGAGCCAC GCACCGGTGT TCGACATCGG CCGTGGCGTC GTCTGGATCG ACCGCCTCGA GGACGAGTGG CGGTGGCGAC TCGTCGACGC TGACCGGACG GACCTCGCCG TGAGTCCCCA GGCGTACGAT AGTTACGAGG CGCTCATCGA CGACGTCGAG ACGGTCCAGG CTCAGGCCGC CGACGCCGAA CCGATGGACA TCGAGACGCT TGCGTTCGAG CCATACCGTG AGGGTGACCT CGACGGCGAG GCGACGGCAG GGAGCGACGA CGGCGATGCG GGGGTCTGGC GGTGGCGACT CATCGACGAA CGCGAGACGG TGCGGGCGGT GAGTGCGGCC AGCTACGAGA GCCGGGACGC GGTCGCCGAC GCGATCGATG ACGCCAGAGC GACCACCGAG AAGGCAAGCA TCCTCGAAAT CGACGAGGTG TCCTTCGAGT TCGCCCAGCG CGACGACGGC TGGATCTGGC GGCTCATCGA CGAGAACGGC TCGGCGATCG CCGAGAGCGT CGAACCACAC GAGACTCGCC AGTCGGCCCG CGAGGAGATG CTCACCGTCA AAGAACACGC GCCCGAGGGG GAGACTGTCG TCGCCTGGTG A
|
Protein sequence | MAIGDIRDQK LVELYHRYIG EPESKRDVYG YWLLLLGSIT GLLGVSVFQI EQLFFPGNFE IREIAIVLSA IGLAIGLFAV IVLLPVRRRG MQASVVGLAI AFLSIFAFTQ VYPGSWTVGP AYSAEIIALY TLGIGILVAV AILVPIVTGE KGLLVEPELG LGTEEAPILV GDATRDAFFT VYETPTNDWT WRTIRRDAIG QSATAVSTDT DARMEVETVR EKIAGAGLLD ITTAAFRLYQ TADEAWEWSL VTADGSIIAA SDDSYGDRDG IESAVNFLKE ETPAASLVEI QGAAFDITQD EGDRWHWRLI DERHRPLAAG PDDYAQESAV ESGIDRFVDW VEDPRVLAVE AVAIELFDDE DGWRWRVIDA TDTRLLTSDA TFDARGDAEV AATTIADHLS EAAVIEHGSP GFEVHETNSW AGGDSDFEGA GWTWRLRDQA DEIVATMDGQ TATEGDATDA AERTRSALDR TETIEFEGAD YEVYPAEGDW HWRLVSAERD VLADSTVAFD GREAAETAAD RVREQALAAD LIEFEQAAFQ QYESDGEWRW RLIDEDGKVM ADSGESYEDK SEVMEGMRTL KENAPDAEVL EIDTAAFEIH LTAAGEYAWR LIDEGGKLIA ESARSYPSRT MAREAVDFLI EHIDDAAVRA MEHATFQLLS DEETWTFQLI DTDGTILAES VEDYPTYDDV TTAIADVREA GAGAPIDTMR EVTVQLRQNA GYHWRLIDRD RRPVASGERT YETRSAAEAD VDRLLEHASH APVFDIGRGV VWIDRLEDEW RWRLVDADRT DLAVSPQAYD SYEALIDDVE TVQAQAADAE PMDIETLAFE PYREGDLDGE ATAGSDDGDA GVWRWRLIDE RETVRAVSAA SYESRDAVAD AIDDARATTE KASILEIDEV SFEFAQRDDG WIWRLIDENG SAIAESVEPH ETRQSAREEM LTVKEHAPEG ETVVAW
|
| |