Gene Information |
Locus tag | Huta_2647 |
Symbol | |
ID | 8384952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2718707 |
End bp | 2723170 |
Gene Length | 4464 bp |
Protein Length | 1487 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644973721 |
Product | hypothetical protein |
Protein accession | YP_003131541 |
Protein GI | 257053708 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGTCT CGATGTTGCC GGTAGGGACC GTCTCTGCTG GGACAGTCAC CGAAACCGCG GGGAACACAG CGTCGGCTGA TACTGTCCCC GAGGTCTCCA TGGACGGGCC GCCGGTGGTG TTGAAAGCCC AGGCCATGGA ACGGATCGGG AACCTCTCGG CTCCCTCGAA CCGTCTTGCA TCCGCCAGGG ATCGCGCCCG AGCACGCGTG AACGATTCCT TTGCGTATTA CCAGGGGCCG AGACGCGTCA GCGACCAGCA ACTTTTCGTT GACGACGCCG TCGCGCTACG GGCGCTGGCT GCCTTCGACG GGACGAATCA AAGCAAGCGA ATCGGGCAGA TCGTCGAGTT GGTAGCGCGA GCGGATAATC AGTCTGCCAG GCAAGTCATC CGTGATGCTG AATCCGCATT CAAGGCGACT GAAGCGGACC TCGGCCCAGG AATGACACAT AGTGCTGCGG CGCACATCGA CAATGCTCGG CGACAACTCA ACCGTGCCGA ACGGATTCGT GACCGTGCAG CGGATAACTC GGGTGCCCAG TCCATCCGGA CGACGGCACG TGCCGTCCGG ACGTACGGCA CGGCGCTCAA CCAGGCACGG ACTGCCTTGG GATTGATCGG TGGCGAGATT GGCCCGGAGG TGACGCTCAC CCGACGGACG GATCCGGTCC GCAATGGAAG CGAACGAGCG CGCTATACGC TTGTGGGACA GGTCTCGGAT CCGACCGGGC TAGACGCGGT GAACGTCACT GCGACGATCA ACGATGATCG GACAGTCGAC CTTCCGCTTC GCCGTGGCTA CGCCAACGCC ACCTTCGCGA AAACGATCAA TCTCACCGCC CGTGTGAATA CGATCGAAAT CTTGGCCGTC GAAAGTACCG ACAACCAACA GTCGAACAAC GGCAAGCAGA AATCCAAGCA GAAGGGCAAA AAGAAGGGCA AAAAGAAGGG CAACAACGGC AACGGTGGCC AGTCATCCGG GCAGAGCCAG GCGAGCACGG TCGTGCTTCG CCTGGACGGT GACGGCCTGC CCGATACCTA CGAGAAGAAA GTGACGGGCA CGGATCCGCT GGATCCCGAT AGTGACGCGC TCTCGACTGA CGTGAACGAG GCCGACAATG GCACGATCGA CGGCCACGAG GATTTCGACG GCGATCGTCT GTCAACGATC CGAGAACGCG AACTTGGGAC GGATCCACTC GAGGCTGATA CTGATGGTGA TGGCCTCCCC GATGGCTACG AGCGTTTCGG GACCGGAACG GATCCACTCG ATATTGACAC GGACGGCGAT GGCACTTCCG ATGGTGCGGA AGACCTTGAC GGAGACGGGT TGACCAACAC TGAAGAGTAC GACGCTGAGA CCTCACCACA GTACGCCGAC ATCGACGGTG ACGGGTTGAC CGATCCCCAG GAACTCGCCA ACGGGACGGA TCCGTGGCAG GCCGACACTG ACGACGACGG CCTCGACGAT GGCGTCGAAC CGACCGACCC GTTCGGGACT GATCCACTGG ACCCTGACAC CGATGATGAC GGTGTCGACG ATGGCAACGA GACCTACACG ACCATGGCGG GAAACGAAAG TCTGGGGGTC GATGTCGAGT TGACTGGTCA GGGGAACGCT GCAGCCGAAA CGACAGTGCA GGAACCGTCC GATGCAATGT TCTCCAAGCG GTTTGGTCCG CCAAACGTCT CAGCATCTTC GTACGTGTCC ATCGAAACGC AAGCAGAGAT CAATAACGCG ACGGTGCAGA TCGGCTATGA CGGGCGTAAC GTAACGAACG AATCGACACT CGGTGTCTAC 
ACGTTTAACG AAACGAGGAA CACGTATCAA CCACTCCCCA GTACCGTGGA CCCCGCGAAC GACACGGTGA CGGCACGAAC GGAGCATTTC TCGAAGTTCG TCGTCTTCAA TCGGTCAGCG TGGCACGCAT TTCTCGACCG CCGCGCGGGG TATCCGGATG AAGATCTTGA ATTTGGCGTA ATAAATGAGA CTTTTGACGA CCTCGATGAG TGGACCTGTT CGAATGTTCC ACGAGGATCG GACGATTCCC ACCTCGATAC ACCGACGAAA GGCTTCTGTA AAATCGAAGA CGGCGCAGCG AAAGTCCAAG AGGAGACGAA CCGAGCGAGG TTTCTCAATC GGACGGTTTC CTTACCCAGT TCGCGGGTTC TCGATGGTCG ATCATTGCAT CTAAAGAGTT CACTACAGGC ACACGTCGAT GCCCAGTGGT CAAACGCAGC AATCAATCTA TATATTTCGC CAGTCAGCGA GACTGGTGCC GGGGCCAACT CCCAGCGGAT TTTCACATTG AGTAACGACG GCAGCAACGA AGACGAAACG ATCGCTGCCG CACCAACCAT AGACATCAGC AAGTTCGCCG GCCAGCAAGT CCGTCTCTCA ATCGAAGCAG ACGGACGGTG GACTTACCAG AACCAGAATA CGACGACGTG GATCAAAGCC GACTACATCA ACATCAGTAC GGCGAAGGCG GCGGATGAAG ACACTGACGG TGACGGAATC CCGGATAGCT TGGAGGAACG TGGCGTCCCA CTCTCGAACG GGGAGCGTGT CGACCTCATC TGGCTTTCTA GCAAGAAAAA TTCCACCGAT GAACCGCTCG GCTACGATAC GGACAACGAC GGGCTTGCAG ATGGCCAAGA AGTGTTGATC GACCAACCGG TGTACGCAGA CACGAGCCAT CGAGGAAAGA CCTTCGTGGG CTACGAGTGG CGGAGCCATC CCAAGACAAA GTTTTCCGAT GGGGATGGAT TGACAGATGG GATCGAGTAC AAGGGGTGGT CAATCGATAC GATCAACAAG AGCGGTCAAG CATATCGCTG GGCGAACAGC ACCGAGCAAC CAGCGAATGG GACACTAGAA GTCTCCTCAG ATCCCCAATC CAAGGATTCG GACGGTGATG GACTGACCGA TTTCGAGGAG AAGAAATACA CATATACGGA TCCGCGGAGT TCTGTGACGT ATTCACTCTC TCAAGAAAGA GCAGAATTAC TTGATGATGT ACGAATGCAC CGGGATAAAG AGTTCTTGAT GGATGTTCTA GGTGTCTCTC CCCAGTTGGC TCGCGGTCGG ATCACAGCGC AACCGGGTAT CGTTGACGAC GCCGGAACTG ATTTCGACTT CGTCGTCGAT GATTCGGTAC CCTCGGGCAT ATCCAAACCG CAACGAATCC ACCGTGTGAC GCTCATCGGA CTCGATGGCA AGAATCAGAC GGATATTTGG ATGTCGAACG AGGAAGAAAT CAGCTGGACG GCACCTTCAG GATGGGACGC AGAAAAACCG CTTGACCCGT GGGATCCTGA CACCGATGAC GATGGCCTGA CCGACGGTCA GGAAGTGCAT GGCTGCTCCG TGGTCACTGG CGGGTGGAAT GACCATCAGT ATACCTGGGA TCGGATTTAC AACCCGTCGA ATCCAAGTGA TCCTGACACG GACGACGACG GTTACTGGGA CGGCTTCATT GGCGTCCACG GTGTCGGGTA TTCGGACAAG GTGATTCTGT ATCGGGAACA CTTACACGAT AGTGATTCAG GTGTCCCAGC ACCGAGCGGG GTTCGCGGGG ATGAAACGGT CCCTGAGCAG GCGTTACTGC 
ACGAGGTATC CGGAGAGACG CCGGGCGCTA ATATCTACGA TAATGAGACT CGGTATCACT CGAAGGTTCA CGTCGGTGAG TTGCATTGGG GCACGTCGCC GACTGACGAA CACAGCGAAC CCGACACGTC CTGGACGATT GAAGTGGATT TCTACGATGG GATCCAACAC GAGGCGATCA ACACGTCAAC GTGGGAAAAC GGTATTGAGC AGAACATGCA GTTGTACGGT CTGGACGTGG ATCTGATCCG TGACGAAACG ATAACCGACC AGATGTTTCG TAACAGTACA GATGAGGAAC GGAGAGGCAT GCCAGATCAG GAACTAACCG ATGGAATCTC ATATACACTC CCGGAAGGGG GCAAGGAGTT AGAAACGACA TACGGCAATT TCAGCAAATC ACGACAGTAT ATCGACGAAT GGCTTCTCGT TGGCCCGAAT ATACAAAACA AACTACAGCC ACGTCCCTTT TACGGAGGGA ACCCCATCGG ACCGACGATG GCCGTACATG TGACAGATCT CCAATCACTC CGCTCTAATA TTTCGGTGCC AGTCCAGCAC TCACCCTATC AAGCCGAACT GGCCATGCTG ACGGCCGATA CCACAATGCA CGAAATCGCC CATACGTATT GTGTGGGTCT CGCCGACGAT GAGGGTGTGG ACATGAACAA GTGCCTGATG GAGAATGAAA CCTACAGTGG AAGCAACGCG GATGAAACGC CGGAATACAT TTGGAATGAA AAGGAGCGAT CGTTGATGAG TGCCATTTCC GAAGAGAAGT TTGGGCAACC GATGAACGGT GAATATTCGA TCTTCAGCAT TCAGGAGTTG CTAACGATCC ATGAACCTGA ATAA
|
Protein sequence | MIVSMLPVGT VSAGTVTETA GNTASADTVP EVSMDGPPVV LKAQAMERIG NLSAPSNRLA SARDRARARV NDSFAYYQGP RRVSDQQLFV DDAVALRALA AFDGTNQSKR IGQIVELVAR ADNQSARQVI RDAESAFKAT EADLGPGMTH SAAAHIDNAR RQLNRAERIR DRAADNSGAQ SIRTTARAVR TYGTALNQAR TALGLIGGEI GPEVTLTRRT DPVRNGSERA RYTLVGQVSD PTGLDAVNVT ATINDDRTVD LPLRRGYANA TFAKTINLTA RVNTIEILAV ESTDNQQSNN GKQKSKQKGK KKGKKKGNNG NGGQSSGQSQ ASTVVLRLDG DGLPDTYEKK VTGTDPLDPD SDALSTDVNE ADNGTIDGHE DFDGDRLSTI RERELGTDPL EADTDGDGLP DGYERFGTGT DPLDIDTDGD GTSDGAEDLD GDGLTNTEEY DAETSPQYAD IDGDGLTDPQ ELANGTDPWQ ADTDDDGLDD GVEPTDPFGT DPLDPDTDDD GVDDGNETYT TMAGNESLGV DVELTGQGNA AAETTVQEPS DAMFSKRFGP PNVSASSYVS IETQAEINNA TVQIGYDGRN VTNESTLGVY TFNETRNTYQ PLPSTVDPAN DTVTARTEHF SKFVVFNRSA WHAFLDRRAG YPDEDLEFGV INETFDDLDE WTCSNVPRGS DDSHLDTPTK GFCKIEDGAA KVQEETNRAR FLNRTVSLPS SRVLDGRSLH LKSSLQAHVD AQWSNAAINL YISPVSETGA GANSQRIFTL SNDGSNEDET IAAAPTIDIS KFAGQQVRLS IEADGRWTYQ NQNTTTWIKA DYINISTAKA ADEDTDGDGI PDSLEERGVP LSNGERVDLI WLSSKKNSTD EPLGYDTDND GLADGQEVLI DQPVYADTSH RGKTFVGYEW RSHPKTKFSD GDGLTDGIEY KGWSIDTINK SGQAYRWANS TEQPANGTLE VSSDPQSKDS DGDGLTDFEE KKYTYTDPRS SVTYSLSQER AELLDDVRMH RDKEFLMDVL GVSPQLARGR ITAQPGIVDD AGTDFDFVVD DSVPSGISKP QRIHRVTLIG LDGKNQTDIW MSNEEEISWT APSGWDAEKP LDPWDPDTDD DGLTDGQEVH GCSVVTGGWN DHQYTWDRIY NPSNPSDPDT DDDGYWDGFI GVHGVGYSDK VILYREHLHD SDSGVPAPSG VRGDETVPEQ ALLHEVSGET PGANIYDNET RYHSKVHVGE LHWGTSPTDE HSEPDTSWTI EVDFYDGIQH EAINTSTWEN GIEQNMQLYG LDVDLIRDET ITDQMFRNST DEERRGMPDQ ELTDGISYTL PEGGKELETT YGNFSKSRQY IDEWLLVGPN IQNKLQPRPF YGGNPIGPTM AVHVTDLQSL RSNISVPVQH SPYQAELAML TADTTMHEIA HTYCVGLADD EGVDMNKCLM ENETYSGSNA DETPEYIWNE KERSLMSAIS EEKFGQPMNG EYSIFSIQEL LTIHEPE
|
| |