Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0156 |
Symbol | |
ID | 8382418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 153216 |
End bp | 156470 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644971214 |
Product | alpha-L-rhamnosidase |
Protein accession | YP_003129077 |
Protein GI | 257051244 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACAC CGGCGGACGC GACACCGAGC CACCTCCGCG TCGAGTACGA GAAGTCGCCG ACGAACGTCG ACCCGTCGCG TCCGCCCCGT TTCTCCTGGC GTATCGAGAC AGCACGTCGC GGCGCGGCCC AGCGAGCCTA CCGCCTCGTC GTCGGACGCA ATCCAGACGC AGTCGCCAAC GGCGACGGAA CGCTGTGGGA CTCCGAACGG GTCGACTCGG ATCGAGCCAC CAACGTCGTC TACGACGGTC CCGACCTCGC AGCCGACAGG ACCTACTACT GGTCGGTGAA GGTCTGGACG GACCGCGGCG AGACCGAGTG GGCCGAGCCC GAATCGTTTT CGACGGCGCT CCGTCCGGAA GACTGGTCGG GCGAGTGGAT CGCCCATCAG CCCGGCGTCG GCGACACCGA CGGCTGGCGT AGCCAGTGGC ACCGGCCCGA GGAGGACGCC GCGGAGTGGG TTCAACTCGA CCTCGGCGAA AGCCGGCCTG TCGACGAAAT CACGCTTCAC CCGGCCGATC CGATCTCGGT CGTGCGGACC CCCGACGATG TGGCGGTGAC GATCCACTGG TCGTCGAACC CACTCGCTGG CTTTGGCTTC CCCGAGACTT ATCGCATCGA ACTATCCGAC GATCCCGACT TCGCCGAGAG TACGGTCGTC GCCGAGCGGA GGATGCCGGA CCAGGGCGAA AGCACCACCG ACGGGCTCCC GACCGACCTC GCGACGCCGC CCCAGACCCA CGACGGGCTC GACGCCGGGG GGCGGTACCT CCGCATCACG GCGACCGACC TCTTCGAGTT CGTCCCCCCG ACGAGCCGCG ACGCGGCAGA CTCCCGCAAG ACCGAGGGGG TCCACCCCTG GCAGTGTTTC GCCCTCGCCG GCGTCACCGT CCGGGACGGG GAGGGGACGG ATCTGACCGG CGACGCGTCC GCCGAGGTCT CCTCGTCGGT CGAGACGGCG ACGTGGGGCC GCGAGCATCT CCTGAACGGC CACACCGACT CGCGGCTGGC CTCAACATCG CCACAGCTCC GCCGGGAGTT CGAGCTTGAG AAACCAGTCG AACGGGCCCG TGCGCACGTC GCCGCCGTCG GGTACGGGGA ACTCGCGATC AACGGCGAGA AAGTCGGCGA CCGCGTGCTC GACCCGGCCT GGACCGAGTA CGAGAAGCGG ATTCTCTACT CGACGGACGA CGTCACCGAA CAACTGACCA AGGGTGACAA CGCCGTCGGG CTGTGGCTCG GTCGCGGCTG GTTCGCCAAA CGCGGGGCCT ACTGGGTCGC CGACGGATCA CCGCGAGCCC GCGTCGTTCT CACGGTCGAA TTTACGGACG GGACGACCCG CCGGATATCT ACCGACTGCG ACTGGGCGGC CCGGGAGAGC CCGATCGTCG AAAACGACAT CTACGACGGC GAGACCTACG ATGCCCGCCG CGAGGCCGAC GGCTGGCGAC GCCCCGGCTT CGAGGACCCG GAGTGGGACG GTGCCGAAGT CGTCGACTCC CCGGGAGGAA CGCTCAGACC CGAGCGGATC GAACCGATGG AAATCGTCGA GACCTTCGAC GTCGAGGACG TTCACGAGCA CCCCGACGGG CCGATCCTCG ACTTCGGCCA GAACCTTACC GGCTGGCTCG AAATCGAAAT CGAGGACCCG GCGGCCGGCG AGGAGATTAC GCTCAAACAC GCCGAGACGC TGACCGACGA CGGCGACCTC GCGACGGTCG ACCTCCGGAG CGCCGACGCG ACCGACACCT ACCTCGCCCG CGGCGAGGGG AGCGAGACCT ACGAACCCCG ATTCACGTAC CACGGGTTCC GGTACGCCCA GGTCGAGGGA TATCCGAGCG AACTCGATCC CGAGGCCGTC ACGGCGAAGG TCGTCCACAC GGCGATGGAT CGACGCGGGG AGTTCGCCTG CTCGAATGAC GACCTGAACC AGGTCCAGCA CAACGCCGTC TGGGGGCTGG CCGGTAATGC CCACTCGATC CCGGAGGACT GTCCACAGCG CGACGAACGC TTCGGTTGGA CCGGTGACGC ACAGATCGCC GCTCGGTCGC TTGCGTTCAA CTTCGACGCC GCACGCTTCC ACGAGAAGTG GGCGCGCGAT CACGACGACG CGGCGAGCGA ACTGGGCTAC GCGCCGGACG TCATCCCAAA CAAGGCCCCG GAGAACCCGG GCGACCCGAC GTGGACGATT ACCCGCGTCG TGATCCCCTG GCATCGCTAC CGCCACGACG GCGACGAGCG CATTCTCGAA GAACAGTTCG AGGGCATGCG CGCCCACGTC GAGTACTGGC TGTCGGTGAC CGAGGACGAT CTGGTTCCGG GCGCGTACGG GAAGTTCGGC GACTGGCTGG CCTTCGAGAA CACCAACGGT CGGCGCGGGC TCCCCTACGA TCTGTTCACG ACGGCGTTCG TCTACCAGAT CACGGATATC CTCGCGAAGA TCGCCGATGT GCTCGACAAC GATGGCGACG CCGCTCGCTA CCGCGAGTGG GCCGATCGGC TCCCGACGGC GTTCAACGAG GAGTTCTTCG AACCCGCGGC GGGACGCTAT CGGCCCGGGA CGCAGGCCTC CTACGCCGTG CCGCTGTTCC TCGGGCTCGT CCCCGAGGAC CACGTCGAGA CGGTCGCGGC TGGGCTGGCC GAGAAGGTCC GTTCCGACGG CGGGAAGCTC AAGACCGGGT TCCTGGGGAC TCGCCCACTC ATCCAGACGC TCGCCGAGCA CGGCTACGAC GACCTGGCAT ACGAGGTCGT CAGCCAGCCC GAACGGCCGG GCTGGGTGTA CATGGCGCGT AACGGCGCGA CCACGATGTG GGAGCGCTGG AATTCGGACG ACAGCATCGG CTCGGGTATG AACTCGCTGA ACCACTCCCC GCTCACGCAC GTCTCGGAAT ACTTCTACGA GGTACTGGCC GGGATCAAGA TCGGCGATCG GCCCGTGACT GACCACGTCA CGATCGCGCC CTCGCTCGTC GAGGACCTGG AATGGGTCGA AGCCAGCTAC GAGACCCGTA ACGGGGAGAT GGCCGTCGAC TGGGAGCGGA CCGACGGGGG CTATGACCTG TCGGTGAGGG TGCCCTGGAA CACGTCGGCG ACCATCAGGC TCCCCGACGC GGCCGGTAGG TCGGTGTCCG AATCGGGAGT CGATCTATCC GTCGAGACGA CTCCCACACG CGAGGGTGTC CGATCGATCG ACTACGAGGA TGACGTGGTT GTCCTGGCTC TCGACGCCGG CGAGTTCGAT CTCTCGGTCC GGTAA
|
Protein sequence | MTTPADATPS HLRVEYEKSP TNVDPSRPPR FSWRIETARR GAAQRAYRLV VGRNPDAVAN GDGTLWDSER VDSDRATNVV YDGPDLAADR TYYWSVKVWT DRGETEWAEP ESFSTALRPE DWSGEWIAHQ PGVGDTDGWR SQWHRPEEDA AEWVQLDLGE SRPVDEITLH PADPISVVRT PDDVAVTIHW SSNPLAGFGF PETYRIELSD DPDFAESTVV AERRMPDQGE STTDGLPTDL ATPPQTHDGL DAGGRYLRIT ATDLFEFVPP TSRDAADSRK TEGVHPWQCF ALAGVTVRDG EGTDLTGDAS AEVSSSVETA TWGREHLLNG HTDSRLASTS PQLRREFELE KPVERARAHV AAVGYGELAI NGEKVGDRVL DPAWTEYEKR ILYSTDDVTE QLTKGDNAVG LWLGRGWFAK RGAYWVADGS PRARVVLTVE FTDGTTRRIS TDCDWAARES PIVENDIYDG ETYDARREAD GWRRPGFEDP EWDGAEVVDS PGGTLRPERI EPMEIVETFD VEDVHEHPDG PILDFGQNLT GWLEIEIEDP AAGEEITLKH AETLTDDGDL ATVDLRSADA TDTYLARGEG SETYEPRFTY HGFRYAQVEG YPSELDPEAV TAKVVHTAMD RRGEFACSND DLNQVQHNAV WGLAGNAHSI PEDCPQRDER FGWTGDAQIA ARSLAFNFDA ARFHEKWARD HDDAASELGY APDVIPNKAP ENPGDPTWTI TRVVIPWHRY RHDGDERILE EQFEGMRAHV EYWLSVTEDD LVPGAYGKFG DWLAFENTNG RRGLPYDLFT TAFVYQITDI LAKIADVLDN DGDAARYREW ADRLPTAFNE EFFEPAAGRY RPGTQASYAV PLFLGLVPED HVETVAAGLA EKVRSDGGKL KTGFLGTRPL IQTLAEHGYD DLAYEVVSQP ERPGWVYMAR NGATTMWERW NSDDSIGSGM NSLNHSPLTH VSEYFYEVLA GIKIGDRPVT DHVTIAPSLV EDLEWVEASY ETRNGEMAVD WERTDGGYDL SVRVPWNTSA TIRLPDAAGR SVSESGVDLS VETTPTREGV RSIDYEDDVV VLALDAGEFD LSVR
|
| |