Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2395 |
Symbol | |
ID | 8384694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2447690 |
End bp | 2451355 |
Gene Length | 3666 bp |
Protein Length | 1221 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644973468 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_003131294 |
Protein GI | 257053461 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.740563 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACACA ACAATCCAGA CGACGACAGT ACAGCCCGAC GAACGACCGA GTCGACGGAG TCACCGTCAA CTGCCGGCAT CGCGAGCGCG TCACGGCGGG ACTTCCTGAA AGCAGCAGCG GCAGGCGGCG CGATCGCGAC CGGCTTCGGC GGTGGCCTCG TCGGGAGTGC GGCCGCGGAC GTCATCCCGA CGCCGCCGCT GCACGTCGAT GGCAACCTCA TCAAGGACCC CGACGGCGCG ACGGTCAACC TCCGGGGGGT CAACATGGCC GATCCCAAGC GGATCAACGT CACCGCGCCG GCCAGGGGCA AGACCGCGAC GGACGTGGTC GACCTGCTGA CGAACACGGA CGACGACTGG CATTCCCGGG TCATCCGGAT CCCGGTCCAG CCGGTCGACA TCGGCGAGCA CGAACCCGGT GAAGGGCCGC CGCCGGTCGC CTTCGACGAG GGCCAACTCG AGACGTATCT CGAGGAGCAT CTCGACCCGG TAATCGAGCG GTGTCTCCAG CGCGGCGCGT ACGCCATCAT CGACTATCAT CGCCACCGTG ACGTCCAGTG GAACGACGAC ACGCTGGGCG AGGAAGTCGA GATGTTCTGG GACACCGTCG CGCCGCGGTA CGCCGACCAG CCCCACGTCA TGTACGAACT GTACAACGAA CCGACCGAGC CGGGGATGTG GGGCGATCCG ACCCAGAGCC AGAACTGGGC GGACGTCTGG CGGGACTGGA AGGCGACGGC CCAGCCGTGG GTCGACACCA TCCGCGAGCA CGCGCCCGAC AACCTGATCC TGATCGGGTC GCCCAGCTGG TCACAGAGCC CCGAGGGGGC CCTGGTCGAG CCCTTCGACG GCGAGAACCT CGCCTACACC TTCCATATCT ATCCCGGGCA CAACTCGAGC CAGCAAAACG ACTGGGAGGA CGCCACCAAC AACGGCGAGG GTGTCGCCGG CGTCTACGAG GAGTACCCGC TGTTCGTCAC CGAGTGGGGC TGGGAGGAGA ACGGTGGCCA GTACATCGGC GGGACGACCT CCGGCTATGG CGAGCCGTTC CTCGAATTTT TGGAGAAGAG TGACGCCATC CACTGGACAG CTTGGTGTGC CGACCCCGTC TGGCGGCCAG TCATGTTCGA CCGGGCGTTC ACCGAGGAGA GCTTCGAGGA CAACATCGGC AACCCCTACG CGGAGGACGT GCCCGAAGAC TGTGCCGACC TGCCCTGCGA CTGGACGTTG CTCGGGGGCG ACAGCTACAT GGGCGAGACC GTCAAGAACG CCCTGATCGA CTACCAGGAC GCCAACCCGC CGACAGTCCC CTACGACGAG CAACCGCCGA CCACCCCGTC GAACCTCACC GCCGAAAACG TCACCGAGAC GACGGTCGAG CTCTCCTGGG ACGGCTCGAC CGACCAGGGC GAGGCGGGAC TTTCCCATTA CAACGTCACC GTCGACGGCC AGAAGATCAC ACAGGTGCCC GAAGCGACCA CGGCCACGAC CGTCGAGGGA CTGGAATCCG ATACGACAGT CACGATCGGC GTGAGCGCGG TCGATCGAGC GCGCAACGAA TCCGAGACGG TGACCGTCGA GGTGACGACC GATGCCTTCG AGGACTCGAC GCCGCCGTCC GTGCCGGCGA ACCTCACCTC GCCGGAGAAC ACCTGGCAGT CGGTGGCCAT TTCCTGGGAC GACTCCACCG ACGAAGGTGA CGCCGAAACC GCGGGACTCG ATGGCTACGT CGTCTACGTC GACGGCGAAC TCGAACGCGA GGTCGCCGCC GAGACCACGC AAGTCCAGAT CGGCGGCCTG GATTCGGACA CCACCTACGA GTTCGGCGTC AGTGCGGTCG ACCGTGCCGA CAACGAGTCC GACATCGCGA CCATCGACGT CACGACGGAT CTCGCCCGCG CGGGGCCGAA CGACCTGCTG ATCAACGACT ACGACGGCGA CCCGGCCTGG CCGGATTCGA ACGACCTCGG CAACTGGGTC GGCACCGGCG GCTTCGAGAG CGCAGAGGTC GTCGACGGCC GCCTCGAAAT CGACTACAAC GCCAGTGGCT GGTACGGCAC CGGCGTCTCC CAGGACATCA CCGACTACCC GACGCTGCGG ATGAAGGTGA CCGGCGAGAA CGGCGGCGAG CACCGCGGCA TCGAGTTGCA GTTTGCGGGC ATCGACCCAT TGCTCTCGGA GGTCACAGAC GACACGATCG GGACCACCGA GTCCATCGTC TCGGTCGACC TCGAGGCGGC CGGTGCTGAC CTCGAATCAC CCGGCCAGCT TACCCTCCGG TTCTACGATG CCGGCGATAG CTCGATCTCG ATCGACGAAC TATGGCTGGA CAGCGACGAA CCCGATGACG ACGGCGACTC GATCGCCCCG ACGGCCCCCG CGAGCGTCGA GTCGCCGACC CAGTCGGAGA CGGCCGTCGA GATCGAGTGG AGCGCTTCGA GCGACGACGG AGGCTCGGGA CTCGATCACT ACAACGTCTC CGTCGACGGA AGCATCGATC AGCAGGTGCC GGCTGGGACG ACCGCCGCGA CGATCGAGGG GCTCGACGCC GGGTCGAGCT ACGAGATCGG TGTCTCGGCG GTCGACGGCG CCGGCAACGA GTCCAGCCAG ACGACGGTGA CGGTCTCGAC GGCCGGTGGC GACGACGAAC AGGCACCGAG TGCGCCGGCG AACCTCACCT CGACGGACCG GACCGACACA TCGATCGATC TCGCATGGGA CGCCTCGACC GACGAGGGTG GGTCCGGCCT AGATCACTAC ACCGTCGCCG TCGCCGGTGA ACAGGTCCAG CAGGTTGACG CGGGGACGAC CACCGCGACG GTTTCGGAGC TCTCGCCAGG AACGAGCTAC GACATCGCCG TCACGGCGGT CGACGCTGCC GGCAACGAGT CCACTCCGGC GACGCTGACG GTGGCGACGA CCGACGGCGA CGACCAGCAA GCGCCCACGA TGCCCGGGAA CCTCTCGGTC ACCGGATCGA CCGCCGCCTC CATCGCCGTC TCCTGGGACG CCTCGACGGA CAGCGGTGGG TCCGGGCTGG ACCACTACAC CGTCTTCCTC GACGGGAGCC AGGACCAGCA GATCGAGGCC GGCACGACGG AGGCGACAGT CGTCGGGCTC TCCGCCGACA CAACCTACGA GATCGGGGTT TCGGCCGTCG ACGGCGCGGG CAACGAGTCC GAGACGGTAA CCATCGAAAC GACGACGCCC CCGGGCGATC CGGTGGCCGG ACTCGTCGTC AACGACTACG ACGGCGATCC GGCGTGGTCG AACCACCGGA ACGATCTGGG CAACTGGTGT GGGGCCGGCT CCTTCGCAAA CGGCGGTGGC GACGTCGAAG ATGGCGCACT CGTCCTCGAA TACGACAACG CCGGGTGGTT CGTCGAGCAG ATCCAGCAGG ACGTCAGCGA GTACTCCTCG ATCGTCTTCT CGATCGCCGG CGCGAGTGGC GGCGAGGGCG ATCACTTCGT CGTCGGCGTC GGCGGCAATC GCTCGACGTT CAGCGACGTC GCGGACGGCT CGATCGGGAC CTCGGTCGCT GACGTCGCGA TCGACATGGA ATCGGCCGGC ATCGACGCCG GATCACTGGG TGAACTCCGC CTGAACTTCT GGCAGGCCGG CTCCGGGAGC GGGACACTTC GGATCGAGGA GATCCGACTG GAGTGA
|
Protein sequence | MTHNNPDDDS TARRTTESTE SPSTAGIASA SRRDFLKAAA AGGAIATGFG GGLVGSAAAD VIPTPPLHVD GNLIKDPDGA TVNLRGVNMA DPKRINVTAP ARGKTATDVV DLLTNTDDDW HSRVIRIPVQ PVDIGEHEPG EGPPPVAFDE GQLETYLEEH LDPVIERCLQ RGAYAIIDYH RHRDVQWNDD TLGEEVEMFW DTVAPRYADQ PHVMYELYNE PTEPGMWGDP TQSQNWADVW RDWKATAQPW VDTIREHAPD NLILIGSPSW SQSPEGALVE PFDGENLAYT FHIYPGHNSS QQNDWEDATN NGEGVAGVYE EYPLFVTEWG WEENGGQYIG GTTSGYGEPF LEFLEKSDAI HWTAWCADPV WRPVMFDRAF TEESFEDNIG NPYAEDVPED CADLPCDWTL LGGDSYMGET VKNALIDYQD ANPPTVPYDE QPPTTPSNLT AENVTETTVE LSWDGSTDQG EAGLSHYNVT VDGQKITQVP EATTATTVEG LESDTTVTIG VSAVDRARNE SETVTVEVTT DAFEDSTPPS VPANLTSPEN TWQSVAISWD DSTDEGDAET AGLDGYVVYV DGELEREVAA ETTQVQIGGL DSDTTYEFGV SAVDRADNES DIATIDVTTD LARAGPNDLL INDYDGDPAW PDSNDLGNWV GTGGFESAEV VDGRLEIDYN ASGWYGTGVS QDITDYPTLR MKVTGENGGE HRGIELQFAG IDPLLSEVTD DTIGTTESIV SVDLEAAGAD LESPGQLTLR FYDAGDSSIS IDELWLDSDE PDDDGDSIAP TAPASVESPT QSETAVEIEW SASSDDGGSG LDHYNVSVDG SIDQQVPAGT TAATIEGLDA GSSYEIGVSA VDGAGNESSQ TTVTVSTAGG DDEQAPSAPA NLTSTDRTDT SIDLAWDAST DEGGSGLDHY TVAVAGEQVQ QVDAGTTTAT VSELSPGTSY DIAVTAVDAA GNESTPATLT VATTDGDDQQ APTMPGNLSV TGSTAASIAV SWDASTDSGG SGLDHYTVFL DGSQDQQIEA GTTEATVVGL SADTTYEIGV SAVDGAGNES ETVTIETTTP PGDPVAGLVV NDYDGDPAWS NHRNDLGNWC GAGSFANGGG DVEDGALVLE YDNAGWFVEQ IQQDVSEYSS IVFSIAGASG GEGDHFVVGV GGNRSTFSDV ADGSIGTSVA DVAIDMESAG IDAGSLGELR LNFWQAGSGS GTLRIEEIRL E
|
| |