Gene Information |
Locus tag | Huta_1097 |
Symbol | |
ID | 8383371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1074627 |
End bp | 1075895 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644972158 |
Product | polysaccharide deacetylase |
Protein accession | YP_003130009 |
Protein GI | 257052176 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
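
The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1074627
END_BP = 1075895
GENE_LENGTH_BP = 1269
PROTEIN_LENGTH_AA = 422


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1075895 - 1074627 + 1 gives 1269 bp, and 1269 / 3 - 1 gives 422 aa, matching the listed values.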
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGACC AAGATACGGG GGAGCGGACG ACGGTCGATC GTCGGCGGTT TTTGACGATC GCGGGGGCAA CAGGACTCGC AGCGCTTGCC GGCTGCGAGG AGGATACGAC GACAGAGGGC GGGGAAACAC CGACGGAAAC GTCGAAACCG TCGGATACAG AGACAGACAC GGAGACGGAA GCCGACACGG AAACGGAAGC CGAAACTGCG GATGAAACAG AAACCGAAAC GGAGACTGAA GCGGAAACCG ACACGGAGAC CGAACCGGAA CTGGATCCAG AAGTCCCGCA GTCGGCGAAC GGGCCGCTGG CTCCATTGCC GACGCCCGAC CGAAACGAGG TCCCGAAGCC GGCGGGGGAT GCAGGGGGCC TCGAAGTGCT CGACTGGGCC GGATTCGAGG GCGCTGTCAC CTACACGTAC GACGACGGCC AACCCTCGAA TCTCGAACAC TACAGCGCAC TCGCAAAGAC CGAGATGAAT ATGACGTTCT ATCTCACCAG CAACGTGAAC TTCGATGGGT ACGAAGGCGG CTGGACGCTG GCCGCTCAGG ACGGTCACGA ACTCGGGAAC CACACTGTCA GCCATCCCTA CGCCGATATG AGCGATAGCT CATTCGCCGA GCCAGCGGCG GACCCAGCAG CCGAAATCGA GGGGTGTTCC CGGTATATCA TCGACAATCT GGGCCAGGAA GACGTCTGGA CGATGGCTGC TCCATTCGGC GACACCGGCT GGAGTGAGCC GGCCAAGCAG TCCGATCTCT TCCTCAATCG TGGGGTCGGT GGCGGGGCCG TCGCTCCCAA TGACGACACT GACCCGTACA ACCTGCCGTG TTACATGGCC CAGGAGGGCG ACACTGCCGG GACCTTCACC GATCTGATAG ACAATGCACG GGACAACGGT GAGTGGCAAA TCTTCCTGTT TCATACGATC GCTCCGACCG ACGAGGTGTG GTACGCGCCA GTCGAAGTCG ACGCGATCAT CGACAGTATC GAGCACGCCA AATCCACCGG CGACGTCTGG ATCGACACGC TTGCCACCGT CGGCGCGTAC TGGCGCGGCC AGCAGATACT GGAATCCGCC GACGTCACGG AGTCCGACGG GGAAACCGTT TGGTCGTGGA CACTGTCCGA GTCGTTCCCG GCCGGTCGCC ACGTCCGTGT GACCGTCGAC GGCGGGACGC TCTCCCAGAA CGGCACGGAA CTGGAGTGGA ACTCCCACGG GTATTACGAG GTCGCGCTGG ACGAGGAGTC CCTGACGCTG TCGGCATAG
|
Protein sequence | MSDQDTGERT TVDRRRFLTI AGATGLAALA GCEEDTTTEG GETPTETSKP SDTETDTETE ADTETEAETA DETETETETE AETDTETEPE LDPEVPQSAN GPLAPLPTPD RNEVPKPAGD AGGLEVLDWA GFEGAVTYTY DDGQPSNLEH YSALAKTEMN MTFYLTSNVN FDGYEGGWTL AAQDGHELGN HTVSHPYADM SDSSFAEPAA DPAAEIEGCS RYIIDNLGQE DVWTMAAPFG DTGWSEPAKQ SDLFLNRGVG GGAVAPNDDT DPYNLPCYMA QEGDTAGTFT DLIDNARDNG EWQIFLFHTI APTDEVWYAP VEVDAIIDSI EHAKSTGDVW IDTLATVGAY WRGQQILESA DVTESDGETV WSWTLSESFP AGRHVRVTVD GGTLSQNGTE LEWNSHGYYE VALDEESLTL SA
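
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used to produce this record: it uses the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code for internal codons), ignores the alternative start codons that table 11 permits, and strips the spaces that separate the 10-base blocks. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGTCAGACC AAGATACGGG GGAGCGGACG"))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MSDQDTGERT`, the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.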