Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1152 |
Symbol | |
ID | 8383427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1124663 |
End bp | 1126174 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644972211 |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_003130061 |
Protein GI | 257052228 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.737895 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTCTG CAATTCACGT CCAGCAGCGC GAACCGATCG CCGAGATCGA CCCGAACCTC TATGGCCACT TCGCCGAGCA CCTCGGGCGG TGCATTTACG GCGGACTGTG GGTCGGTGAG GACGACCGGG TCCCGACCGA GGACGGCGTC CGCATGGACA CGATCGACCT TCTCGCGGAT TTGGATATGC CTGTCCTGCG GTGGCCCGGC GGCTGTTTCG CCGACGACTA CCACTGGGAG GACGGGATCG GCCCGCGCGA GGAGCGGCCG ACCCGCCGCA ACGCCTTCTG GACGCAGGGA CGTGCGGACA TCCCCGAAGA GCCCAACGAG TTCGGCACGG AGGAGTTCAT GCGGGTCTGT GATCTGCTCG ATGCCGAACC CTACCTCGCG GCCAACCTCG GCAGCGGGTC GCCCGGCGAG GCGGCCGACT GGCTGGAGTA CTGTAACTAC GACGGCGACA CCGAACTCGC GAATCGCCGT GCGGAAAACG GCAGCGACGA CCCTCACGAC GTGAAGTTCT GGGGCGTCGG CAACGAGAAC TGGGGGTGTG GCGGCCGGAT GAGTCCGGAG CAATACGCCG AGGAGTTCCG TCGCTTCGCC ACGTACCTCC GCAGCGTCGA CGGCCTGCTG AGCGAGGAAG ACGTCGAACT CGTCTCCGTC GGGCACATCC ACGACGACTG GAACCGCAAA TTCCTCGACG CGCTCGACGA GTGTGCGAGC TTCACCGAAG GCCCCTACGA CCTGATGGAG CACATGTCCG TCCACCGGTA CTACGAGGCG GGCAGCGACA CGGACTTCGA CGACGAGGAG TACTTCCGAC TGCTCGCGCG CTCTCGCAAA GTCGGGGGTG ACGTCGACAA TGCGGTCGAC GCGCTGTCGG TCTACGCCCC CGAATCCGAC ATCAGTATCG TCGTCGACGA GTGGGGCGTC TGGCACCCCG AGGCGACCAA CACCAACGGC CTCGAACAGG AGAACACGGT CCGGGACGCG ATTTCGGCGG CCGGCGTCTT CGACGACCTC CACGAGCGGG CCGACGTCGT CGCGATGGCG AACATCGCCC AGACGGTCAA CGTCCTGCAA TGCCTGGTCC AGACCGACGA GGACGACGCC TGGCCGACCC CGACCTATCA CGTGTTCGAC CTCTACGAGG GCCACGCCGG CGGGACGGCC CTCGATACGG TGGTCGACAC CGACGCCCAC GAGGTCGAGG ACGAGCCATA CGACGTGCCG CTGGTGAGTG CCTCGGCCTC CGAGCACGAC GAGGAGATCT ATGTGACGCT GTCGAACCGG GCCCTGGAGA GCGAGGACGT CACCCTTACG CTCGACGACG ATGGCGCAAC TGTCCGCGAC AGCGCGGTGC TGTTCGCCGA CACCGACGTC GAGGCGTACT CGCGGAAGGA CAACGCGGCG GCGTTCGCGC CCGAGGAGAT CGACGTCGAG GCGACCGGCG ACGGGACGTT CGAGGTCACG GTCCCGGCGA GTTCGGTCGT GGCGCTGACC CTCGACGCCT GA
|
Protein sequence | MGSAIHVQQR EPIAEIDPNL YGHFAEHLGR CIYGGLWVGE DDRVPTEDGV RMDTIDLLAD LDMPVLRWPG GCFADDYHWE DGIGPREERP TRRNAFWTQG RADIPEEPNE FGTEEFMRVC DLLDAEPYLA ANLGSGSPGE AADWLEYCNY DGDTELANRR AENGSDDPHD VKFWGVGNEN WGCGGRMSPE QYAEEFRRFA TYLRSVDGLL SEEDVELVSV GHIHDDWNRK FLDALDECAS FTEGPYDLME HMSVHRYYEA GSDTDFDDEE YFRLLARSRK VGGDVDNAVD ALSVYAPESD ISIVVDEWGV WHPEATNTNG LEQENTVRDA ISAAGVFDDL HERADVVAMA NIAQTVNVLQ CLVQTDEDDA WPTPTYHVFD LYEGHAGGTA LDTVVDTDAH EVEDEPYDVP LVSASASEHD EEIYVTLSNR ALESEDVTLT LDDDGATVRD SAVLFADTDV EAYSRKDNAA AFAPEEIDVE ATGDGTFEVT VPASSVVALT LDA
|
| |