Gene Information |
Locus tag | Haur_1246 |
Symbol | |
ID | 5733124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1452499 |
End bp | 1454823 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278386 |
Product | alpha-xylosidase YicI |
Protein accession | YP_001544022 |
Protein GI | 159897775 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
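The coordinate and length fields above can be cross-checked against each other. The sketch below is not part of the database record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final
    codon is a stop codon, which does not encode an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Haur_1246.
length_bp = gene_length(1452499, 1454823)
print(length_bp)                  # 2325
print(protein_length(length_bp))  # 774
```

Under these assumptions the computed values agree with the Gene Length (2325 bp) and Protein Length (774 aa) fields in the record.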
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.758952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAATTTA CTGATGGATA TTGGATGATG CGTGAGGGCG TTAATGCGCT CTTTGCATCA CAGGCCTACG ATGGCCAAAT AAACGACACA ACTCTGACGG TCTATGCGCC AGGCAAGCGG ATCAATCATC GAGGCGATAC ACTGAACCTA GGCACAATTA CCGCCCGTTT TTCCTCGCCC ATGCCCGATG TGGTGCGGGT CAAACTTACC CATTTTGAAG GCCAACGTTC ACTTGGCCCC AATTTTGCAA TTGCTGAATC AGCCAATGAG AGCGTGAGCA CCAGCGAAGA TGAGCAAGAA TTTCGCTTGA CCAGCGGCAA ATTGAGCGTG CAAATCCCCA AAACTGGCGA TTGGAGTATC AATTTTTGGG CTGAGAATCG CCGCATTACC AGCAGTGGCC GCAAAGGCAT TGGCTACATT AGCATGGCCG ATGCTGGCGA ATTTATGCAC GAGCAATTAT CGCTTGGCGT TGGCGAGCAA GTGTATGGCT TGGGCGAACG CTTTACCGCC TTCGTCAAAA ATGGCCAATC AGTCGATGTT TGGAATCAAG ATGGTGGCAC GGGCAGCGAG CAAGCCTACA AAAACGTACC ATTCTATCTG ACCAATCGTG GCTATGGCGT GTTCGTCAAT CAGCCCGAAA ACGTGGCCTT TGAAATTGCC TCGGAAAAAG TTTCGCGGGT GCAATTTAGT GTGCCAGGCC AAAGCCTCGA ATATTTTGTG ATCTATGGCC CAACGCCCAA GGAAATTCTG GAAAAACTGA CTGCTTTGAC AGGCCGCCCA GCTTTGCCGC CAGCATGGTC GTTTGGTTTA TGGCTCACCA CCTCGTTTAC CACCTCGTAT GATGAGCAAA CTGTTACCAG CTTTATCCAA GGCATGGCCG ACCGCGATTT ACCGTTGCAT GTCTTCCATT TCGATTGTTT TTGGATGCGC GAATTTCACT GGTGCGATCT TGAATGGGAT TCACGCACCT TCCCCGATCC TGAGGGCATG CTCAAGCGGC TCAAAGATCG CGGCTTGAAA ATTTGTGTTT GGATCAATCC ATATATTGCC CAGCGTTCGG CGATGTTCCG CGAAGGTATG GAGCATGGCT ACTTGGTCAA GAAGCCCAAC GGCGATGTCT GGCAATCGGA TATGTGGCAA TCGGGCATGG GCTTGGTCGA TTTCACCAAC CCCGCTGCTT GCGCTTGGTA TGCAGCCAAA CTCAAAGGCC TGCTCGATAT GGGCGTAGAT TGCTTCAAAA CCGACTTTGG CGAACGCATT CCCACCGATG TAGCCTATTT CGACGGTTCC GACCCCCAGC GGATGCACAA CTACTACACC CATCTTTACA ACAAAACCGT GTTTGATTTG TTGAAAACTG AGCGCGGCGA AAACGATGCA GTGGTGTTTG CCCGATCGGC AACTGCTGGC GGCCAACAAT TCCCAGTGCA CTGGGGCGGC GACTGCGAAT CGACCTTCGA ATCGATGGCT GAAAGTTTAC GCGGCGGTTT ATCGTTAGGG CTTTCAGGTT TTGGCTTCTG GAGCCACGAT ATTGGCGGCT TCGAGGGCAT GCCACCAGTT GAAATTTACA AACGCTGGAT TGCCTTTGGC ATGCTTTCAT CGCACAGTCG TTTGCATGGC AACCATACTT ATCGCGTGCC ATGGATTTAC GATGAAGAAG CCGTCGATGT GCTGCGCTAC TTCACCAAAC TTAAATCACG TTTGATGCCC TATCTCTATG GGGCTGCTGT GACCGCTTCC ACCAGTGGCA TTCCAGTGAT GCGGGCCATG TTGCTAGAAT TCCCCAACGA TCCGACCTGT 
GATTTCCTTG ATCGTCAATA TATGTTGGGC GATTCGTTGT TGGTTGCGCC AGTATTCGCC TACGACAACA CGGTGACCTA CTATGTGCCT GCTGGCCGCT GGACGCACAT TACGACTGGT GCGGTGGTTG AAGGCCCACG TTGGGTCACT GAAACCCACG ACATGCTGAG TTTGCCATTA TTGGCTCGCC CCAACAGCTT GATTGCGATT GGTAACAACA GCGAGCGCCC CGATTACGAC TATAGCAGCG GTGTGACCCT GAATTTATAC CAATTGGGCG ATGGTCAAGC GGCCTACACC ATGGTTCCGG CAACCAATGG CGATATTGCC GCCTCGTGGA GTGCCCGCCG CGATGGCGAC ACGATCAGAA TCGTGCAAGA AGGCCAAGCT AACGATTGGC AAGTGGTGTT GGTTGGCGTG CAACAAGTTG CCAGCGCCGA TGGCGCTTTA GTCGAGCAAC ATCCGCTCGG TGTCCAACTC ACCGCACTTG ATCAAGCCAC CAAGTTGGTG GTCAAGTTGA AGTAG
|
Protein sequence | MKFTDGYWMM REGVNALFAS QAYDGQINDT TLTVYAPGKR INHRGDTLNL GTITARFSSP MPDVVRVKLT HFEGQRSLGP NFAIAESANE SVSTSEDEQE FRLTSGKLSV QIPKTGDWSI NFWAENRRIT SSGRKGIGYI SMADAGEFMH EQLSLGVGEQ VYGLGERFTA FVKNGQSVDV WNQDGGTGSE QAYKNVPFYL TNRGYGVFVN QPENVAFEIA SEKVSRVQFS VPGQSLEYFV IYGPTPKEIL EKLTALTGRP ALPPAWSFGL WLTTSFTTSY DEQTVTSFIQ GMADRDLPLH VFHFDCFWMR EFHWCDLEWD SRTFPDPEGM LKRLKDRGLK ICVWINPYIA QRSAMFREGM EHGYLVKKPN GDVWQSDMWQ SGMGLVDFTN PAACAWYAAK LKGLLDMGVD CFKTDFGERI PTDVAYFDGS DPQRMHNYYT HLYNKTVFDL LKTERGENDA VVFARSATAG GQQFPVHWGG DCESTFESMA ESLRGGLSLG LSGFGFWSHD IGGFEGMPPV EIYKRWIAFG MLSSHSRLHG NHTYRVPWIY DEEAVDVLRY FTKLKSRLMP YLYGAAVTAS TSGIPVMRAM LLEFPNDPTC DFLDRQYMLG DSLLVAPVFA YDNTVTYYVP AGRWTHITTG AVVEGPRWVT ETHDMLSLPL LARPNSLIAI GNNSERPDYD YSSGVTLNLY QLGDGQAAYT MVPATNGDIA ASWSARRDGD TIRIVQEGQA NDWQVVLVGV QQVASADGAL VEQHPLGVQL TALDQATKLV VKLK
|