Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2545 |
Symbol | |
ID | 5734423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3270955 |
End bp | 3273312 |
Gene Length | 2358 bp |
Protein Length | 785 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279685 |
Product | Alpha-glucosidase |
Protein accession | YP_001545311 |
Protein GI | 159899064 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000167108 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACATT CATCAATGTT TACACTTGAT CAACGTGATG CCCGTAGCTG TCTGTTTAAA CGAGCAAATC AGCAAATTCG CATCAGCATC TGTAGCGAAC AGCTTATTCG GGTTTGTATT AGTAATGATG CACCATTAGC ACGGCGTTCA TGGGCGGTTA ATCGCGCTGA TGATGCATGG GAGGCACTTG CATGGTCGGT TTCAGAAAAT TCAGATTCAA CGAGGATTGA GACGACAAAG CTACAAGTAA ATGTTGCACA TGCTGATGGC CGAATTACAT TTAACAATCT GGAGCAGCAG CCATTTTTTA GTGACGTTAC GCCAGCAAGC TACAACGCCG ATGGATGGGT TGTTCGTAAG CAAATTTATA ATTCTGAGCA TTTTTATGGT TTTGGTGAAC GCACTGGCTG GCTTGAAAAA ACAGGTCAGC ATTTTTTAAA TTGGACGCTC GATCCTGAAC CACATCATAG CCCGCGTATT GATAATATGT ATGCAACAAT GCCAGTTTTT ATGGGATTGC AGCCTAATCT CTGCTATGGC GTGTTTTTCA ATACATCATT TCGCTCAAGT ATTGATGTTG GGGCGGCTGA TGCTGCATTG TTGAGCTTAA AAACCCAAGG CCCAGATCTC GATTATTATG TGGTTTTGGG TACAACACCT GCCGAAATTA CCGCTACTTG GCGTGAATTA TTGGGAGCAA TGCCTTTACC GGCCTATTGG GCGCTTGGTT ATCATCAAAG TCGCTGGGGC TACGATTCAA GCATGACGAT GCAGGCAATT GCTGATGAAT TACGTGCTCG CAATATTCCG TGCGATGCGA TTCATTTTGA TATTGATTAT ATGGATGGTT ATCGGGTTTT TACTTGGCAT CCTGAACGTT TTGCCCAGCC AGCTCAATTG TTGCAAAATT TGGCTCGTGA TGGCTTCAAT GTGGTAACAA TCATTGATCC TGGGGTCAAA ACTGACCCAA ATTATGCAGT ATTTGCCGAA GGAATCGCCA ACGATTATTT TATCAAGCGG GCTGATGGAA CGTTATTCAG TGGTTATGTT TGGCCTGATG ATAGCGCATT TGCTGATTTT ACCCGTGCTG ATGTACGTGA ATGGTGGGGA AATTTACATA AGAAATTGAT CGATGCTGGG GTACGCGGCA TTTGGGATGA TATGAATGAA CCAACCGTGT TTGACCGACC TTTTAGCGAA GGTGGTGGCA ATGGTGGTAC GATCGATCTG AATGCGCCGC AAGGATCTGC CGATGAGCGT ACAACTCACG CCGAAGTACA TAATTTGTAT GGTTTGTTGA TGGCTCGCTC AACTTATGAA GGCTTGCGAC AATTGCGCCC TAATGAACGA CCATTTGTAT TAACTCGCTC AGGTTTTGCT GGTTTATCAC GATGGGCGAC TCTCTGGACT GGTGATAATT CGGCGTTGTG GGAACATTTA GAAATGATGT TGCCGCAAAT TGCTAACTTG GGGCTTTCAG GAATTCCCTT TGTTGGCGTG GATATTGGTG GATTTTTTGG CAATGCGTCG CCAGAATTAT GGGCACGTTG GGTTCAAGTT GGGGCATTTC TGCCGTTCTG TCGTGGGCAC TCGTGTTCGG GCACACGTCC GGCTGAGCCG TGGGCGTTTG GCGAACGCAC CGAAGCAATT GCGCGGGCCT ACCTTAGTCT GCGCTATCGT TTATTGCCCT ACTTGTATAC GTTGTTTTAT CAAGCTTCAA CCACAGGTGC GCCAATTATT CGTCCATTGG TGTATGAATT TGCGGCTGAT CCGACCACTC ACGCCTTGCA CGATCAGGTG TTGTGTGGCT CGCAATTAAT GCTTGCGCCG ATTGTACGGC CTGGGACTGA ATATCGTTCG GTTTATTTGC CCGCTGGCGA GTGGTACGAT TGGTGGACGG GTGAGCGGAT CAAGGGTTCG CAGCATATTT TGGTGCATGC GCCGCTTGAA CGGTTACCGC TGTATGTGCG TGGTGGGGCG ATTTTGACTC TCGGCCCAGT ACTCAACTAC ACCAGCGAAG CCCCACTTGA TCCTTTAACC CTCGATGTTT ACCCCAGTGG CACAAGCGAA TGGACGCTCT ACGAGGACGA TGGCATCTCG TTCGATTACG AACAGGGCCA AGCAGCGACC ACAACGTTTA GCTGTGTTGA AACTGAGCAA ACAATTACGT TGATGATTGC CGCCCGCCAA GGTAGTTGGC AACCTGCCCT GCGCACAATC GTGGTCAATC TGCATTCGCT GCCGCCCAAA GCGGTTTTGT TTGATACAAA TGCAATCGAA TGGGTCTACG CTGAAGGCGC AACGACCGTG AGTTTTGCTG ATGATGGCTT GGCACACACG CTTGAGGTGC AGTTGTAA
|
Protein sequence | MEHSSMFTLD QRDARSCLFK RANQQIRISI CSEQLIRVCI SNDAPLARRS WAVNRADDAW EALAWSVSEN SDSTRIETTK LQVNVAHADG RITFNNLEQQ PFFSDVTPAS YNADGWVVRK QIYNSEHFYG FGERTGWLEK TGQHFLNWTL DPEPHHSPRI DNMYATMPVF MGLQPNLCYG VFFNTSFRSS IDVGAADAAL LSLKTQGPDL DYYVVLGTTP AEITATWREL LGAMPLPAYW ALGYHQSRWG YDSSMTMQAI ADELRARNIP CDAIHFDIDY MDGYRVFTWH PERFAQPAQL LQNLARDGFN VVTIIDPGVK TDPNYAVFAE GIANDYFIKR ADGTLFSGYV WPDDSAFADF TRADVREWWG NLHKKLIDAG VRGIWDDMNE PTVFDRPFSE GGGNGGTIDL NAPQGSADER TTHAEVHNLY GLLMARSTYE GLRQLRPNER PFVLTRSGFA GLSRWATLWT GDNSALWEHL EMMLPQIANL GLSGIPFVGV DIGGFFGNAS PELWARWVQV GAFLPFCRGH SCSGTRPAEP WAFGERTEAI ARAYLSLRYR LLPYLYTLFY QASTTGAPII RPLVYEFAAD PTTHALHDQV LCGSQLMLAP IVRPGTEYRS VYLPAGEWYD WWTGERIKGS QHILVHAPLE RLPLYVRGGA ILTLGPVLNY TSEAPLDPLT LDVYPSGTSE WTLYEDDGIS FDYEQGQAAT TTFSCVETEQ TITLMIAARQ GSWQPALRTI VVNLHSLPPK AVLFDTNAIE WVYAEGATTV SFADDGLAHT LEVQL
|
| |