Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0096 |
Symbol | |
ID | 5731989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 122510 |
End bp | 125185 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277218 |
Product | alpha-1,6-glucosidase, pullulanase-type |
Protein accession | YP_001542876 |
Protein GI | 159896629 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases |
TIGRFAM ID | [TIGR02103] alpha-1,6-glucosidases, pullulanase-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.380747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGATC AATCTGTTGT TGAAGAGCAG CCCAGCGTCG GGCAGCTTGA CCGGCTGCGG GCGTATTGGG TGCGTCGCGA TACGATTGTT TGGAATATTC ATGCGCTCCC CAATGCTCGC TATGAGTTGC ATTATTCGCT TGATGGTTCG TTGGAGCTAA CGCCGCAGGG AATTCAGCAT GGCACGGTTT TGCCGCTCGA ACGTGATCCC TTGGGCTTAG ATTCGACGTT GGCAGCGGCT TTTCCGCATT TGCAGGGCTT GCCAGTCTTG CGCCTCGCCC CTGAACACCA CGACTTGGTT GCCACAATCT TGCGTGGCCA AGTGGCGGTG CAGGCCTATC ACGCTGATGA TCAGAGTTTG ATCGATGCGA CGGCGTTGCA AATTCCTGGG GTGCTTGATG ATTTGTATCA TTACACTGGC GAGCTTGGGG TGAGTTTCGA TGAGCAGAAA CGCCCGATTT TGCGCTTGTG GGCACCAACC GCTCGCTCAG TTAGTCTGTT GCTCTACGCT GATTCGGCTA AGGGAACTCG CGCGATTCGC AGTGGGATGA TGTTTGACCC TCAGACTGGA GTTTGGAGCA TCGTTGGCAA GCCCAGTTGG AAGAACCAGT TTTATCGATT TTTGGTCGAT GTGTTTGTGC CGAGCACTGG CCAATTTGAG CAAAATAGCG TCACTGATCC CTATTCGCTG AGTTTATCGA CCAATAGCCA ACATTCGCAA ATTGTTGATC TGCAAAGCCA ACGCCTTAAA CCCAAAGGTT GGAGCAAACT CGCCAAGCCC AGCCTAACCC AGCCCACCGA TATTGTGCTC TACGAACTGC ATGTGCGTGA TTTTTCGATC ACCGATTCGA GTGTGTCAGC TGAGTTACGT GGCACCTATA AAGCCTTTAG CCAAGTTGAA TCGAATGGCA TGCAACATTT GCAGCGACTG GCTAAGGCCG GGGTGAGCCA TGTGCACTTG CTGCCAGTCT TTGATATTGC CACCATTGAA GAACATCGGC CTGATCGCAC GCGGATTGAT TTTGACTATT TAGCCAGTTT GCCTGCTGAT TCGACTGAGC AGCAAGCCTA TCTTTTTCCA ATTCGCGATC GTGATGGCTT CAATTGGGGC TATGACCCTC GCCATTACAC CACGCCCGAA GGCTCGTATA GCACCGATCC TGATGGTTCA ACGCGAATCT ACGAGTTTCG CGAGATGGTT CAAGCGCTGA ATCAGATTGG GCTGCGGGTG GTGATGGATG TGGTTTACAA TCATACCCAT TCGAGTGGTC AAGCCAGTTG CGCCGTGCTC GACCGAATTG TGCCAGGCTA CTATCATCGC TTGGATGCTG ATGGCAATGT TTGCAACAGC ACCTGCTGCG CCAACACCGC TAGCGAACAG CATATGATGG AAAAATTAAT GCTCGATTCG CTGCGCACAT GGGCCGTCGA TTATAAAGTT GATGGCTTTC GCTTCGATTT GATGGGCCAT CATCTGAAGC GTAATATGTT GGCGATTCGC GCCATGCTCG ATAGTTTGAC GTTGGCTGAG CATGGCGTTG ATGGTAAAGC GATCTATGTG TATGGCGAGG GCTGGAATTT TGGCGAAGTT GCCGACGGTG CACGCGGCGA AAATGCCTCG CAATATGCCA TGGCTGGTAC GGGCATTGGC ACCTTCAGCG ATCGTCTGCG CGATGCGGCG CGGGGTGGCG GCCCCTTTGT TGGTTTTCAA GTGCAGGGCT TTGCCACAGG CTTGCTTGAT CGGCCTAATG TGTACGAACA GCGGGCGTTT GTTGAGCGCC ATTATCAAGT TGCGGTGCTC AGCGATGTGG TGCGGCTGAG TTTGGCGGGC AATTTGGCTG AGTATCCAAT CCTCAGTTGC GAAGGCCAAC AAACTTTGGG CGGCCACTTG TATATCAATG GCAAACCAGC AGCCTATGGT TTACGTCCTG ATGATCATAT TGCCTATGTT TCAGCCCACG ATAATGAAAC CTTATTTGAT GGAGTGCAAG TCAAAGCCCC GCTCGAATCG CCAATCGCCG AGCGGGTGCG TATGCACAAT TTGGCCTTGA GTTTAGTGGC CTTAGCCCAA GGAATTCCAT TTTTTCATGC TGGCGATGAG TTGCTGCGCT CGAAATCGCT GGATCGCAAC AGCTACAACT CCAGCGATTG GTTTAATCGC ATCGATTGGA CGGGCCAGCA GAATACTTGG GGTTCGGGCT TGCCGCCCTC GGCGGATAAC CATGAACATT GGGCCACGGT CGGGCCGTTG TTAGCCAACC CTGCACTCAA GCCAACCCCT GAGGATATGG CATTTAGCTA TGCGCATTTT CAAACGATGC TACAAATTCG GCGTAGCTCA GGCTTGTTTC GGTTGAACAG TGCTGAATTG ATCAAGCAAA AAGTCTGGTT TCCCAATACT GGGCCTGATC AAGTGGTTGG TTTGGTGCTG ATGGTGCTTG ATGATGGTGT AGGCGAGCAG TGCGATCAGC AATTTAGCCG CATTGTGGTG GCTTTCAATG GCTCGCACCA TGATTTAAGT TATAGCGATG CCAGTTTTGG TCATTACAAT TTGCAACTGC ACCCCTTACT GGCAAATGGC TATGATCCAG TGCTGGCGCA GGCGAGCTTT GATCGCAACC TTGGCAGCAT CAGCGTGCCG GGGTTTAGCT GTGTGGTGTG GGTCGAATAT CGCTAA
|
Protein sequence | MLDQSVVEEQ PSVGQLDRLR AYWVRRDTIV WNIHALPNAR YELHYSLDGS LELTPQGIQH GTVLPLERDP LGLDSTLAAA FPHLQGLPVL RLAPEHHDLV ATILRGQVAV QAYHADDQSL IDATALQIPG VLDDLYHYTG ELGVSFDEQK RPILRLWAPT ARSVSLLLYA DSAKGTRAIR SGMMFDPQTG VWSIVGKPSW KNQFYRFLVD VFVPSTGQFE QNSVTDPYSL SLSTNSQHSQ IVDLQSQRLK PKGWSKLAKP SLTQPTDIVL YELHVRDFSI TDSSVSAELR GTYKAFSQVE SNGMQHLQRL AKAGVSHVHL LPVFDIATIE EHRPDRTRID FDYLASLPAD STEQQAYLFP IRDRDGFNWG YDPRHYTTPE GSYSTDPDGS TRIYEFREMV QALNQIGLRV VMDVVYNHTH SSGQASCAVL DRIVPGYYHR LDADGNVCNS TCCANTASEQ HMMEKLMLDS LRTWAVDYKV DGFRFDLMGH HLKRNMLAIR AMLDSLTLAE HGVDGKAIYV YGEGWNFGEV ADGARGENAS QYAMAGTGIG TFSDRLRDAA RGGGPFVGFQ VQGFATGLLD RPNVYEQRAF VERHYQVAVL SDVVRLSLAG NLAEYPILSC EGQQTLGGHL YINGKPAAYG LRPDDHIAYV SAHDNETLFD GVQVKAPLES PIAERVRMHN LALSLVALAQ GIPFFHAGDE LLRSKSLDRN SYNSSDWFNR IDWTGQQNTW GSGLPPSADN HEHWATVGPL LANPALKPTP EDMAFSYAHF QTMLQIRRSS GLFRLNSAEL IKQKVWFPNT GPDQVVGLVL MVLDDGVGEQ CDQQFSRIVV AFNGSHHDLS YSDASFGHYN LQLHPLLANG YDPVLAQASF DRNLGSISVP GFSCVVWVEY R
|
| |