Gene Information |
Locus tag | Haur_2401 |
Symbol | |
ID | 5734282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3059378 |
End bp | 3060736 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279542 |
Product | Beta-glucosidase |
Protein accession | YP_001545169 |
Protein GI | 159898922 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
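The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 3059378
end_bp = 3060736
gene_length_bp = 1359
protein_length_aa = 452


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # compare with Gene Length
print(expected_protein_length(gene_length_bp))  # compare with Protein Length
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.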
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00110187 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAGC GTGCCTTTCC TTCCGATTTC CTCTGGGGTT CCGCCACATC GTCGTATCAA ATTGAAGGCG CTGCGTTTGC CGATGGCCGT AGCGAATCGA TTTGGGATCG TTTTTGTAAA CAACCAGGCG CTATTTTAGA TCAATCCAAT GGTGATATTG CTTGCGATCA TTACAATCGC TATCGTGATG ATGTTGCTTT GATGGCGCGT TTGGGTCTAC AAGCCTATCG ATTTTCGGTG GCTTGGCCAC GCGTGTTGCC CAATGGACGT GGTGCGGTCA ATCAGGCTGG CCTTGATTTT TATCGGCGCT TGGTCGATGA ACTACTGCAA CACAACATTC GGCCATTTGT AACGCTCTAC CACTGGGATT TGCCGCAAAT TCTCGAAGAT GCTGGCGGTT GGCCTGAGCG GGCTACGGCT GAAGCTTTTG TTGAATATGC CGATGCGGTC AGCCGTGCCT TGGGCGATAC TGTCAAAGAT TGGATTACTC ACAACGAGCC ATGGTGTGCA GGGTTGCTCG GCTACCAAAT TGGCGAACAT GCTCCAGGCC GAAAAAATTG GAATGACGGC TTGAAGGCCA GCCATCATCT TTTGCTTTCG CACGGCTGGG CGGTTGATGT AATTCGGCGC AACGTACCAC AGGCTAGCGT TGGGATCACG CTGAACTTTA CGCCAGCGAT GCCTGCTTCA CGTAGTACCG AAGATTTGAA TGCAACCCGC CACTTTGATG GTTTCTTCAA TCGCTGGTTC CTCGACCCAG TGTATGGCCG CGAATATCCT GCCGATATGG TGCGCGATTA CACTGAACTT GGCTACTTGC CTAATGGCCT CGATTTTGTT CACGATGGCG ACTTCAAGGC CATGGCCGCA ACTACCGATT TCTTGGGTGT AAACTACTAC AGTCGGGCGG TGATTCACGA TCCCAAAACT GGAACCGCGC CCAAACTAGA TTCCGAATAT ACCGATATTG GCTGGGAAGT GTACCCTCAA GGCTTAGGCG ATTTGCTCAA GCGCTTGGCC TTTGCCTACA ACCCAGGCAA AATTTATGTG ACTGAAAATG GTGCTAGCTA CAACGATGGC CCTGACGCGC ATGGCGAAGT CAATGATACC CGTCGCACCC AATATTTGCA CGATCACCTG AGCGTTTGCT CCGATGCAAT TGCTGCTGGG GTGCCATTGG CGGGCTACTT TGTCTGGTCA TTGATGGATA ACTTTGAGTG GGCCAAGGGC TATAGCCAAC GCTTTGGCGT GATTTGGGTC GATTACGAAA CCCAACAACG CATCCCCAAG GCTAGCGCTC ATTGGTATAG CCGTGTGGTC AAGGCCAACG CCGTCCAACC ATTGGAAGTT TTAGCCTAA
|
Protein sequence | MTQRAFPSDF LWGSATSSYQ IEGAAFADGR SESIWDRFCK QPGAILDQSN GDIACDHYNR YRDDVALMAR LGLQAYRFSV AWPRVLPNGR GAVNQAGLDF YRRLVDELLQ HNIRPFVTLY HWDLPQILED AGGWPERATA EAFVEYADAV SRALGDTVKD WITHNEPWCA GLLGYQIGEH APGRKNWNDG LKASHHLLLS HGWAVDVIRR NVPQASVGIT LNFTPAMPAS RSTEDLNATR HFDGFFNRWF LDPVYGREYP ADMVRDYTEL GYLPNGLDFV HDGDFKAMAA TTDFLGVNYY SRAVIHDPKT GTAPKLDSEY TDIGWEVYPQ GLGDLLKRLA FAYNPGKIYV TENGASYNDG PDAHGEVNDT RRTQYLHDHL SVCSDAIAAG VPLAGYFVWS LMDNFEWAKG YSQRFGVIWV DYETQQRIPK ASAHWYSRVV KANAVQPLEV LA
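The sequences above are printed in space-separated groups of 10 letters, so they need to be joined before any analysis. The sketch below is illustrative only. It shows one way to strip the grouping, compute GC content for comparison with the GC content field, and reverse-complement a sequence, which is relevant because the gene lies on the minus strand. The helper names are hypothetical, and only the first and last groups of the gene sequence are used as sample input, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits the sequence into 10-letter groups."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Translation map for unambiguous DNA bases only (A, C, G, T).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement, e.g. to map a minus-strand gene onto the plus strand."""
    return seq.translate(_COMPLEMENT)[::-1]


# Sample input: the first two and last two groups of the gene sequence above.
head = clean_sequence("ATGACTCAGC GTGCCTTTCC")
tail = clean_sequence("ATTGGAAGTT TTAGCCTAA")

print(head[:3])              # start codon of the CDS as displayed
print(tail[-3:])             # stop codon of the CDS as displayed
print(gc_percent(head))      # GC content of the 20-base sample only
print(reverse_complement(head))
```

To check the full record, pass the complete Gene sequence value to `clean_sequence` and compare `len(...)` and `gc_percent(...)` with the Gene Length and GC content fields.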