Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2816 |
Symbol | |
ID | 5734697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3578831 |
End bp | 3581089 |
Gene Length | 2259 bp |
Protein Length | 752 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279959 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001545582 |
Protein GI | 159899335 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2273] Beta-glucanase/Beta-glucan synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000106793 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTCGA ATGTACGATT TTCTTCCCGT GGAGCATCAC GCCTAGCTTT GTTAGCATGT TTGGTGCTCA GCCCAATGCT TTTGTTGGGC CAGCCTCAAA GCACATTTGC CGCCACCAAC CTCGCGCTCG GCAAAACTGC CGTCGCCTCG ACCAGTGAAA ACGCTGATTT TGGCGCAGCA CTGGCACTCG ATGGTAACGT GAATACCCGT TGGAGCAGCA CGTTTGCCGA CGGTCAATGG CTGCGCGTCG ATCTTGGCAG CGTCCAAAGC TTCAATAATG TGGTGCTGCG CTGGGAAGCT GCTTACGCCA GCAGCTATCG CATTCAAACT TCAAACGATG CCAACACATG GACAACCATC CAAACCATCA CCAATAGCGA TGGTAATGTT GACGATTTGA GCATCAGTGG CACAGGTCGC TATGTTCGCG TCGAAAGCAT CACCCGCGCC ACCCCCTATG GCATCTCGTT GTTTGAATTT GAAGTCTATG GCAATGCTGG CAGCAGCAAT ATTGCCTTGA ACAAAACGAC CAGCGCCTCA AGCAGCGAAA ATGCTGCAAC GACCGCCAAT TTTGCCGTTG ATGGCAATGT GAATACTCGT TGGAGCAGCA CATTTGCCGA TGGTCAATGG CTGCGCGTCG ATTTTGGCAG CGTCCAAAGT TTCAATCGAG TGGTCTTACG CTGGGAAGCT GCCTACGCCA CCAGCTATCG GGTGCAAACC TCGAACGATG CTAACAGCTG GACGACCATC CAAACCGTCA CGAATAGCGA TGGCAATGTT GACGATTTAA GCATCAGCGG TAGTGGCCGC TATGTTCGGA TCGAAAGCGT TACTCGCGCC ACGCAGTATG GCATTTCGTT GTTCGAGTTG GAAGTGTATA GCGGCAGCAT CCAACCAACG GTGCAGCCAA CCACCCAACC AACCACTCAA CCGACGGTAC AGCCAACCGC CCAACCAACT ACTATTCCGG GCAACTGTAC AACTTCAACC AATATTGCCT TGAACAAACC AGCCTATGCC TGGTTCTACG AATCGAAAAC CTCAAGCCCA GCCCAAGCCG TCGATGGCAA TGCCACGACC ACTCGCTGGA GCCATCTCTG GTATCCAAGC GGCCCAGCCA ATGCTTGGTT GTATGTTGAT TTAGGCTCAA CTAATGCCAC AATCAACCGC GTGCGCATCT TGTGGGAATC AGCCTACGCC GCCGATTACC AAATTCAAGT CTCGAATGAT CGCAGCAACT GGTCGAACAT ACGCAGCGTG GTTGGCAACA CCCAAACCAC CAACGACTAC AATCTCACGC CAATTACTGG GCGCTACCTG CGAATCAACA TGACCAAAAA GGGCACTGAA TATGGCTACT CCATTTGGGA ATTAGAAGTG TATGGCTGTA ATGCAGGCAA CCCCAGCTAT GATCCCGCGC CAACCCCATT GACTGGCAAT TTCAGCCGCG TTTGGGTTGA AAACTTCGAT GGCACCAGCC TGAACATGAA CAACTGGGCC TATGATAACG ATGTTCATGT CAATGGTGAA CAACAACAAT ACACCAGCAA CAACGTCGCC GTCAGCGGTG GCACCTTGAA AATCACCGCC CGCAAAGAAA CCGCCAACGG CTATCCTTTC ACCTCGGGCC GGATCTTTGG CCTAGGTCGT CAGAGCTTCT TATATGGCAA AATGGTTGCC CGCATGAAGA TGCCAGTTGG TGAAGGCTAC TGGCCAGCCT TCTGGATGAT GGGCGCAAAT ATCAATGAAG TTGGCTGGCC TGGCAACGGT GAGCTGGATA TTATGGAAAA TATCGGCTAT GGCAACTGGA CTTCAGGGGC ACTTCACGGC CCAGGCTACT GGGGAGCAGG CTCGGCAGGC GGCTTAGTCA ACTTGCCAGC AGGCCAAACC ACCGGCGCTT GGCATACCTA CGCCGTCGAA TGGGACCCAA CCTATATCAA ATGGTTCGTC GATGATCGTG AAATTCTGAG CTTCACTCGC GCCCAAATCA TCGCCGATTA TGGCCAATGG ACGTATAACA ATCCCAAGTT CTTTATCTTG AATTTGGCCT TGGGCGGCGA ATATCCCGCT GGCTACAACG GCTGTACTGG CAATCCATTG CCAGCAGGCT GTGCCTACTT TGGAGTCAAG CAATCAACCG TCGATAGCAT TGTGGCTGGT AATGGCTTGC TCGAAGTTGA TTACGTTGAA GTATGGCAAA AACCAGCTGC AACAACCCAA GGTGCTCAAT ATTTGCCTTT GGCGCAACAA GAAAACTAA
|
Protein sequence | MFSNVRFSSR GASRLALLAC LVLSPMLLLG QPQSTFAATN LALGKTAVAS TSENADFGAA LALDGNVNTR WSSTFADGQW LRVDLGSVQS FNNVVLRWEA AYASSYRIQT SNDANTWTTI QTITNSDGNV DDLSISGTGR YVRVESITRA TPYGISLFEF EVYGNAGSSN IALNKTTSAS SSENAATTAN FAVDGNVNTR WSSTFADGQW LRVDFGSVQS FNRVVLRWEA AYATSYRVQT SNDANSWTTI QTVTNSDGNV DDLSISGSGR YVRIESVTRA TQYGISLFEL EVYSGSIQPT VQPTTQPTTQ PTVQPTAQPT TIPGNCTTST NIALNKPAYA WFYESKTSSP AQAVDGNATT TRWSHLWYPS GPANAWLYVD LGSTNATINR VRILWESAYA ADYQIQVSND RSNWSNIRSV VGNTQTTNDY NLTPITGRYL RINMTKKGTE YGYSIWELEV YGCNAGNPSY DPAPTPLTGN FSRVWVENFD GTSLNMNNWA YDNDVHVNGE QQQYTSNNVA VSGGTLKITA RKETANGYPF TSGRIFGLGR QSFLYGKMVA RMKMPVGEGY WPAFWMMGAN INEVGWPGNG ELDIMENIGY GNWTSGALHG PGYWGAGSAG GLVNLPAGQT TGAWHTYAVE WDPTYIKWFV DDREILSFTR AQIIADYGQW TYNNPKFFIL NLALGGEYPA GYNGCTGNPL PAGCAYFGVK QSTVDSIVAG NGLLEVDYVE VWQKPAATTQ GAQYLPLAQQ EN
|
| |