Gene Information |
Locus tag | Haur_3463 |
Symbol | |
ID | 5735324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4359817 |
End bp | 4361982 |
Gene Length | 2166 bp |
Protein Length | 721 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280610 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001546227 |
Protein GI | 159899980 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTATG AGCAACAAAT CGAAGCATTG TTGGCCCAAA TGACTCTTGC CGAGAAGATT GGCCAAATGC GCCAGCTTCA TGGCACTGGC GAAACCCAGC AGCAACTGGT GCGCGAAGGC AACTTGGGGT CGGTTCTAAA TGTGATTGAT GCTGATGCCC ACGAGATTCA GCGCATTGCC GTTGAAGAAT CACGCTTGGG CATTCCGCTG TTAATTGGCC GCGATGTGAT CCACGGTTTT CGCACAATCT TCCCAATTCC ACTTGGCCAG GCTGCTTCGT TTAATCCTCA GCTTGTGCGC GAAGCCGCGC GGATTGCCGC CCGCGAAGCC TCGGCCTCTG GGATCAACTG GACATTTGCC CCGATGATCG ATATTTCACG CGACCCACGG TGGGGGCGGA TCGCCGAAAG CTGTGGCGAA GATGCCTATC TTTCAAGTTT GATGGGTGTG GCGATGGTCG AAGGCTTTCA AGGCGACGAT TTGACCGCCC CCGATGCGAT TGCTGCTTGT GCCAAACATT ATGTGGGCTA TGGCGCTAGC GAAAATGGCC GCGATTACAA CACTGCTTGG ATTCCCGAAG TGCTCTTACG TGATGTTTAT TTAGCACCAT TCAAAGCTGC CGCCGATGCT GGCGTGGCCA CCATGATGAG CGCCTTCCAC GATTTGAATG GTGTGCCAAC CTCAGGCAAC GAATTTACGC TGCGTCAAAT TTTAAAAGGC GAGTGGAATT ACGATGGTAT GGTGGTCAGC GATTGGGCCT CGGTTGCCGA AATGATCGCC CATGGCTATG CCGCTGATTT GCGCGATGCT GCCTTGAAAG GTGTAACGGC TGGGGTCGAT ATGGAAATGG CCAGCACCAG CTACGCCGAA TATCTGGCTG CGTTGGTTGA AAGTGGCGCA CTCAGCCTCG ATTTAATTGA TGATGCTGTG CGGCGGGTGT TGCGCATCAA GTTCCGTTTG GGTTTGTTCG ATCAACCGTA TGCTAACGCT GCGGCGGCTG ATTCAGTCGT TGCGCCTGAT CATTTGGCTT TGGCTCGCCA AATTGCCAAA GAAAGTTGTG TGCTATTGAG CAATCAGCAA ACTTTGCCGC TCAACCCACA ACAAACGCGG GTGGCAATTG TTGGGCCGCT CGCCAACCAT GCCGCCGATC AACTTGGCTG CTGGGTATTC GATGGCAAGC CCGAAGATAG CCAAACTCCA TTACAAGCGA TTCGCGAATT GCTTGGTGAC GAGCGGGTGC AATTTGCCCA AGGCTTGCCC GAAGCCCGCA GCTTAGATCA AAGTCTATTT GGCGAGGCAG TCGCGGCGGC TCAAACTGCT GATGTGGTTA TTGCCTTCCT TGGTGAAGAT GCTGGCTTGA GTGGCGAAGC CCATAGCCGC GCATTCATCG ATTTACCTGG CGCACAACTG GCCTTAGTCG ATGCCTTGGT GGCAACCGGC AAACCAGTGG TTGCGGTTGT GATGGCTGGA CGCTCGTTGG TGTTGGGCGA ATTGCAGGAT AAAGTGCAGG CGATTTTATA TGCTTGGCAT CCTGGCACCA TGGCTGGCCC AGCGCTCGCC GATTTGCTGT TTGGCTTGGA TAACCCTTCA GGCCGCTTGC CAATTAGCTT CCCGCGCACC GTCGGCCAAG TGCCAATTTA TTACAATCGC AAAAACACTG GTCGCCCACC AAGCGAAGAT GCACCGAGTA TTCCCACGGG CACGCCGCTT GATCCGAGTG GTTTTACCTC AAGCTACCTC GATGTTGATC ATCGGCCCTT GTTTGCTTTT GGTTATGGCT TGAGCTACAG CACATTTAGC 
TATAGCAATT TGCGTTTATC TAGCCAAAAA CTGGCGGTTG GCGACACACT TAGCATCACC ACTACGGTGA CCAACACTGG CAAGTATGCT GGCGCAGAAG TGGTGCAATT GTATATTCGC GATCTGGTTG GCTGTATGAC TCGCCCAATC AAAGAACTCA AAGGCTTCCA ACGAATTCAT TTGGAGCCAG GCCAAAGCCA AACTGTAACA TTTGAACTTA GCAGCGCTGA CTTGAGTTTC CATAACAACG CCATGCAACG GATCGTCGAG CCAGGCGAAT TTAATCTCTG GGTTGCGCCA AGCAGCATTG GTGGTTTGCA GGCAAGCTTT GAATTAGTAG CCAAGAGCAA AGAACATCGA GCATAA
|
Protein sequence | MQYEQQIEAL LAQMTLAEKI GQMRQLHGTG ETQQQLVREG NLGSVLNVID ADAHEIQRIA VEESRLGIPL LIGRDVIHGF RTIFPIPLGQ AASFNPQLVR EAARIAAREA SASGINWTFA PMIDISRDPR WGRIAESCGE DAYLSSLMGV AMVEGFQGDD LTAPDAIAAC AKHYVGYGAS ENGRDYNTAW IPEVLLRDVY LAPFKAAADA GVATMMSAFH DLNGVPTSGN EFTLRQILKG EWNYDGMVVS DWASVAEMIA HGYAADLRDA ALKGVTAGVD MEMASTSYAE YLAALVESGA LSLDLIDDAV RRVLRIKFRL GLFDQPYANA AAADSVVAPD HLALARQIAK ESCVLLSNQQ TLPLNPQQTR VAIVGPLANH AADQLGCWVF DGKPEDSQTP LQAIRELLGD ERVQFAQGLP EARSLDQSLF GEAVAAAQTA DVVIAFLGED AGLSGEAHSR AFIDLPGAQL ALVDALVATG KPVVAVVMAG RSLVLGELQD KVQAILYAWH PGTMAGPALA DLLFGLDNPS GRLPISFPRT VGQVPIYYNR KNTGRPPSED APSIPTGTPL DPSGFTSSYL DVDHRPLFAF GYGLSYSTFS YSNLRLSSQK LAVGDTLSIT TTVTNTGKYA GAEVVQLYIR DLVGCMTRPI KELKGFQRIH LEPGQSQTVT FELSSADLSF HNNAMQRIVE PGEFNLWVAP SSIGGLQASF ELVAKSKEHR A
|
| |
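
The coordinate and length fields in the record above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the record states neither convention explicitly. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon (assumed present) is not translated into a residue.
    return codons - 1 if has_stop_codon else codons


def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases, ignoring the spaces used to group the sequence."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(4359817, 4361982)
    print(length, protein_length(length))
```

With the values listed for Haur_3463, the first two functions reproduce the Gene Length and Protein Length fields. `gc_fraction` can be applied to the Gene sequence field to compare against the listed GC content; that comparison has not been carried out here.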