Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0888 |
Symbol | |
ID | 5732789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1014012 |
End bp | 1015025 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278020 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001543664 |
Protein GI | 159897417 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACATT ATGGCTTCAA TTTACAATGG ATGTATCATT ATGTTCCAGG CAAGTTGCCA GCCGAGCCAG ACCATAAAGC CTTAACCTGG ATCGCCAAAC AAGGCTTTAA TTTCATTCGG ATTCCAACCA ATTATTGGTT TTGGACTCGT GATTGGCACT ATACCAATCC CGATGAGACG ATCATCGGGT ATATTGACCG TTATATCTCA GCATGTCGTG AATATGGCCT GCATGTCAGC CTTAATTTGC ATCGAGCGCC TGGATATTGT ATCAATGGGA ACGATCTCGA ACGACATAAT TTATGGCTAG ATGCCGAAGC TCAGGCAGGC TTCGTGTGGC TGTGGGAGTA TTTCGCCCAA CGCTACAAGG GTATTCCGGC TAGCCAATTA AGCTTCGATT TGCTCAACGA ACCGCCCAAC GTCGATCAGT ATGGCTTGAC GCGACTCAAC CATGCGGCGA TCATGCAACG GACGGTCGCA GCAATTCGCG CAATCGACCC ACAACGGACG ATTGTGATCG ATGGACTTGG TGGTGGGCAC TTGGCGATGC CCGAACTCGC TGATTTGGGC GTAACCCACA GCGGGCGCGG CTATCAGCCA ATGGCGGTGA GCCACTATCA AGCGAGTTGG TGGGATGGTC ACGAAGGTTT AGCGGCACCA ACCTACCCTG TAACCTGGCA CAACCATTAT TGGGATCGGG CCGGCTTAGT TGAATTCTAC CAACCATGGC GCGACGTACA AGCGCGGAAC GTCGCCATCC ATATTGGTGA ATTTGGCTGC TACAACCGTA CCCCCAATGA TGTTGCTTTG CGTTGGTTCC GCGATCTGCT CAGCGTTTAT CAAGAATTTG GTTGGGGCTT CGGTCTCTGG GAGTTTGAAG GAGCATTTGG AATTATTAAT CATGGCCGAC CACATGCTCG CTACGAAGAT GTTGATGGCT ACTCCGTTGA CCGTGATTTA CTCGATTTGC TCCTCGCTGC CCGTATACCC GAAGCGCCAG AAGGAATACA ATAA
|
Protein sequence | MPHYGFNLQW MYHYVPGKLP AEPDHKALTW IAKQGFNFIR IPTNYWFWTR DWHYTNPDET IIGYIDRYIS ACREYGLHVS LNLHRAPGYC INGNDLERHN LWLDAEAQAG FVWLWEYFAQ RYKGIPASQL SFDLLNEPPN VDQYGLTRLN HAAIMQRTVA AIRAIDPQRT IVIDGLGGGH LAMPELADLG VTHSGRGYQP MAVSHYQASW WDGHEGLAAP TYPVTWHNHY WDRAGLVEFY QPWRDVQARN VAIHIGEFGC YNRTPNDVAL RWFRDLLSVY QEFGWGFGLW EFEGAFGIIN HGRPHARYED VDGYSVDRDL LDLLLAARIP EAPEGIQ
|
| |