Gene Information |
Locus tag | Haur_0296 |
Symbol | |
ID | 5732191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 351585 |
End bp | 354125 |
Gene Length | 2541 bp |
Protein Length | 846 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277420 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001543076 |
Protein GI | 159896829 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
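The coordinates and lengths in the table above are mutually consistent, which a short calculation can confirm. The sketch below assumes that Start bp and End bp are 1-based and inclusive (a common convention for such records, but not stated in the table) and that the CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(351585, 354125)
print(length, protein_length(length))  # prints "2541 846"
```

Both values match the reported Gene Length (2541 bp) and Protein Length (846 aa).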
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGTA CACCGCTCGA ACGTTCGAGC CGACGCTGGC GGATTATCGC CGGATTCGTT GCTTGTTTGC TGATTGCAGG GTTAATTATT AGCCCAACCA CCCCCACCAA AGCCGCCGAG CCACTCTATC GCTATGGCGA GGCGCTGCAA AAGTCCTTTT TCTTCTACGA AGCTCAACAA GCTGGGCCAA AACCAAGCTG GAATCGCGTT TCATGGCGCG GCGATTCGGT GCTTACCGAT GGCGCTGATG TTGGTCTCAA TCTCAGCGGC GGCTGGTTCG ACGCAGGCGA TCACGTCAAA TTTGGCTTTC CAATGGCGGC TTCGGCCACA ATGCTGGCGT GGGGCGCGGT CGAATATCGC GATGCCTATG CCCAAAGCGG CCAACTCGAC GAATTGCTGA ACAATTTGCG CTTCGTCAAC AACTACTTCA TCAATGCCCA TCCCTCGCCA AATGTGCTTT ATGGACAAGT TGGCAATGGC GGCAAAGACC ACGCCTTCTG GGGACCAGCT GAAATTATTC ACCTCGACGA CCAAGCAGGC CCACGACCAT CGTACAAAAT TGATGCAACT TGTGGTGGCT CAGATTTGGC AGGCGAAACC GCCGCTGCCA TGGCTGCCTC GTCGATGGTC TTTCGCCCAA CCGACCCTGC TTATGCTGAT ACGCTCCTAA GCCATGCTCG CCAACTCTAC ACGTTTGCCG ACACGGTGCG CGGCAAATAT AGCGACTGTA TCACCGACGC TACCTCGTTC TACAACTCGT GGAGCGGTTA CAACGATGAG TTGGTTTGGG GCGCAATTTG GCTCTATCGC GCTACGGGCG AAGCCAGCTA CCTGAGCAAG GCCGAGCAAT ATTATGCCAA TCTCAGCACC GAACCCCAAA GCACAATCAA ATCGTATCGT TGGAGCATCG CATGGGATGA TAAATCCTAT GGCTGTTATT TGTTGCTAGC CAAATTGACC GGCAAACAAC AATACAAAGA CGATACCGAA CGCTGGTTGG ATTATTGGAC AGTCGGCTAT AACGGCCAAC GTGTTACCTA TTCGCCAGGT GGCCTAGCAC AGTTGGATAC CTGGGGAGCC TTGCGCTACT CGGCCAACAC CTCATTTGCC GCCTTTGTCT ACAGCGATTA CATCACCGAT GCTACCAAAA AAGCTCGCTA CCACGACTTT GCGGTCAGCC AAATCAACTA TATGCTGGGC AGCAATCCTC GCAACAGCAG CTATGTGGTT GGCTTCGGCA ATAATTCACC AGTCAATGTC CACCATCGCA CCGCCCACGG CTCATGGACA GATTCATTGA GCAATCCAGT CAATCAACGC CACATTTTAT ATGGGGCTTT GGTTGGCGGC CCAGCCAAAG GTTCGGGCGA TGCTTACACC GATAGCCGCA ACGATTATGT GGCCAACGAA GTGGCGACCG ACTACAACGC AGGTTTTACT AGCGCCTTGG CACGGATGTA TAGTGAATTT GGCGGCGCAC CACTCGCCAG CTTCCCACCA ATCGAAACGC CTGAAGATGA ATTTTTCGTG GAAGCCAAAG TTAATGCTTC AGGCCCACGC TTCATCGAAA TTAGCGGCGT ATTGCACAAC CAAAGTGCTT GGCCGGCCCG CAACAGCACC AAACTCAGCT ATCGCTACTT TGTCGATTTG AGCGAAGTGT TTGCCGCTGG CTATGGCTTG AGCGACGTTA CGGTTAGCAC AGCCTATACC CAAGGCTCAG GCGTTTCTAG CTTGAAGCAA TGGGCTGGCA CAATTTACTA TGTCGAAATT GGCTTCAACG GAGTCAATGT CTACCCAGGT 
GGTCAATCTG AATCACGCAA AGAAGTGCAA TTCCGACTTT CGTTGCCAAC CAACACCAAT GCCCAACAAT GGGACAATAC CAATGACTGG TCGTTCAACG GCGTTGGCAC CAGCACCGAT CGGGTCAAAA CCCGCCGGAT TCCGGTGTAT GACAATGGCG TGAAGGTCTT TGGCGATGAG CCTGGTGGCA GCAACGTAAC CCCAACCGCA ACCAGCTTGC CAACCAACAC GGCTACGCCA ACCGTGCGCC CAACCAACAC CGCAACCCCA ACCACGGGGC CAAGCGCAAC CCCAACTATT CGCCCAACCA ACACGGCAAC CCCAACTGTT GGCCCAAGCG CAACCCCAAC CATCCGCCCG ACCAATACAC CCACGGCCTT GCCAACAAAC ACACCGTTGC CAACGAACAC GCCAGTGGCT GGGGCATGCC AAGTCAAATA TCGCGTTCCC AACGATTGGG GCAGCGGCTT CCTCGGCGAT GTCACAATCA CCAACGGCGG CGCAGCGATC AATAGTTGGA ACTTGACTTG GAGCTTCGCA GGCAGCCAAC AAATCACCAA CCTCTGGAGT GGGGTGGTGA GCCAAACCGG CCAAAACGTG AGCGTCAGCA ACGCTGGCTG GAATGGGAGC CTTGCCAATG GTGGCTCCGT CAACTTCGGC TTCCAAGCAA CCAACAACGG AACCAATAGC ATTCCTGCAA GCTTCAGCCT GAATGGGGCA GCTTGTACGA TTGTGCCATA A
|
Protein sequence | MMSTPLERSS RRWRIIAGFV ACLLIAGLII SPTTPTKAAE PLYRYGEALQ KSFFFYEAQQ AGPKPSWNRV SWRGDSVLTD GADVGLNLSG GWFDAGDHVK FGFPMAASAT MLAWGAVEYR DAYAQSGQLD ELLNNLRFVN NYFINAHPSP NVLYGQVGNG GKDHAFWGPA EIIHLDDQAG PRPSYKIDAT CGGSDLAGET AAAMAASSMV FRPTDPAYAD TLLSHARQLY TFADTVRGKY SDCITDATSF YNSWSGYNDE LVWGAIWLYR ATGEASYLSK AEQYYANLST EPQSTIKSYR WSIAWDDKSY GCYLLLAKLT GKQQYKDDTE RWLDYWTVGY NGQRVTYSPG GLAQLDTWGA LRYSANTSFA AFVYSDYITD ATKKARYHDF AVSQINYMLG SNPRNSSYVV GFGNNSPVNV HHRTAHGSWT DSLSNPVNQR HILYGALVGG PAKGSGDAYT DSRNDYVANE VATDYNAGFT SALARMYSEF GGAPLASFPP IETPEDEFFV EAKVNASGPR FIEISGVLHN QSAWPARNST KLSYRYFVDL SEVFAAGYGL SDVTVSTAYT QGSGVSSLKQ WAGTIYYVEI GFNGVNVYPG GQSESRKEVQ FRLSLPTNTN AQQWDNTNDW SFNGVGTSTD RVKTRRIPVY DNGVKVFGDE PGGSNVTPTA TSLPTNTATP TVRPTNTATP TTGPSATPTI RPTNTATPTV GPSATPTIRP TNTPTALPTN TPLPTNTPVA GACQVKYRVP NDWGSGFLGD VTITNGGAAI NSWNLTWSFA GSQQITNLWS GVVSQTGQNV SVSNAGWNGS LANGGSVNFG FQATNNGTNS IPASFSLNGA ACTIVP
|
| |
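The gene sequence above is printed in blocks of ten bases and begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch for working with it follows: it strips the block spacing, translates codons, and computes GC content. It assumes the standard codon assignments, which translation table 11 shares for internal codons; table 11 differs in its permitted start codons, and this sketch does not handle alternative starts. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for internal codons (assumption: no alternative start codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
fragment = clean("ATGATGAGTA CACCGCTCGA ACGTTCGAGC")
print(translate(fragment))  # prints "MMSTPLERSS"
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` should give a value comparable to the reported GC content of 54%.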