Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0295 |
Symbol | |
ID | 5732190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 348845 |
End bp | 351409 |
Gene Length | 2565 bp |
Protein Length | 854 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277419 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001543075 |
Protein GI | 159896828 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.271703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGGC ACTATTACGC CCGAGGGGCG ATGCTGTTAG CGCTGCTAAC GATGATTGGT GGCCTGTTGA CCACCCAGAA TGCCAAGCCA ACCGCCGCCG CTGCCAGTTG TGTGGTCACC TATCGCATTC CCAACGATTG GGGCAGTGGC TTCCTCGGCG ATGTTAATAT TCAAAATAAT GGTGCAGCCA TCAGTAGCTG GACGGTTGGC TGGAGTTTCG CTGGCAATCA GCAAATTACC AACCTCTGGA GTGGGATTGT AACCCAAACT GGCAACCAAG TAAGCGTGCG TAACGCAGGC TGGAACGGCA CGATCAGCAG TGGTGGTGCA GTCAACTTTG GCTTCCAAGG AACCTACAGC GGTGCTAATG CAATCCCAAC GGTGTTTACA TTAAATGGTG TGGTTTGTGG TGAAACGAAT CCGAACCCAA CCGCAACCAC TCCACCAACC GCAACCACCC GCCCAACCAA CACGGTAGTT GTACCAACCA ATACACCACG GGCAACTAAC ACAACCGTAC CACCAACTAA TACGGCTGTG CCACCAACTA GCACAACTCG CCCAACTAAT ACGGCTGTGC CGCCAACCGC GACCAACGGC CCAACCAGCA CGCCACGGCC CACCAATACC CCAACGGTTG TGCCACCAAC CAGCACCCCA ACCCAACCAG GCGATGATAC CTACGATCAA CGCTTCTTGG AAATGTATGC TGAGTTGAAG AACCCAGCTA ATGGCTATTT CAGCCCTGAA GGTGTGCCCT ACCACTCAAT CGAAACCTTG ATTGTCGAAG CTCCGGATTA TGGCCACGAA ACCACTTCCG AAGCCTATAG CTATTGGTTG TGGCTCGAAG CGATGTACGG TGAGGCAACT GGCAATTGGC AACCATTGGC CGATGCTTGG CGCAACATGG AAATGTACAT CATTCCAACC AGCCAAGATC AACCAAGCAG TGGTTCGTAT AATGCCAATA GCCCAGCCAC CTATGCTGGC GAATGGGAAT TACCCAGCCA ATATCCATCA CAATTGCAAA CCAACGTTTC AGTTGGCCAA GACCCAATCG CCGCTGAATT GCGCTCGGCT TATGGCACCA GCGATGTCTA TGGCATGCAC TGGTTGCTCG ACGTTGATAA CTGGTATGGC TATGGCCGCC GTGGCGATGG CACCAGCAAA CCATCATATA TCAACACCTT CCAACGTGGC GCTCAAGAAT CAGTTTGGGA AACCGTGCCC CATCCATCGT GGGAAAGTTT CAACGATGGC GGCCCCTTCG GCTTCTTGAA CTTGTTCACT GGCGATGCGA GCTATGCCCG CCAATGGCGC TACACCAACG CCCCCGATGC TGATGCCCGT GCTGTCCAAG CGATTTACTG GGCCAAAGTA TGGGCCGACG AACAAGGTGG CTCACCAATC GTCAATGGTT TGGTAACCAA GGCCGCCAAG ATGGGCGACT ACTTGCGCTA CGCCTTCTTC GATAAGTACT TCAAGCAAAT TGGCTGTACT TCAACTTCAT GTCCAGCTGG TTCAGGCTAC AGCAGCGCTC ACTACCTATT GTCGTGGTAC TACGCTTGGG GTGGTTCGAT TGGCAATGGT GGTGGCTGGG CATGGCGCAT CGGCAGCAGC CACAATCACT TTGGCTACCA AAACCCAATG GCAGCCTGGA TTTTGGGCAG CCAACCAGCC TTCAAGCCAG CTTCAACCAA CGGCGCTCGC GACTGGAACA CCAGCTTGAC CCGCCAAATC GAGTTCTATA CTTGGTTGCA ATCAAGCGAA GGTGCCATCG CTGGTGGTGC TACCAATAGC TGGAATGGCC GCTACGAAGC AGCCCCAGCC GGAACCAGCA CCTTCTACAA TATGGCCTAC GACGAAAAAC CAGTCTATCA CGATCCCGCT AGCAACACCT GGTTTGGTTT CCAAGCTTGG TCGATGGAAC GGGTCGCTGA ATATTACTAC GCTTCAGGCG ATGTCAAAGC CAAGAACGTG CTCGATAAGT GGGTAACCTG GGCTTTGGCC AACACCACCT TGACCAGCAA TGGCAGCTAT GAAATTCCAT CAACCTTGGC TTGGAGCGGC CAACCAGCTA CCTGGAACGC CAGCAATCCA GCCGCTAACA CCAACTTGCA CGTCACGGTC GTCGATAAGA CCCAAGACGT TGGTGTTGCC GCCGCCTATG CCAAAACCTT GATGTACTAC AGCGCTGCAA CCAAGCGCTA TGGCACTCAA CATGTCGCTT CACAAACCAT GGCCAAAGAA TTGATCGACC GTATGTGGTC AGAATACCGC GATGATAAGG GTGTTGCGAA CCCCGAAACC CGCCGCGATT ACAACCGCTT CGATGATCCA GTGTCAGTGC CAAACGGCTG GACTGGCACC ATGGCCAATG GTGATCCAAT CAACAATTCA TCAACCTTCT TGAGCATTCG CACCAAGTAC GAAGATGATC CAGCCTTCCC AGCCGTGCAA GCCTACCTAA ACGGTGGCCC TGCTCCAACC TTCACCTACC ACCGCTTCTG GGCCCAGGCC GATATCGCAA TGGCTTACGC CGAATACGAT CGCTTGTTCC AGTAA
|
Protein sequence | MSRHYYARGA MLLALLTMIG GLLTTQNAKP TAAAASCVVT YRIPNDWGSG FLGDVNIQNN GAAISSWTVG WSFAGNQQIT NLWSGIVTQT GNQVSVRNAG WNGTISSGGA VNFGFQGTYS GANAIPTVFT LNGVVCGETN PNPTATTPPT ATTRPTNTVV VPTNTPRATN TTVPPTNTAV PPTSTTRPTN TAVPPTATNG PTSTPRPTNT PTVVPPTSTP TQPGDDTYDQ RFLEMYAELK NPANGYFSPE GVPYHSIETL IVEAPDYGHE TTSEAYSYWL WLEAMYGEAT GNWQPLADAW RNMEMYIIPT SQDQPSSGSY NANSPATYAG EWELPSQYPS QLQTNVSVGQ DPIAAELRSA YGTSDVYGMH WLLDVDNWYG YGRRGDGTSK PSYINTFQRG AQESVWETVP HPSWESFNDG GPFGFLNLFT GDASYARQWR YTNAPDADAR AVQAIYWAKV WADEQGGSPI VNGLVTKAAK MGDYLRYAFF DKYFKQIGCT STSCPAGSGY SSAHYLLSWY YAWGGSIGNG GGWAWRIGSS HNHFGYQNPM AAWILGSQPA FKPASTNGAR DWNTSLTRQI EFYTWLQSSE GAIAGGATNS WNGRYEAAPA GTSTFYNMAY DEKPVYHDPA SNTWFGFQAW SMERVAEYYY ASGDVKAKNV LDKWVTWALA NTTLTSNGSY EIPSTLAWSG QPATWNASNP AANTNLHVTV VDKTQDVGVA AAYAKTLMYY SAATKRYGTQ HVASQTMAKE LIDRMWSEYR DDKGVANPET RRDYNRFDDP VSVPNGWTGT MANGDPINNS STFLSIRTKY EDDPAFPAVQ AYLNGGPAPT FTYHRFWAQA DIAMAYAEYD RLFQ
|
| |