Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0324 |
Symbol | |
ID | 5732234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 385895 |
End bp | 387979 |
Gene Length | 2085 bp |
Protein Length | 694 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277448 |
Product | cellulose-binding family II protein |
Protein accession | YP_001543104 |
Protein GI | 159896857 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4124] Beta-mannanase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGTCGA ACCTGAGCCG TTCGCGGTCG TTGTTGACCA TTGCGCTCCT TGGGGTCTTG CTTGGTTCAC TGAGTTTGTT ACCAACGCAC ACCACCAAGG CCGCGATTAC CGCCTATCAA GCCGCCGATC CAAATCTCGC CCCGTATGCC CAAAATGTGC TCGATCGTTT TGTCCAATAC AAAGGTCAGT ATTGGCTAGG TGGTCAGCAA GAAGTTCACT GGGATAATTC GCGCAAAGAT GAAATGTCGA ACGCAGTTTT TGCCCGCACC AATCCACAAC GCTACCCCGC GCTGCGTGGC TGGGATTTCC CGATCGGCGG CGCACTGCCC AACGATGGCC AATGGATGAT CGACGCGATC ATCAGCGATT GGACCAATGC CAACGTTATT CCAACCATCA GCCAACACTG GACACCGCTT GCCAGCCAAG GCACTAACCA TCAAGATATG TTCACGGTGG TTGATATTGA TCGGATGTTT GTTGATGGCA CGACTGAACG CACTAATTAT CTGATTTGGC TCGATAATAT TGCTGATGAT TTGCAACAGC TCGAAGATGC CAACGTGCCA GTGCTATGGC GACCCTACCA CGAAGCTGGT GGTGGTTGGT TCTGGTGGGA TAAAGATAAT GGAGCCACCA ATTATCGCCG CCTTTGGGAT GATATGTTTA CCTATTTGGT AACCACCCGT GGCCTGCACA ACCTGATCTG GGTTTGGACA CCTGGGGTAA AAGGAGTCAG CACGGCTTGG TATCCCGCTG GCCAAGCCGA TATTTTGGGC AGCGATGTAT ATAACGAAAC TTCGGGCAAT TACGTGAGTT GGTATGAAGA TCTTGGGCGT TTCTCGCAAA CCAAGATTAA AGCCCTCAGC GAAACCGACT ACATGATGGA CCCTGCTCTG TTGAGCAGTG CGCCATTTGC CTACTTTATG ATTTGGCACA CCGATATGTT TTATCGCAAT ACCGATAGCC GCATTCAAAG CACCTATGCG CATTGGGCAA CCCTGAATCG AACCAATGTT GGCCAGATTT GGAATGGTAC TCTTGGGATC GCTCCCACAG CAACGCCTGG TACACCAACG CCTACGCCAA TTCCTGGGAT CGTGGTCAGC GATTTTGAAG ATGGCACGCT GCAGGGCTGG ACTGGCACAA ACCTGGTTTC AGGGCCAACT GTCAACAACG AATGGGCCGC CAATGGTCAA CGTTCGATCA AAGCTCAAGT TAATTTAGCC GCTACGCCTG CCGATATTCG GCTCGCACAA GCACTGGATT TAACTGGTCA ATCACGCATC CAAATTCGCC TGAGTGCCCA AAATGTTGGG AGCGGTCTCA GTGCTAAGCT CTACATCAAG ACGGGAAGCG CCTGGAATTG GAAAGATAGC GGAACTGTGC TGATCGATTC GGGCATCAGC TTGCTGACAA TTGAATTAGC GGGTGTGCCT GATATCAACC AAGTGCGCGA GTTGGGGGTT GAATTCAACG CACTCACGGG GAATAGTGGC ACAGCAACGA TCTACGCCGA CTATCTGACC GTGGGTGTGG TCAATAGCAA CCAACCAACC CCAACCACGG GGCCAACGGC GACTGCAACG CGCACGCCAA CCCAAGCGCC AACGGCTACG CCAACCAATA TTCCAACCGC AACGCGCACG CCAACCCAAG GCCCAACCGC GACTGCAACC ACCATTCCAA CCGTCACGCC AACGAATATT CCAACCGCAA CGCGCACGCC AACCCAAGTG CCAACTGCTA CGCCAACCAG CACTGGCGGC GCTTGTAAGG TTGATTTCAA GGTGACAAGC CAATGGGGCG TAGGCTTTAT CGCCGATGTT ACCGTAACCA ATTTGCAGCC AAGCGCCTTA AATGACTGGA ATGTGAAATT CAACTTCCCC AGTGGCCAAA CCATCAGCAA CCTATGGAAT GGCACACTCA GCCAGACTGG TAGCGCAGTT ACCGTCACGA ATGCTGGCTG GAATGGCTAT CTTGCAGGTA ATGGTGGCAC AGCCAACTTT GGTTTCCAAG GTGTTGGTAG TGTTCCAATC TTGCCAAGCA ACAGCTTCCA ACTCAATGGC GTAACCTGCC AATAA
|
Protein sequence | MWSNLSRSRS LLTIALLGVL LGSLSLLPTH TTKAAITAYQ AADPNLAPYA QNVLDRFVQY KGQYWLGGQQ EVHWDNSRKD EMSNAVFART NPQRYPALRG WDFPIGGALP NDGQWMIDAI ISDWTNANVI PTISQHWTPL ASQGTNHQDM FTVVDIDRMF VDGTTERTNY LIWLDNIADD LQQLEDANVP VLWRPYHEAG GGWFWWDKDN GATNYRRLWD DMFTYLVTTR GLHNLIWVWT PGVKGVSTAW YPAGQADILG SDVYNETSGN YVSWYEDLGR FSQTKIKALS ETDYMMDPAL LSSAPFAYFM IWHTDMFYRN TDSRIQSTYA HWATLNRTNV GQIWNGTLGI APTATPGTPT PTPIPGIVVS DFEDGTLQGW TGTNLVSGPT VNNEWAANGQ RSIKAQVNLA ATPADIRLAQ ALDLTGQSRI QIRLSAQNVG SGLSAKLYIK TGSAWNWKDS GTVLIDSGIS LLTIELAGVP DINQVRELGV EFNALTGNSG TATIYADYLT VGVVNSNQPT PTTGPTATAT RTPTQAPTAT PTNIPTATRT PTQGPTATAT TIPTVTPTNI PTATRTPTQV PTATPTSTGG ACKVDFKVTS QWGVGFIADV TVTNLQPSAL NDWNVKFNFP SGQTISNLWN GTLSQTGSAV TVTNAGWNGY LAGNGGTANF GFQGVGSVPI LPSNSFQLNG VTCQ
|
| |