Gene Information |
Locus tag | Haur_1666 |
Symbol | |
ID | 5733550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1933604 |
End bp | 1934971 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278805 |
Product | cellulose-binding family II protein |
Protein accession | YP_001544437 |
Protein GI | 159898190 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
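The two length fields follow from the coordinates above. The span is inclusive, so Gene Length is End bp minus Start bp plus one, and Protein Length is the codon count less the stop codon. A minimal Python sketch of that arithmetic follows; the function and variable names are illustrative and not part of the record, and it assumes an unspliced CDS that ends in a single stop codon.

```python
# Coordinates copied from the record above; all names here are illustrative.
START_BP = 1933604
END_BP = 1934971


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1368 455
```

Both values match the Gene Length and Protein Length fields in the table.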
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000449243 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGTGA ATCCAAAACG GGGATGGCGT TTATTAATGA TTATGCTGAT GCTTGGATTG GTTTGGACAC AGAGCCAAAA ACCAGCCCAA GCGCAAACGA GCAGTGCTTG TTCGATTAGC TATAGCATTG TTGGGCAGTG GCAAGGTGGT TTTCAGGGCG ATATCGCGAT TCGTAACACT GGCACAAGCC CAATCAACAA TTGGACATTA ACTTGGAGTT TCAATAATCA AGCTATTTCA CAATTATGGG GTGGCAATTA TAGCCAAACA AACAATGCCG TCAGTGTTAG CAATATGGCT TGGAATGGCA CAATTGCCAG CAACTCCAGC GTAAGTTTGG GCTTTATTGC TAGCTGGACA GGCAGCAATC CAATTCCTAG CCATTTTCAG GTGAATGGTA TCGCCTGCAA TGGCGGCATT ACACCAACCG CGACCGGCGT TGTGCCAACT GCCACCCGCA CACCAATGCC GAGCACTGCA ACCGCAATTC CCACAGCAAC CCGCACGCCT GCGCCTTCAG CCACAACAGT TGTGCCAACC GCAACCCGCA CGCCTGCGCC AACCGCAACT CGCACACCTG CACCGACTGC AACTCGCACG CCAACCGTTA CGCCAACGGG CTGGAATCCG CCAAGCAATT TGGTAACACC ACTGAATGAA GTTTGGCAGC ATGTCGAATC GACCTATGGC AATTTGTATG GTTTTCGCAA TTACGGCTGG GATCAAGTGA TTGCAGGCAA TGGCTCGATT AATTATTGTG TACGCTGGGA TTCGGATGCT CCAGTTTCCG CCGCCTTGCG CGACCAAATT CATGCCTCGC TGGCGCGACA ATTCAAAAAA TGGATGGATG CGATGCTTGA TAATGGTCAA GGCACCAACG GCTGGCCTTA CACCAATGTC AATATCAAAG TGGTTGGCTG GGCCGTGCGC GATCGCAGCA CCCTGCAATG GAGCGATAAC TCAGTTGATA TTTATGTCAA TAATATCGCT GAAAATGCAC CGCAATGTGC TCCGCCATGT GGCCGTTTCT TCAACCAAGA TGGCGATTAT TCCGATTGTC CTGGCGGCTT CGATCACCAC TACGATATGT CGTTGTGGCT GACCAAGGGT ATGGCTGGCG GCGCTGGGGG CGATTGGGGT CAGCGGGTTG GCAGCGAATA TTTCGTCAAC AACCTCAACA GCGAGAATAT TCACATCTTT CTGCATGAAG TTGGCCATAC CTTTGGGCTT GACGATTTTT ACGATTGGAC ACCGACTGGC ATCAACAGCT TTATTATGAA TGCTGGCAGC GCTAGCCAAA TTACTGAGTT TGATAAATGG ATGTTGCGCG ACTTCTGGCG ACATCTCAAG AGCCGCTACG GCTACTAA
|
Protein sequence | MMVNPKRGWR LLMIMLMLGL VWTQSQKPAQ AQTSSACSIS YSIVGQWQGG FQGDIAIRNT GTSPINNWTL TWSFNNQAIS QLWGGNYSQT NNAVSVSNMA WNGTIASNSS VSLGFIASWT GSNPIPSHFQ VNGIACNGGI TPTATGVVPT ATRTPMPSTA TAIPTATRTP APSATTVVPT ATRTPAPTAT RTPAPTATRT PTVTPTGWNP PSNLVTPLNE VWQHVESTYG NLYGFRNYGW DQVIAGNGSI NYCVRWDSDA PVSAALRDQI HASLARQFKK WMDAMLDNGQ GTNGWPYTNV NIKVVGWAVR DRSTLQWSDN SVDIYVNNIA ENAPQCAPPC GRFFNQDGDY SDCPGGFDHH YDMSLWLTKG MAGGAGGDWG QRVGSEYFVN NLNSENIHIF LHEVGHTFGL DDFYDWTPTG INSFIMNAGS ASQITEFDKW MLRDFWRHLK SRYGY
|
| |
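The sequences above are printed in space-separated blocks of ten characters, and the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand). The following is a minimal sketch of how one might strip the spacing, translate the gene, and compute GC content. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code. The function names are hypothetical, and only the first and last few codons of the gene are used so the example stays short.

```python
# Standard codon assignments, enumerated in TCAG order.
# Assumption: table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 and last 18 bases of the gene sequence in the record.
head = "ATGATGGTGA ATCCAAAACG G"
tail = "AGCCGCTACG GCTACTAA"
print(translate(head))  # MMVNPKR
print(translate(tail))  # SRYGY
```

The two fragments reproduce the first seven and last five residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give the value that the record rounds in its GC content field.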