Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2431 |
Symbol | |
ID | 5734312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3114634 |
End bp | 3116265 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279572 |
Product | licheninase |
Protein accession | YP_001545199 |
Protein GI | 159898952 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000786066 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAAC AACCACGCTC TTTTTTTGGC TCACGCTTAA TTCAACGAGC TATTTTGATT TTGGCGATCT GCGCGGTGAT TGTGCCGTTA TTCGCCAGTA AATCATCATA TGCTGCCGCC ACGCCACGTC GCCCGTTTCC CCAACATACC CAATATGCCA GCGGCACGAT CAAGCCCAAT CATCGCAGCC AAGCCCAACT TGATAGCGAT GTTAAGGCAT TTTATGATGT TTGGAAAAGC CGCTATGTGG TTCGCGCTGG CACGAGCAGT GCTGGCAACC CCTACTATCG GATTAGTTTT GGCAGCAGTG CGCCCAACGT AACCGTTTCC GAAGGCCAAG GTTATGGCAT GGTGATTATG GCCTTAATGG CGGGCTATGA TCCCGAAGCT CAAACAATTT TTGATGGTTT ATGGGAGTTT TCGCGCACCA ATCCCAGCAA TATCGATTCG CGCCTGATGG GTTGGCGCAT TCCTAGCGAT GGCTCGGGCA ATGATAGTGC TTTCGATGGC GATGCTGATA TCGCTTATGG CCTGATTTTG GCCGATGCTC AATGGGGTAG CACTGGTCGA ATCAATTATG CCAGCGCGGC AAACACGGTT TTGGATGGGG TTTTATCATC GACCATTGGG CCAAATAGCC GCTTGCCCAT GTTGGGCGAT TGGGTTTCGC CGAATGGTAG CCCGCATAGC CAATATACGC CACGGCCCTC AGATTTTATG CCCAGCCACT TCCGCGATTA CCGAGCCTTT ACTGGCAATG CCACTTGGGA TACGGTGCTG AGCAAAACCC AAGGTGTGGT TGATAGCATT CAAGCCCAAT ATAGCCCCAA TACTGGCTTG ATGCCCGATT TTGTGGTGCA AGCCAACACA ACGCCTAAGC CATCGCCTGC CAACTTCTTG GAAAGCGAAA ACGATGGCAA TTATTACTAT AACTCGGGTC GTGTGCCATG GCGCTTGGGA GCCGATGCCG TGATTTTTGG CGACGCTGCT TCATTACGTC AAGCTCAAAA AATCTCGCGT TGGATCGAGC AAGCCACTGG TGGGACAGCA AGCAATATTC GGGCTGGCTA TAGCTTGAAT GGTACGGCCT TGCCCGATAG TGGCTATTTC AGCACCTTCT TTGCAGCGCC ATTTGGGGTT GCAGCCATGA CCGTGCCAGC CAGCCAGCAA TGGCTCAATC GAGTTTACGA TGCGGTGCGC AGTAATCACC AAGATTATTT CGAAGATACC GTAACGCTGC AATGTTTGCT ATTGATGTCG GGCAATTATT GGTCGCCAAG CCGCAGCAGC ACCAGCCCAA CCGCAACCCC ACGCCCTGCA ACTGCAACCC CACGCCCTGC GACGGCAACG CCACGACCCG CCACCGCAAC GCCACGTCCA GCAACTGCGA CCCCGCGCCC AGCCACCGCC ACCCCACGCC CAGCCACCGC CACCCCACGC CCAGCCACCG CCACCCCACA ACCTGCCACG GCAACCCCAA ACGGCGTTGC AGCGTGGGAT GGCAATATGC GAGCCTACAA AGTGGGCGAT CGGGTCAGCT ACAACGGGCG CATCTATCGC TGTTTACAAG CACATACCTC GTTATCAACT TGGACTCCTG AGGCTGTTCC GGCCTTATGG CAAGCTGAAT AA
|
Protein sequence | MAEQPRSFFG SRLIQRAILI LAICAVIVPL FASKSSYAAA TPRRPFPQHT QYASGTIKPN HRSQAQLDSD VKAFYDVWKS RYVVRAGTSS AGNPYYRISF GSSAPNVTVS EGQGYGMVIM ALMAGYDPEA QTIFDGLWEF SRTNPSNIDS RLMGWRIPSD GSGNDSAFDG DADIAYGLIL ADAQWGSTGR INYASAANTV LDGVLSSTIG PNSRLPMLGD WVSPNGSPHS QYTPRPSDFM PSHFRDYRAF TGNATWDTVL SKTQGVVDSI QAQYSPNTGL MPDFVVQANT TPKPSPANFL ESENDGNYYY NSGRVPWRLG ADAVIFGDAA SLRQAQKISR WIEQATGGTA SNIRAGYSLN GTALPDSGYF STFFAAPFGV AAMTVPASQQ WLNRVYDAVR SNHQDYFEDT VTLQCLLLMS GNYWSPSRSS TSPTATPRPA TATPRPATAT PRPATATPRP ATATPRPATA TPRPATATPR PATATPQPAT ATPNGVAAWD GNMRAYKVGD RVSYNGRIYR CLQAHTSLST WTPEAVPALW QAE
|
| |