Gene Information |
Locus tag | Haur_2520 |
Symbol | |
ID | 5734398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3218561 |
End bp | 3221338 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279660 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001545286 |
Protein GI | 159899039 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
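The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed. The function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(3218561, 3221338)
print(length)                     # 2778
print(protein_length_aa(length))  # 925
```

Both results match the Gene Length and Protein Length rows of the record.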
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAGCA ACCCACCCCG CGAACGAATT CTGTTTGATC AAGGCTGGCG TTTTGCCCTC GGCCATGCCT TCGACCCCGC CCAAGATTTT CAGCTGGATA CAAGTCATTT TTCGTTTCTG GCCAAGGCTG GCTATGGCGA CGGCGCAGCC TCAGCCAAGT TCGACGATCG GGCTTGGCGT TTGCTCGATC TACCCCATGA TTGGTGTGTT GAATTGCCCT TCGATCAACG CGGAACTCAC AGCCATGGCT ACAAAGCAAT TGGCCGTAAG TTTCCCGCCA ATAGCGTTGG TTGGTATCGC AAAACCTTTG CAATTCCCGC TGACGATCTT GGGAGACGCA TCAGCCTCGA ATTCGATGGG GTGCATCGCG ATTCTAAAGT TTGGGTCAAT GGCTTTTATC TAGGCAACGA GCCAAGCGGC TACACCAGTT TTAGCTACGA TATCAGCGAT TATCTGAATT ATGGCGGCCA AAACGTGGTG GTAGTACGCG CCGATGCCAC CACCGAAGAA GGCTGGTTTT ATGAAGGCGC TGGCATTTAT CGCCACGTTT GGTTGAGCAA AACGGCTCAA GTCCATGTCG CCCGCTATGG CACATTCGTC ACTAGCGAAA TTCATGAAGC TAGCGCCACG CTCACAATTC AAACAACGGT TATTAACGAA AGCCAACATC CAATCGAATT TAGCTTGCTG GAAACAATCA GCGATCCGGC TGCCAATAAT GTGCTGCAAG CCCAAAGTGC CAATTATCAG CTGGCGGCAG GCCAAAGTGC TGAATTCAGC CACCAGCACA CGCTCAACAA TCCGCAACTC TGGGATCTTG CTAGCCCACA TTTGTATCAA CTAACCACCA CTGTGCTGCT TGGTGAGCAG GCAGTAGATA GCTATCAAAC AACCTTTGGC ATTCGCAGCA TTCGCTTTGA TCCTGATCAG GGCTTTTTCT TGAATGGGCG TTCGGTCAAA TTGGTTGGCA GCAACCAGCA TCAAGATCAC GCGGGAGTTG GCACGGCTAT TCCCGATAGC TTGCAAGAAT ATCGCATCAA ACGCTTGCAG GAATTCAATC ATAATGCGAT TCGGACTTCG CACAACCCGC CAACCCCCGA ATTCCTCGCT GCATGCGATC GTTTGGGAAT GTTGGTGCTG AATGAAAATC GCTTGATGGG AACAAATGCT GAGCATTTAC GCCATGTTGA GCAGCTGATC AAGCGTGATC GGAATCATCC TTCGGTGATT TTGTGGTCGT TGGGCAACGA AGAATGGGCG ATCGAGGGTA CGATCATCGG CGCACGCATC TCCAGCACCA TGCAAAATTT TGCCCGTCAA TTCGATCATA CACGCCTTTT CACAATCGCT TGTAGCGGCG GCTGGGATAC TGGTATTGGA ATGGTGACCG ATGTGGTTGG CTATAATTAT ATCTTTCACG GCGATATTGA TGCCCATCAT GCGAAATTCC CATGGCAATC GGGCGTTGGA ACAGAGGAAA GTAATACCTA CGGTACACGC GGGATCTACC AAACCGATGC CAGTCGTGGG CATTTAGCGC CTTCTCCCAG TGGTGATGGC TACGCCGATA CGGAGTTCGG CTGGCAATTT TATGTCAAAC GACCATTCTT GGCTGGGTTA TTCTATTGGA CTGGCTTTGA TTATCGGGGC GAGCCAACGC CCTTTGAATG GCCAGCGGTC GCCAGTCAAT TTGGCTTTTT CGATTTGTGC GGCTTCCCCA AAGATATTGC CTATTACACC AAGGCTTGGT GGGGCCAGCA ACCAGTCTTG CACATTGGCT ATCACTGGGA TTGGCCGAAT 
GATCTTGATC GAGTGAAAAG CTTTCCAATT TATAGCAATT GCCAAGAAGT TGAGCTATGG CTGAATGGTC AGAGCCTTGG CCGCCGCCCA ATTCCCGAAA ACGGCCACCA ACAATGGGAA GTTGCCTATC AACCAGGCGA ATTGTTAGCG CGTGGCTATA ACAACGGGGT TGAAGTGCTC AGCGCTAGTT TGCGCACCAG CCAAGCTGCC AGCCAAATTC AACTGGTGCT CGATTATGGT CAATTGCAAC AAACAGGTGA TCTAGCCGTT GTCACCTTGC AAGTTTGCGA TGATCATGGC CAGATTGTGC CAACCGCCAA TCAACTACTC GATCTGCAAT TAACCGGTTC GGCCAAAATT TTAGGGGTCG GCAACGGTGA TCCAGCTAGC CACGAGCCTG AGCAATATCA CGCTAGCTAT CAACTTTGCC CAATCCAAAT TAATGCCGAG CAAACCGTGG CCGAGTTGAG CACAGGTTTG GAGCGAGCGC TAGGAGCCGA AACCCAAGTT TGGCAACCAG CATTCTTGCA TCACAACGAC GATCAACAGG CTAATCCTCA GGCCTATATT CTGTTGCGTG GCAGCTTCGA GCTACCACAG TACAGCGCTG CCAGCAGCAT TCGCTTGTTC AGCAAAAGCA TAGCCCACGA TTACAGCGTC TATATCAACG GCCAATTGAT CATCAGCGAT ATTGCGCTTG AGCAAGGCGA TCAAGCGCTT GAATTGAGCC ATGAACTGGT GCACCCAGGC CAAAACGAAT ATTTGGTGAC TGGGCGCTGG ATCAAAAAAG TCAATCCTTG GACGTTCCCC AATCGTGAGC CTGGCGTGGT GCAGGTTATC GAGCCAGCCG CCGCTTGGCA ACGCCGCGCC TTCAACGGCC TAGCCCAAGT CATCGTGCAA CATACTGGCG GCGACCAGCC AATTAGCCTT AGTGCCAGCA ACCCTGAGCT AGGCACGGCA ACGCTTAAGT TCGATTAA
|
Protein sequence | MISNPPRERI LFDQGWRFAL GHAFDPAQDF QLDTSHFSFL AKAGYGDGAA SAKFDDRAWR LLDLPHDWCV ELPFDQRGTH SHGYKAIGRK FPANSVGWYR KTFAIPADDL GRRISLEFDG VHRDSKVWVN GFYLGNEPSG YTSFSYDISD YLNYGGQNVV VVRADATTEE GWFYEGAGIY RHVWLSKTAQ VHVARYGTFV TSEIHEASAT LTIQTTVINE SQHPIEFSLL ETISDPAANN VLQAQSANYQ LAAGQSAEFS HQHTLNNPQL WDLASPHLYQ LTTTVLLGEQ AVDSYQTTFG IRSIRFDPDQ GFFLNGRSVK LVGSNQHQDH AGVGTAIPDS LQEYRIKRLQ EFNHNAIRTS HNPPTPEFLA ACDRLGMLVL NENRLMGTNA EHLRHVEQLI KRDRNHPSVI LWSLGNEEWA IEGTIIGARI SSTMQNFARQ FDHTRLFTIA CSGGWDTGIG MVTDVVGYNY IFHGDIDAHH AKFPWQSGVG TEESNTYGTR GIYQTDASRG HLAPSPSGDG YADTEFGWQF YVKRPFLAGL FYWTGFDYRG EPTPFEWPAV ASQFGFFDLC GFPKDIAYYT KAWWGQQPVL HIGYHWDWPN DLDRVKSFPI YSNCQEVELW LNGQSLGRRP IPENGHQQWE VAYQPGELLA RGYNNGVEVL SASLRTSQAA SQIQLVLDYG QLQQTGDLAV VTLQVCDDHG QIVPTANQLL DLQLTGSAKI LGVGNGDPAS HEPEQYHASY QLCPIQINAE QTVAELSTGL ERALGAETQV WQPAFLHHND DQQANPQAYI LLRGSFELPQ YSAASSIRLF SKSIAHDYSV YINGQLIISD IALEQGDQAL ELSHELVHPG QNEYLVTGRW IKKVNPWTFP NREPGVVQVI EPAAAWQRRA FNGLAQVIVQ HTGGDQPISL SASNPELGTA TLKFD
|
| |