Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2646 |
Symbol | |
ID | 5734526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3394887 |
End bp | 3397325 |
Gene Length | 2439 bp |
Protein Length | 812 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279788 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001545412 |
Protein GI | 159899165 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00270036 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTTC ATTTCCAACG GCGGAAACGA TCACGCACGC TGATCGCAGC CTGTAGCACG GTTGCATTGA CGGCAAGTGT GGCTTTTGGG AACATCTCCT CAGTGTTTGG GGTTATCCAA CACGAATCCC GCAATACGAC CCCGGCTAAT CTCCCCGCCG CTGCGGTAGC AGCTTCCTCA CTGATTCGGG TCAACCAACA TGGCTATTTG CCCAACGCGA TCAAACGCGC TAGCTTGGTC AATAGCAGTA ACTCGCCTGT GGCATGGCAA TTGCGCAATA GCTCTGGCAC AACCGTACTT TCAGGCAATA CCACGGTTTT TGGCTACGAT GCCAGTTCTG GCGATACGAT TCATATCATC GATTTTTCGG GCTACACTGG CACAGGCAGC AGCTTTACGT TAGCAGCGGC TGGCAATCTG AGCCATCAAT TTGATATTAG CTCAACAGTG TATCGCCAAT TAAAATACGA TGCCCTCGCC TACTTCTACC ATAATCGCAG CGGAATTGCG ATCACCATGC CCTATGCTGG ACGCAATGAT CTGACTCGAC CTGCTGGCCA CGTTGGCATC GCTCCCAACC GTGGCGATAC CAATGTACCC TGTGCGCCTG GCACTGGCTG TAGCTATTCG CTGAATGTAG TTGGCGGCTG GTACGATGCT GGCGATCATG GCAAATATGT GGTCAATGGT GGGATTTCGG TCTGGACATT GCTTAACCAA TATGAGCGCA ACCAATATTT AGGCTCGTCG GCAGCCGATT TTGGCGATGG CCGCATGAGT ATTCCTGAAA ATAGCAATGG TGTAGCCGAT TTGCTCGACG AGGCGCGTTG GGAATTGGAT TTTATGTTGC GCATGCAAGT GCCTGCTGGC CAACCACGGG CTGGCATGGT GCACCACAAA ATGCACGATG CCAACTGGAC TGGCATTCCA CTGCGCCCCG ATCAAGATTC ACAGGATCGG GTATTACGCC CACCAAGCAC CGCCGCAACC TTGAACTTGG CGGCAACTGC GGCCCAAGGC TCACGGATCT GGCGTTTCAT TGATCCAACG TATTCAGCAC GTTTGCTGAG CGCTGCCGAA ACTGCTTGGG CAGCGGCGCG GGCTAACCCC AACGTAATTG CTTTGGATAG CGATGGAACT GGCGGTGGTT CGTATGGCGA TCCGCAAGTT GGCGATGAAT TTTATTGGGC CGCCAGTGAA CTCTACATCA CAACTGGCAA AGCCGAATAC AAGAGCTATT TGCAAAGCTC AAGCTACTAC CAAGATGTAC CCAGCGATTA CAGCAGCCGC GAACCAGCCA TGACGTGGGG CACAACCGAG GCACTTGGCA CAATTTCGTT GGCGGTTGTA CCCAGCGGTT TCAATAGCAG CGAACTGGCG GGCGTGCGCA ATGCAGTGAT CAGCGCAGCC ACCGTGTTTG CCAATAACAC CACTGGACAA GGCTATGGCA CACCATTTAA TTCAACCAGC ACTGGTTATC CATGGGGTTC CAACTCGTTT GTCTTGAATA ATGGCATTAT TTTGGGCTTG GCCCACGATT TCACTGGCAA CGTGAGCTAT CTCAATGCGA TGAGCCAAGG TATGGATTAT TTGCTGGGCC GCAATGCGAT GGACAAATCG TATGTGACGG GCTATGGCGA AAACCCCTTG CAAAACCCGC ATCACCGTTT CTGGGCCTTT CAAGCGAATA GTGGCTTTCC TCGCCCACCA GCTGGGGCAG TTTCAGGCGG CCCTAACTCG GCCTTGCAAG ACCCTTATGC TCAAAGCATT GGCTTGCCTG GCTGCAACGC CCAAAAATGT TTCGTCGATC ATATCGAATC GTGGTCAACC AACGAAATTA CCATCAACTG GAATGCACCA TTAGCTTGGG TCGCCGCCTA TTTAGATGAA AAGGCTGGTT CGGTAATACC ACCGACGGCT ACGCCAATTC CGCCAACTGC GACTCCAATT CCGGCGACCG CCACGCCACG CCCAGCCACA GCTACGCCAA TTCCGGCGAC GGCAACCCCG ATTGGCCCAA CCGCGACGGC TACGCCAATT GTGCCAACCG CCACGCCACG TCCAGCGACG GCAACTCCTG TGCCAACCAA CACGCCAGTT GTGCCAACCC AAACGAACTC AAGTTGCCAA GTGACCTATA CAATTTCTAA CCAATGGCCA ACCGGCTTTA CCGCCGATGT TGTGATCAAA AACAACGGTG CGGCGATCAA TGGCTGGAAC CTTGCTTGGA CGTATGCTGG TAATCAATTT ATTAGCAATT TGTGGAATGG CAATGTGAGC CAAGTTGGTC AAGCGGTCAA TGTGACCAAC GTCAATTGGA ATGGCTACTT AGGCACTGGG GCAACTGCTA GTTTTGGCAT GCAAGCGAGC TTTAGTGGCA CAAATGCCAA ACCAACTGCT TTCAGCTTGA ATGGTATCGC CTGTAGCGTT ACGCCGTAA
|
Protein sequence | MTVHFQRRKR SRTLIAACST VALTASVAFG NISSVFGVIQ HESRNTTPAN LPAAAVAASS LIRVNQHGYL PNAIKRASLV NSSNSPVAWQ LRNSSGTTVL SGNTTVFGYD ASSGDTIHII DFSGYTGTGS SFTLAAAGNL SHQFDISSTV YRQLKYDALA YFYHNRSGIA ITMPYAGRND LTRPAGHVGI APNRGDTNVP CAPGTGCSYS LNVVGGWYDA GDHGKYVVNG GISVWTLLNQ YERNQYLGSS AADFGDGRMS IPENSNGVAD LLDEARWELD FMLRMQVPAG QPRAGMVHHK MHDANWTGIP LRPDQDSQDR VLRPPSTAAT LNLAATAAQG SRIWRFIDPT YSARLLSAAE TAWAAARANP NVIALDSDGT GGGSYGDPQV GDEFYWAASE LYITTGKAEY KSYLQSSSYY QDVPSDYSSR EPAMTWGTTE ALGTISLAVV PSGFNSSELA GVRNAVISAA TVFANNTTGQ GYGTPFNSTS TGYPWGSNSF VLNNGIILGL AHDFTGNVSY LNAMSQGMDY LLGRNAMDKS YVTGYGENPL QNPHHRFWAF QANSGFPRPP AGAVSGGPNS ALQDPYAQSI GLPGCNAQKC FVDHIESWST NEITINWNAP LAWVAAYLDE KAGSVIPPTA TPIPPTATPI PATATPRPAT ATPIPATATP IGPTATATPI VPTATPRPAT ATPVPTNTPV VPTQTNSSCQ VTYTISNQWP TGFTADVVIK NNGAAINGWN LAWTYAGNQF ISNLWNGNVS QVGQAVNVTN VNWNGYLGTG ATASFGMQAS FSGTNAKPTA FSLNGIACSV TP
|
| |