Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1626 |
Symbol | |
ID | 5733498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1884988 |
End bp | 1886508 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278765 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001544397 |
Protein GI | 159898150 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4124] Beta-mannanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000010667 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCAAG TTCAACGAGT TCCATTGCTG CTGCTTGGCA TGCTGGCCTT GCTGATCAGT ACAATCCCTT CGACCTCAAC GAGCGGCGCA ACCAATGCCT ACGCCGCCGC CGACCCAGCA ATTGCTAGCT ACGCCCAAAA CGTGATTAAT CGCATGGCCC AATATCAAGG CCACTCTATC ATCAGCGGCC AGCAAGAAGT CCATTGGGAT TCCTCGCGCA AAGACGAAAT GTCCAACGCC ATCTACAATC GCACAAATCC ACGCCAATAT CCAGGTTTGC GGGGCTGGGA TTTCCCCTTC GGCGGAGCTT ACGCCAACGA TGCCCAATGG ATGATCGACA CCATGATTAG CGATTGGAAT ACTGCCAAAG TGCTGCCAAC AATCAGCCAA CATTGGACAC CCTACGGCAG CCAAGGCACA AATCACGATG ATATGTTTGT GCAAGTCAAT ATCAATAATA TATTTGTTGA TGGCACAACT GAGCGTAGCC GCTACCTTAC CTGGCGCAGC AACATCGCTG ACGATTTGCA AAAACTGGAA AATGCCAGTG TACCAGTCTT ATGGCGGCCC TATCACGAGG CGGGCGGTGG CTGGTTCTGG TGGGATAAAA TTGGCTCAAG TAATTACAAG CGTTTGTGGA ATGACCTCTG GGATTATCTG ACCAACAGCC GTGGCTTGCA CAATTTGATC TGGGTTTGGT CGGCAGGAAC CAAGGGAGTC GGTACTGATT GGTATCCGAG CGGCAAGGTC GATATTCTTG GCCACGATAT TTATAACAAT AGCTCGGCTG ACTATAGCAG TTGGTACACT GATTTAGCTC GTTTCGATAG TAGCAAATTA CGAGCATTAA CCGAAGTCGA TTATATGCTT GATCCAGCAG CATTGAACAA TGCACCATTT GCCTTCTTTA TGACGTGGCA TACCGACATG TTTTATCGCA ATAGCGATAG CAAAATCCAA AGCGTTTATC AACATAACAA AACCGTCAAT CGCAGCCGCA TCAGCCAATA TCTGAATGGG AGATTGGGTA GTAGTGGAAC CAACCCAACC GCTACGCCTA TCAGTAGTGG CGCAACGCTC TATAATTTTG AGGGCAGCAC CCAAGGCTGG AGCGCCGCCA ATGTCAATGC AGGGCCATGG TCGGTCAACG AATGGGCCGC CAATGGTAGC TACAGTCTCA AAGCCGATGT GAGTTTGGGC AATCGTAGTT ACGACTTGAA ATTAACTCAA GCCCACAATT TTAGTGGCAA AAGCCAACTC CAAGCCCGCG TGCGCCATGC CACTTGGGGC AATGTTGGCA GCGGAATTAG CGCCAAACTC TATATCAAGG TTGGCTCAAA TTGGGCTTGG TACGACGGCG GAGCAATCAC AATTAATAGT GGGGGCACAA CTACATTAAC CCTCAACTTG AGTGGCATTG CCAATCTTGG TATCGTCAAC GAAATTGGGG TTAGTTTTAG TTCGCCCGCT AATAGCAGCG GCACTAGCGC AATTTATCTC GATTATGTCA CCTTGCAATA A
|
Protein sequence | MSQVQRVPLL LLGMLALLIS TIPSTSTSGA TNAYAAADPA IASYAQNVIN RMAQYQGHSI ISGQQEVHWD SSRKDEMSNA IYNRTNPRQY PGLRGWDFPF GGAYANDAQW MIDTMISDWN TAKVLPTISQ HWTPYGSQGT NHDDMFVQVN INNIFVDGTT ERSRYLTWRS NIADDLQKLE NASVPVLWRP YHEAGGGWFW WDKIGSSNYK RLWNDLWDYL TNSRGLHNLI WVWSAGTKGV GTDWYPSGKV DILGHDIYNN SSADYSSWYT DLARFDSSKL RALTEVDYML DPAALNNAPF AFFMTWHTDM FYRNSDSKIQ SVYQHNKTVN RSRISQYLNG RLGSSGTNPT ATPISSGATL YNFEGSTQGW SAANVNAGPW SVNEWAANGS YSLKADVSLG NRSYDLKLTQ AHNFSGKSQL QARVRHATWG NVGSGISAKL YIKVGSNWAW YDGGAITINS GGTTTLTLNL SGIANLGIVN EIGVSFSSPA NSSGTSAIYL DYVTLQ
|
| |