Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2177 |
Symbol | |
ID | 5734064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2758018 |
End bp | 2759754 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279318 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001544945 |
Protein GI | 159898698 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3325] Chitinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0141597 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGTC AACGCGCCTT GAGCATGAAG GTTTATTTGC TGTTGGCTCT GATCATGCTG GCTGGGAGTT TTGGTGCTGG ATTTGATCAA CCCAAATCAG CCCAAGCCCA AATCGCCTAT AAAATTGTGG GCTATTTGCC ATCGTGGCAA GGCAGTGTCA ACGGCACTCA AATTGATAAA CTGACCCACA TTAACTATGC TTTTCTGTTG CCCAACAACG ATGGCAGCCT CAAGCCAATC GAAAACTCCA GCAAATTGCA AGAGTTGGTA TCGGTCGCTC ATAGCAAAAA CAAAAAAGTG CTAATTTCAG TTGGTGGCTG GAACGACGGC GATGATAGTG CCTTCGAAAG TATCGCCGCC AACGCCAGCT ATCGCACGAA TTTTGCCAAT AACCTCAATA ATTTCGTCAA CCAATATAAT CTCGATGGGG TTGATATTGA TTGGGAATAT CCTGAGGCTG GCGATTTTTA CTTTGAAACG ATGTCGGCGA TCCGCAATCG CATCGGCTCG GGCAAATTAT TAACTGCGGC AGTCGCTGCA ACCAATGCTG GCGGCTCAGG CGTAACTAGC AATGCCATCG AAATTATGGA TTACATCACG CTCATGGCCT ACGATGGTGA TGGCGGCGCT GGTCATTCGC CCTATAGTTT AGCTCAGCAA TCGCTCGATT ATTGGGGCAC CAAAACTAGC AATAAGGCTA AATTAATCTT GGGCGTGCCG TTTTATGCTC GCCCAGGCTG GTATGGCTAC AACACCTTAC GCGCTGGTGG TTGCTCAGCC GATAGCGATA GCTGCTGGTA TGGCGGCGCA ACCCAATACT ACACGGGCCG CCCAACCATG CGCGCCAAAA TCGATTTGAT GAAAAGCAAG GGCGGCGGTG GGATTATGAT TTGGGAATTG AGCCAAGATA CGGCTGTTAC CAGCAGCGAT TCCTTGCTCA AAACCATTGC CGATCAGCTT GGCACACCCA GCACGCCAAC TGGCAATTTA GCCCTGAACA AAGCCGCTAC TGCTTCATCA ACCGAAAATA GCAGCTATGG CGCAAATCTC GCGTTCGATG GCAACAGCTC CACCCGTTGG TCGAGCACAT GGAGCGATCC ACAATGGCTA CAGGTTGATT TAGGCGCGGT GTATGCGATC AAACAAGTGG TTTTAAAATG GGAAGCCGCC TTTGGCCGCG CCTATCAAGT GCAAGTTTCC AACGATGGCA ATACCTGGCG CTCGATCTAT AGTACAACCA ATAGCGATGG CGCAACCGAT GATTTAGCAG TTTCCGGCAT TGGGCGCTAT CTGCGCGTCT ACGCCACAAC TCGCGCCACC GAGTGGGGCT ACTCGCTCTG GGAAGTCGAG GTTTATGGCA ATGCGTTGGC AACCAGCGCT TCATCGACCG AAGCAGGCGG CAGCACGACC AACGCGATTG ATACTAACGG CACTACTCGT TGGAGCGCAG GCTTGCCCCA AGCAGCAGGC CAGTGGTATC GGGTCGATTT TGGCAATAAC CAAAGTTTCA GCCAAATCAC GCTTGATGCA GGCCCATCCT ACGGCGATTA TCCACGCAAC TTCCAAGTGC AAGTTTCCAA CGATGGCAAT AGCTGGGCAA CCGTAGCTAC TGCCACTGGT ACAACCCAAG CAGTAACCGT CAACTTCGCT GCTCAAAATA GCCGTTATCT GCGGGTATGG CTCACCAGCA CTGGCGGCGG CAGTTGGTGG TCGATCCACG AATTAACCGT GCGCTAA
|
Protein sequence | MNRQRALSMK VYLLLALIML AGSFGAGFDQ PKSAQAQIAY KIVGYLPSWQ GSVNGTQIDK LTHINYAFLL PNNDGSLKPI ENSSKLQELV SVAHSKNKKV LISVGGWNDG DDSAFESIAA NASYRTNFAN NLNNFVNQYN LDGVDIDWEY PEAGDFYFET MSAIRNRIGS GKLLTAAVAA TNAGGSGVTS NAIEIMDYIT LMAYDGDGGA GHSPYSLAQQ SLDYWGTKTS NKAKLILGVP FYARPGWYGY NTLRAGGCSA DSDSCWYGGA TQYYTGRPTM RAKIDLMKSK GGGGIMIWEL SQDTAVTSSD SLLKTIADQL GTPSTPTGNL ALNKAATASS TENSSYGANL AFDGNSSTRW SSTWSDPQWL QVDLGAVYAI KQVVLKWEAA FGRAYQVQVS NDGNTWRSIY STTNSDGATD DLAVSGIGRY LRVYATTRAT EWGYSLWEVE VYGNALATSA SSTEAGGSTT NAIDTNGTTR WSAGLPQAAG QWYRVDFGNN QSFSQITLDA GPSYGDYPRN FQVQVSNDGN SWATVATATG TTQAVTVNFA AQNSRYLRVW LTSTGGGSWW SIHELTVR
|
| |