Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1177 |
Symbol | |
ID | 5733070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1349957 |
End bp | 1351756 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278317 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001543953 |
Protein GI | 159897706 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTAT TACCAAAACC AAAAAACCTT CAGCGTGCTG AGGGAAGCTA TGCTTTACCC AAAATCTGGC AGATTAGCCT ACCTAGTGGG CTTGATCAGC GGGTTACGGC CAGTTTGGCG GCACTGGGCG AATATCAAAT GGTTGAGCAA GCTGCCAATT TGCTGCTAAC GCTTGATTCA AGCATCGTTC ATGCCGATGG CTATCGTTTG CAGATTGCTG CTGATGGCGT GCAGATTGTG GCGCAAACTG CGGCAGGCTT GTTTTATGGC CTGCAAACGC TGCGCCAAAT TCAACAACAA TCAGCTGAGC AATTGCCGTA TCTATTAATT GATGATGCCC CTGATTTTGT GGTGCGTGGC TTTATGCTCG ATATTAGCCG CGATAAAGTG CCAACCATGG CCACGCTCTA TGCGCTAATC GACGAATTAG CTAGCTGGAA AATTAACCAA ATTCAGCTGT ATATTGAGCA TACGTTTGCT TACCAAAACC ACCCCACAGT ATGGGCTGAT GCTTCGCCGT TGACTGCCGA GGAAGTACGG GCGCTCGATG ATTTCTGTCT TGAACGCTTT ATCGAATTAG TGCCCAACCA AAATTGTTTT GGCCATATGC GACGTTGGCT GACCAAACCA GCCTATCGCG ATTTGGCCGA ATGCCCCGAT GGCTGCGACA CTGGCGACCC CGATTGGGGC TATTTTGAAG AACCCTTCAC CTTAGCTCCC GAACACCCAG GCAGCATCGA ATTAGTGCGT AGCATGCTCG ACGAACTATT GCCCAACTTC CGCACTCGCA CACTCAACGT TGGCTGCGAC GAAACGGTCG AATTGGGCTT GGGGGCAAGT GCAGCAGCAG TTGCTGAGCG CGGCAAAGGT CGGGTTTATC TCGAATTTTT GCAAAAGCTC TACCACGAAG CCCATTCACG CGGCTATGTG ATGCAATTCT GGAGCGATAT TATTTTGCAT TATCCAGAAT TGGTCAGCGA ATTGCCCCGC GATGCCGAGG TGTTGATTTG GGGCTACGAG TCGCATCATC CCTTTGAAGA ACAAGCCGCC ACCATTGCCA AAGCGGGCTT GCCCTTCTAC GTTTGCCCAG GCACGTCAAG CTGGAACACC GTCGCAGGCC GCACCACCAA CGCCTTGGAA AATATCGCGA GCGCCGCCAA ACATGGCTTG AACCATGGTG CAAAAGGCTT TTTGATGACC GATTGGGGCG ATAACGGCCA CTGGCAACCA CTCGTGACCA GCTATCTTGG CTTGGCAATC GGCGCTGCTC GGGCTTGGAA TGCGCTCGCT GAGCTTGATG TGGTTGCCTT GCTCGATACG GTGGTTTTTG CCGATAGCAA CAAGGTTTTG GGCAATTTGG TCTATCAATT GGGCGATGTC TATCGCGCTG TACCACCGTT ATTGCACAAT ACCTCGTCGT TATTCCGCAT TTTGCAGGCC AACCCAGCCG CAATTGCCGA ATTAAACCTT GATCGCAAAA ATTTGCATAA TGCCGATGAG ATTTTGCTGC AACTGCGCAA CGACTTGCAA ACCATCAAAC CCCAACGCGC TGATGGCGAG CTTTGCCAAG TTGAATTAGC TTGGGCGATC GATTTATTGC GCCATGCGGT GCAACGCGGT TTGTGGGTGC TTGATCAACA GCCAAGCGAA ACCGCTAGCA AATTGCAACC AGAAATCGAT GCCTTGATCG AACGCTTCCA ACAAGTTTGG CTGAGCCGCA ATCGCCCCGG TGGCTTGCAA GATAGCCTTA AACATTTTGC AAGCTTGCGC GATAGCTATG GCGAGGTGCG TTATGCCTGA
|
Protein sequence | MQLLPKPKNL QRAEGSYALP KIWQISLPSG LDQRVTASLA ALGEYQMVEQ AANLLLTLDS SIVHADGYRL QIAADGVQIV AQTAAGLFYG LQTLRQIQQQ SAEQLPYLLI DDAPDFVVRG FMLDISRDKV PTMATLYALI DELASWKINQ IQLYIEHTFA YQNHPTVWAD ASPLTAEEVR ALDDFCLERF IELVPNQNCF GHMRRWLTKP AYRDLAECPD GCDTGDPDWG YFEEPFTLAP EHPGSIELVR SMLDELLPNF RTRTLNVGCD ETVELGLGAS AAAVAERGKG RVYLEFLQKL YHEAHSRGYV MQFWSDIILH YPELVSELPR DAEVLIWGYE SHHPFEEQAA TIAKAGLPFY VCPGTSSWNT VAGRTTNALE NIASAAKHGL NHGAKGFLMT DWGDNGHWQP LVTSYLGLAI GAARAWNALA ELDVVALLDT VVFADSNKVL GNLVYQLGDV YRAVPPLLHN TSSLFRILQA NPAAIAELNL DRKNLHNADE ILLQLRNDLQ TIKPQRADGE LCQVELAWAI DLLRHAVQRG LWVLDQQPSE TASKLQPEID ALIERFQQVW LSRNRPGGLQ DSLKHFASLR DSYGEVRYA
|
| |