Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1175 |
Symbol | |
ID | 5733068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1348367 |
End bp | 1349506 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278315 |
Product | N-acetylglucosamine-6-phosphate deacetylase |
Protein accession | YP_001543951 |
Protein GI | 159897704 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1820] N-acetylglucosamine-6-phosphate deacetylase |
TIGRFAM ID | [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGCC TACGTTTTGA AAATGCCACG ATCTATACGC CAGAGCTAAG TGCTGCGCGT AGTTTGGCGA TTGCCAATGG TCATGTGGCT GAGGCTGCTC CTACAGCGCC CTGCTACGAT CTGACTGATC TGATTGTTGT GCCAGGTTTG ATCGATTTGC AACTGAATGG GGCATTTGGC CACGATTTCA CCAGCGATCC TCATACGATT GGTGCGGTTG CTGCTGGCTT ACCGCAATAT GGCGTGACAG CATTTTTGCC AACGATCATT ACTTCACCGC TGAGTCAAGT TGCGGCGGCT CAACAGGTGG TGCAACAGGG CAATTTTACT GGTTCGCGGG TATTGGGGTT GCATCTCGAA GGGCCGTTTC TCAACCCAGC CAAGCGTGGT GCGCATAATC CCAGCCATTT GCAAAACCCC AGTTTAGCGG CGATTGAAAC TTGGTCGCCA GCCAATGGCG TGCGTTTGGT AACTTTGGCT CCTGAACTTG ATGCTGCCGA CGAACTCATT CGAGCATTGG TTGAGCGGGG AGTGGTGGTC AGTGCTGGCC ATTCCGAGGC CACATTTGAA GAAGCTGAGG CTGGTTTCAA TCAAGGCATT CGCGCGGTAA CCCATTTGTT CAACGCCATG CCAGCCTTGC ACCATCGCGA ACCAGGTTTA GCCGGAGCCG CCTTGAGCGA TCAACGGATC ACCATGGGCT TGATTCCCGA TGGCGTACAC GTTCATGCAG GCTTGGTGCG CCATATTTGG CACAGTGCCA GTCAACGGAT TGCAATTGTC AGCGATGCCC AAGCTGCGCT CGGCATGCCC GACGGCGAAT ATCTGCTTGG CGATACAACC CTGACCGTAG CCAACGGTGA GGCTCGGCGC AGCGATGGCC GTTTGGCGGG CAGCGTGTTG GCGATGGATC AAGCTTTACG CAACATTCAT GCGTGGACGA ACAGCCCGCT TGAGCAGATT CTGCCAGCCT TTACCACGAT TCCGGCCAAT TTGTTGGGCT TAGCTCACTA TGGCCGGATT GCAATCAATA ACCCTGCTGA TTTAGTCATT TTTGATCAGC AGCACTATCA GGTTGTCGCC ACACTTGTGG GCGGCAACAT CGTCTATGGT TCTACAACCT TGGAGCAACG TCAATGCTAA
|
Protein sequence | MSSLRFENAT IYTPELSAAR SLAIANGHVA EAAPTAPCYD LTDLIVVPGL IDLQLNGAFG HDFTSDPHTI GAVAAGLPQY GVTAFLPTII TSPLSQVAAA QQVVQQGNFT GSRVLGLHLE GPFLNPAKRG AHNPSHLQNP SLAAIETWSP ANGVRLVTLA PELDAADELI RALVERGVVV SAGHSEATFE EAEAGFNQGI RAVTHLFNAM PALHHREPGL AGAALSDQRI TMGLIPDGVH VHAGLVRHIW HSASQRIAIV SDAQAALGMP DGEYLLGDTT LTVANGEARR SDGRLAGSVL AMDQALRNIH AWTNSPLEQI LPAFTTIPAN LLGLAHYGRI AINNPADLVI FDQQHYQVVA TLVGGNIVYG STTLEQRQC
|
| |