Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3131 |
Symbol | |
ID | 5735003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3956856 |
End bp | 3958646 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280274 |
Product | polysaccharide deacetylase |
Protein accession | YP_001545896 |
Protein GI | 159899649 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.927219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGGT TTCGCCAATC TTCGATCATT CTGCTTCTGA CCATGGCTTT GTTGGGCCTT GGCAGCCTGT TGATCGCTCA GTGGTTACGC CCCGATCGTT CGCCACGCAT CTGGACAGGC GGCAATATTT TGGCTAATGC CGATTGGCAG AGCGCCGCCG ATAACGGGAT TCCCGATGGT TGGTCGGGCA ATGGCATCAA ACGCGCCGAT ACCACCAATG GCTATGTGCT CGATGATCAA TATAGCTTGC AGTTGTATGG GGTCAATAGT TTTGCCCGTT CGCCACGCTT AACTGCCCAA GCAGGCCAGC GCTATCGACT GGGCTTTCAA GCCCTGATCG ATCCCGGCAC GCAGCGAAGC AGTTTGGGCG CACAAATTCA GGTTTGGGTG CATTGGGTCG ATACTGCTGG CGATGATATT CGGCTCGATA AGCAAGCTCC AGTTTTGCTG GGCTTCGATA GCCAAGGCGC AGCAACCTGG ACTCCAGTGT TGGTTGAAAC TGAACCCAGC CCCAATCAAG CCGCTTGGCT AGCAATTTCA ATCCACGCGC TTTCTGATGA CCCAATTTAT CTTGATAATT TGAGTATGGC AGCGGCTGGC ATTTACATCG AACCCTATCC GCAGGGCGCG ATTGCTGCTG TGAGTTTTAG TGTCGATTGG GAAACAGCCA TGGGCGGGGC GATTCACTCG CTCAGTTTGT CAACCGATGC GATCAGCAGC GCCACGACCT TGGGTTTACA AGCTCGCCAA GGCACATACA ATTTGCTGGA CTTGTTTGCG CCGCATCACA TTCGCGGCAC ATGGTTTGGC AATGGCTATA ATTTTTTATT CGGCAACCAA GAGCGCCGCA CCTGGATGGA CGACCCAACC TTTGCTTGGG CTGCCAGTAC GCCGCGCCGT TGGCAAACCA TCGATTGGTC ACAAACGCCG TGGTTCAGCC ACGATCCCTA TGGCACGGTC GAGAGCGATC CAGCCTGGTA TTTTGGCGAT TTACTTGAAC CGCTACGAAA CGCTCAGCAA ACGATCGAAA GTCATACATT TAGCCATATG TACATTGGCT TTGCCCATGC CAAAGAGTTA GCTAGCGATA GCAATGAGTG GCAGGCAGTG ACAAGCAGCC AAGGCATTGC GCCCGCCAGC AGTTTAGCTT TCCCGTATGG CGGCAGCGAT GGCATGACCG AAGCCCATTG GCAAACCTTG CAAGCCGCAG GTATTCGCAC CGTCGTGCGC ACCCGCGTGC TTGATGCCAC CAACGTGCAA CGTGGTGTGA ATGATCGCCA TTTGTTGATC GATCGGCGTT GGTGGCAACC GCGCCAATTG CCAGGCCACG ATCTGGTTGC CTTGCCCGAT GTTTACCTGA CCCCGCAAAC CGCTATGACT GCAACCAATT ACTTGCAGCA AGCCATCGCT AGCGGCGGTG TGATCGATAT CTATGCCCAT AATTACGAAA TTTACAACCC TGAGCAAATT GGCGTTTGGC AAACGGCAAT TCAACAGGCA GTTGAAGCCC AAGCGTGGAT CGCCACAGTG CCGGAAATTG CCGACCGCTG GCGAGCACGC CAAACGCTCC AACTCACCAT CGAACAAACC CCTGATCAAC TTCGGCTTCA GCTCGCCAAT CCCAGTCGTT TCGATTTAGC CCAGCTTAGC TTGCGGCTTC CGGCTGAGAG CACTGGCAGC GATAAAGGCA ATTTTGATCA GGCTAACCAT CGGCTGATTC TTGATTTACC AGCAAACTCC GCCGAGGAGA TTACGATATG GCTCAAACCA TCAACGCCGC ACGTACCATA A
|
Protein sequence | MQRFRQSSII LLLTMALLGL GSLLIAQWLR PDRSPRIWTG GNILANADWQ SAADNGIPDG WSGNGIKRAD TTNGYVLDDQ YSLQLYGVNS FARSPRLTAQ AGQRYRLGFQ ALIDPGTQRS SLGAQIQVWV HWVDTAGDDI RLDKQAPVLL GFDSQGAATW TPVLVETEPS PNQAAWLAIS IHALSDDPIY LDNLSMAAAG IYIEPYPQGA IAAVSFSVDW ETAMGGAIHS LSLSTDAISS ATTLGLQARQ GTYNLLDLFA PHHIRGTWFG NGYNFLFGNQ ERRTWMDDPT FAWAASTPRR WQTIDWSQTP WFSHDPYGTV ESDPAWYFGD LLEPLRNAQQ TIESHTFSHM YIGFAHAKEL ASDSNEWQAV TSSQGIAPAS SLAFPYGGSD GMTEAHWQTL QAAGIRTVVR TRVLDATNVQ RGVNDRHLLI DRRWWQPRQL PGHDLVALPD VYLTPQTAMT ATNYLQQAIA SGGVIDIYAH NYEIYNPEQI GVWQTAIQQA VEAQAWIATV PEIADRWRAR QTLQLTIEQT PDQLRLQLAN PSRFDLAQLS LRLPAESTGS DKGNFDQANH RLILDLPANS AEEITIWLKP STPHVP
|
| |