Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0400 |
Symbol | |
ID | 5731968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 470013 |
End bp | 471179 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277523 |
Product | cell wall biosynthesis glycosyltransferase-like protein |
Protein accession | YP_001543179 |
Protein GI | 159896932 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.443716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTGGT TGGCTTGGTG GTATCTGCTT GATCGCTGTT GGCGTTGGCT GGTGATTCGA CGGTTTTTTG CCAAGCACCA ACCACCAGCA CCCAAACATT GGCCGAGCAT TGATTTGATT CAACCGTTAA CCCATGGGGT GTTTGATTTA GCCAAAACCT TGCAACAGCG GCTCGATTTG CGCTATGCGG GCCAACTTCA ACATTATTGG GTGATCGATC AAGCAGATAA TCAAACATTG GCAGTGTGTT CTGCTTTACA GAAAAAACAT CCTGAGCAGC ACATCACGAT TATTCAAGTT GCGCCAGATT GGGGCCAACG TGCTTCGAAG TTAGTGAAAT TGCAAGCTGT GCTGGCCCAA GCCAACTCCG AGATCGTATG GTTTGTTGAT GATGATGTTA GCTTACCGCT TGATGGCTTG AGCCAAGGCC TACCCTATTT GTTCCAGCCG CAGGTTGGGG CGATTTTTGG CTTGGCCTGC TATGTGAATT GGCATAATTT CCCGTCGGCT TTGATGAGCA ATTTTGTCAA TGCCAATGCC TTGCCCAGCT ATATCGGTTT GGCGGCTTTA ACCGAGCCAT ACACAATTAC TGGGCATCAA TTTGCTTTGC AACGCAGCGT GTTTGAACAA ATTGGCGGGC TGAGTGGCAT GTACGGGCGG ATTGACGACG ATCATGAGTT GGCGCGGCGG GTGCAAGCCC ATGGTCTGCG CAATTTGCAA ATGCCATTAA TCTATCAGGT AGATAATTAT TTTGTGAATT TACCAGCCTA TTTTCAGCAG ATGCAGCGTT GGTTTACGAT TCCCCGCGTG CTGATGCTGC CGCATCTCAG CCAATACGAC CAGTTTGTGA CCATTTTGAG CAGCCTTGCC CAGCCGATTC CCACGATTTT GGCCTTGGCA AGCATGCGCC AGCCAAAACT ACGGCCTTGG CTCGTGGCAT GTTTATTGGC GCAGTTGAGC TTGCAGGGTT GGCAAATTCG GCGCTATTGC CAAACCAAAG TGCCGTGGTG GGCTTGGCCC TTGAGTATCA TTGGTACGTT GATCGATCCC TTGCTGATGC TTTGGGGTTT GCTGGGCGAT GATACGATTG TTTGGCGCGG TGAGCGGATT CGGCTGCGGC ATTCGGCCTC GGCTCAATGG TTGGGCAAGG AGCAAGATCA TGATTAA
|
Protein sequence | MRWLAWWYLL DRCWRWLVIR RFFAKHQPPA PKHWPSIDLI QPLTHGVFDL AKTLQQRLDL RYAGQLQHYW VIDQADNQTL AVCSALQKKH PEQHITIIQV APDWGQRASK LVKLQAVLAQ ANSEIVWFVD DDVSLPLDGL SQGLPYLFQP QVGAIFGLAC YVNWHNFPSA LMSNFVNANA LPSYIGLAAL TEPYTITGHQ FALQRSVFEQ IGGLSGMYGR IDDDHELARR VQAHGLRNLQ MPLIYQVDNY FVNLPAYFQQ MQRWFTIPRV LMLPHLSQYD QFVTILSSLA QPIPTILALA SMRQPKLRPW LVACLLAQLS LQGWQIRRYC QTKVPWWAWP LSIIGTLIDP LLMLWGLLGD DTIVWRGERI RLRHSASAQW LGKEQDHD
|
| |