Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0065 |
Symbol | |
ID | 5731938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 84367 |
End bp | 85875 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277187 |
Product | O-antigen polymerase |
Protein accession | YP_001542845 |
Protein GI | 159896598 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.321315 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCTTG CACGTTGGCG TTGGCATCCC CTCACAACGC TTGGGCTGAG TGGGCTGGCG GCCTTGGCAA TTGTGGCGCT CTATCGGCCC AAACGCTTTG TGCCATTGAT TGGCATTGAT GAGGCGGCTG CGCATAATTT GCTGTTTGCG CTAGCGTTGT TGGTGTTATT TGCCCTACTG GCTTGGCTGC GACCCGAAAT TGGCTTGGCT GGAGTGGCCG CGACGATTCC GTTTAACTAT CGCTCACGCG GCTTTTGGGA TGGCAACTAC CCATTTATTG ATGGCAAATA TTTCCCGTTG CACGAGTTGT TGCTGTTGGT CGTCTTGGGT GTTACGGCGC TCGATGTTGG TTGGCGCTTG ATCCAACGCC AAATCGCATG GCAAATGTGG TGGCGTGAAC ATTGGCGCAT GCTGCTGTTG CCCTTGGCTT GGCTGCTTGT GGCTACATTT GCGGCAAACT TGGCCGTGCC CGAAGGTCAA GCTGAGGCAT GGCGCGAATG GCGTTGGATG ATTATCGAGC CATTGTTGCT GTATGGCTTG ATTTTGTATT GGCAACCACG CTACCCAGTG CGGCGCTGGT TACTTTGGGG TTTGCTAGCT GGCAGCGTAA TTGTGGCGAT CATTGGCATT TTGCAGTGGC GCGACCTCGA TTGGACTCCG ATTGATGGTA CGGGCATGTG CTTTAGCGAT TTGATTGTTG ATTCTGGCGG CACAAAACGC ACATCATCGG TCTACTGCCA CCCCAATAAT TTGGCCTTGT GGATGGATCG CGCCAGTATG TTGGCGGTTG TCGCGGCAGC ATGGGCCTGC TGGCAATGGT GGCGCAAACG CACCTGGCAA ACGGCTATAT GGTCGTTGCT CTATCTTGGG GCCAGCAGCT TGTTGCTATT AAGTTTGATG CTAACCTATT CCAAGGGGGC ACGGTTTGCA GTGGCCTTGG TATTGATTGG CCTAAGCTGT TTGCCGCGCC GCTGGTGGCT ACCGCTGATC ACGAGCGCTG TGCTTGGTGG TTTGTTGCTT TATTCGTCGT TGAGTGGTCC TGAGCGCTTG AACGTGACGG GCGATTCCAG TTCGGCGCGT TTGAGTATTT GGCGCTCAGC CACGGCGATG ATTATTGATC ATCCAATTGT AGGCATTGGG CTTGATCAAT TTTATTTCTA TTTCAATCCA CAATTCAATC GTGGCTATAT CGAGCCACGC CTTGCCGCCG ACCCTGCCGA GCGTAACACC GCCCATCCGC ACAATTTGCT ACTCGATTTG TGGTTGCGGG TTGGGATTGC GGGAGTGCTT ATTTTTGCGG CGTTGGCGTG GCGCAGCCTG CGGCGCACTT GGCAAATTTG GCATAGCGAG CAGCCTGAGC GCTGGTTGGC GTTAGCGGCT TTGGCGGCCT TGATGGCTGG CTGGTTGCAT GGCGGCGTTG ATCAAGGCTA TTTCTCCAGC GATTTAGCAA TGGTGACATG GTTAACCCTA GGGATGATCG ATAGCTTCAC CCTAAAAGGT CAAAATTGA
|
Protein sequence | MMLARWRWHP LTTLGLSGLA ALAIVALYRP KRFVPLIGID EAAAHNLLFA LALLVLFALL AWLRPEIGLA GVAATIPFNY RSRGFWDGNY PFIDGKYFPL HELLLLVVLG VTALDVGWRL IQRQIAWQMW WREHWRMLLL PLAWLLVATF AANLAVPEGQ AEAWREWRWM IIEPLLLYGL ILYWQPRYPV RRWLLWGLLA GSVIVAIIGI LQWRDLDWTP IDGTGMCFSD LIVDSGGTKR TSSVYCHPNN LALWMDRASM LAVVAAAWAC WQWWRKRTWQ TAIWSLLYLG ASSLLLLSLM LTYSKGARFA VALVLIGLSC LPRRWWLPLI TSAVLGGLLL YSSLSGPERL NVTGDSSSAR LSIWRSATAM IIDHPIVGIG LDQFYFYFNP QFNRGYIEPR LAADPAERNT AHPHNLLLDL WLRVGIAGVL IFAALAWRSL RRTWQIWHSE QPERWLALAA LAALMAGWLH GGVDQGYFSS DLAMVTWLTL GMIDSFTLKG QN
|
| |