Gene Information |
Locus tag | Haur_2378 |
Symbol | |
ID | 5734259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3029937 |
End bp | 3032039 |
Gene Length | 2103 bp |
Protein Length | 700 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279519 |
Product | O-antigen polymerase |
Protein accession | YP_001545146 |
Protein GI | 159898899 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
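The coordinate fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that Protein Length excludes the stop codon. The function names are illustrative and not part of the database.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive on replicon NC_009972.
START_BP = 3029937
END_BP = 3032039


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2103
print(protein_length(length_bp))  # 700
```

Both values match the Gene Length (2103 bp) and Protein Length (700 aa) fields of the record.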
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.868657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGAAAG ATCGTTGTAT GGCTCAATCA CGCCAGTTTG ATCGGGTATT GGCTTTGCTC GTGCTTGGCC TCAGCTGCTG GACATTATTC TTACCAATGC CAGCCTTATC GAATGCCAAC GCTCCAGCGG CTTTGGTGCA GGCCAGCCTT GCCACGCTTG ATCAACAGGC TGAGCAACGC CAAATTGCCG TCATTGCTGC TAGCGTGCTT GATGGCGAAC GTGGCGATTT TGAGCGCCGT GGCGTGCAAT TGGCCCAACT TAGCCCCGAA CAACAAAGCC AAGTGCGGCG TGAATTGGCG GTTGATATTG AATTATTGGC CCAAACCTCA CGCGCTGCTG AGCATGAACG TCAAGCCTTG CAAGCCTACG ATACGGCTTT GATGGGTCAA ACCCGCGATT TAGGCGCATT GGCTGAGCAA CTACGTTCAG CCACATGGCC AATCGTCGAA CACCTCAAAC TGTATCCACC ACCACTCGGA ACCCGCAGCG ATTGGCTAGC CCCGAACAGC AACTATTTTC AAACCCGCGC AATTACGATT AGCAATGGCA CACTTGAACA AGCGACCCAA GCAGCGATTG AGGTTGGTCG GGCCAGTTAT AGCCAACGCC AACTGCGCGA GCTTGATCAA ACCTATCGAC AGGTGCTTGA CCAATATGCC CAAACCTTGC AACAAACAAT CACCAGCCAA ACTAATGTGC GGCCATGGTG GCGTTGGGTT TTGGCAAGTG GTTTAGCCTT GGGTTTTGCA GGTTTGTTGT GGTGGGTTTT ACGCGCTATA TTAGCCAAAG ATTGGCTGTT GTTGGCGGCG GGCGTGGCGT TGGCGGCATG GTTTGGGCTG CCATGGCCTG TGGCGTCGAT TGGACTGCTA GCATTAGTTG GCTTAATTGG GTTGCGCCCA AGTTTAGCGG CTTGTCTGCC ATTGCTCGCA ATTCCGCTGT ATTATCGCGC TCGCTTGGTT GGCAACTTGA GTTTTCCGCT CAACGAAACC TTGTTTGGCA TCAGTGCTGC TGGCTTGCTA TTGCATGTGA TTTGGCGTTG GTGGCGTGGT CAACGGCCTG ATGTTGCTTG GTTCAAAACT AGTTGGTATT GGCCAATTGG GGCAGTAGCG CTGGGTTTGG TGCTGGCGGC TGGGTTGAGC CTCGGGTCTA GTGGCTTGGT CGAGCCTGCG ACAGCCTTGC GAGAATTGCG GCGCACGATC ATCGAGCCAG CGATTTGGGC TTGTTTGGCC TTGCTATTAC TGGCAACTCA GCGAGTTAAA GCGCAAAGCA TGCTCTGGAG CTATATTGTA ATGGCGGCTT GGATCGCTGG TGATGGCTTG GTGCGCTTTG CCTTGGGCGA AGGCGTTTGG GCCACAACTG GCGTGCCACG CTTGATCGGC CTGTTGCCAA GCTCGACCGC CTTGGGTGTT TACTTGGGCG CTGGTTTGGC TGGTAGTTTA GCCTTGGCCT TGAGCGCCGA ATCGGCCAAA CAGCGTCAAT TGGCGTGGTT ACTCAGCCTA CCACTGAGTT TGGGTGTTTT GTTGACCTTC ACCCGTGGCG CATGGCTGGG CGTGGTTGGC GCTGTTGCTC TGGTTTTGCT GCTACAACGG CGTTGGCGTT TGTTGCTTGG CGCTGCTGGC TTAGCTGGTG TTGGTTTAGC GGGCTTTGGC TTGATTCAGC CCAGCTTGCT GGCCCGCGTG TTGCGGCTTG GCGAGGGCAC TGGCTCAGCT CGCCAAGAAA TTTGGGCTTC AGCCTGGCGA GCGGTGCAGG ATCAACCACT GCTTGGATTT GGGCTTGATC AATTTGCCCA GCTTGAGCCA 
AGCCGCTATG GAATTCCACA GATTCGTTTT TTGACCTTGG CGCATCCGCA TAATCTGCTG CTTGATGTCT GGTTACAATT AGGCTTGCTC GGTTTGCTGG TAGTTTTGGC CATGCTTGGC TGGCAGGTTT GGCGTTGGTG GGGCAGCAAA CAGGCTTTGG GGTTGGTGGG GCTGGCAATC TTGGCAGATT TAGTGATTCA TGGCATGCTT GACCAAACCA TGCTTGGCGG TGATATGATA TATCTCTGGT GGAGCTTGCT CCTTATCAGT GTTTGGCGGC CATCCGCAAC GGAGGAATCA TGA
Protein sequence | MWKDRCMAQS RQFDRVLALL VLGLSCWTLF LPMPALSNAN APAALVQASL ATLDQQAEQR QIAVIAASVL DGERGDFERR GVQLAQLSPE QQSQVRRELA VDIELLAQTS RAAEHERQAL QAYDTALMGQ TRDLGALAEQ LRSATWPIVE HLKLYPPPLG TRSDWLAPNS NYFQTRAITI SNGTLEQATQ AAIEVGRASY SQRQLRELDQ TYRQVLDQYA QTLQQTITSQ TNVRPWWRWV LASGLALGFA GLLWWVLRAI LAKDWLLLAA GVALAAWFGL PWPVASIGLL ALVGLIGLRP SLAACLPLLA IPLYYRARLV GNLSFPLNET LFGISAAGLL LHVIWRWWRG QRPDVAWFKT SWYWPIGAVA LGLVLAAGLS LGSSGLVEPA TALRELRRTI IEPAIWACLA LLLLATQRVK AQSMLWSYIV MAAWIAGDGL VRFALGEGVW ATTGVPRLIG LLPSSTALGV YLGAGLAGSL ALALSAESAK QRQLAWLLSL PLSLGVLLTF TRGAWLGVVG AVALVLLLQR RWRLLLGAAG LAGVGLAGFG LIQPSLLARV LRLGEGTGSA RQEIWASAWR AVQDQPLLGF GLDQFAQLEP SRYGIPQIRF LTLAHPHNLL LDVWLQLGLL GLLVVLAMLG WQVWRWWGSK QALGLVGLAI LADLVIHGML DQTMLGGDMI YLWWSLLLIS VWRPSATEES
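The relation between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand) and uses the codon assignments of translation table 11. Alternative start codons, which table 11 allows, are not treated specially here because this gene starts with ATG. The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# These amino-acid assignments are used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGTGGAAAG ATCGTTGTAT GGCTCAATCA"))  # MWKDRCMAQS
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` is how the Protein sequence and GC content fields could be cross-checked.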