Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4434 |
Symbol | |
ID | 5736285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5674199 |
End bp | 5675623 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281597 |
Product | O-antigen polymerase |
Protein accession | YP_001547194 |
Protein GI | 159900947 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGTC ACGCATTAAC TTGGCGCTCC GTGCAGCCAT TGGATTGGTT GCTAGCTAGC TTGGGGCTGG CAGGAGTGGC AATTATTACT CTGCTGCCCT TCACGCAGGC AGCAACCTTG ATTATTTGTG GCATGCTGCT AGTTTGCATG CTGATTCAGC CTGCCGTAGG TTTGAGCCTG ACGGTTGCCA CGGTCATGCT TCAAGAGTTA TTGAGCTTTC CGCTGGGCCT GACCGCCACC CATGTGATTG GGATTATGGC GCTGGGGGCG TGGTTGCTGT ATGGCATGGC GCAGCGCAAA ATCATCATCG ACACAACTTT GCTGGTGCCA TGGAGCCTGT TTTTGATGGC CTTGCTGCTC TCGGCGGGAC TGACCGAATA TAACGCGGTT GATGCCTTGA AGCAGGTGGT GCGTTGGTTT ATGGCCTTGC TGGCCTTTGT GGTGACGGTT GCCACAATTA CCACGCCCAA ACGCGCGATT GGCCTGATTG CGGTGATGTT CACGGTTGGG GTCATCGAGG CACTGATCGG TATTCAGCAA TATCGCGTGG GTGCTGGGCC ATTTGCCATT GGCGAAACTG TGCGGGCTTA TGGCACAATC GGTAAGCCCA ACACCTTTGC CGGCTTTTTG GAGTTGATGT GGCCCATGAC TTTGAGTGTA GCCTTGGGCT TGCTCTGGTT TTGGTGGCAG CAGCGCCAAC GCTGGCACTA TTTAATTGGC TCGGCCTTGA GTGCTGGCGC AAGCCTGATC ATTTTGGCGG CAGTTGGGGT TAGTTTTTCG CGCGGCGCTT GGATTGGCAT TATGGGTGCG GTGGTGGTGA TGCTGCTGGC GGTTGATCGG CGGCGAGCCT TGCCATTAAT CGCGCTTGGT GGAATCTTGC TGTTGGCGAT TATCAGCCAA CCTGAGCTTT TCCCCCCAGT GATTACCGAG CGAATTAGCA GTCTAACCAA CAATTTACGG ATTTTTGATG CTGGGCGGGT GACGGTTACC GATGAAAATT TTGCGGTTGT CGAACGCATG GCCCATTGGC AAGCGGGGGC AAATATGTTT TTGGCTCATC CGCTGCTCGG AGTTGGCCCC GACAACTTCA ATCGAGCCTA TCCCGAATTT TTTGTCGGGC GCTGGTCGGA ATCGCAAGGC CACTCGCACA ACTACTACAT TCATATTGCG GCAGAAGCTG GCATTTTAGG CTTTGTTGCT TATCTCGTGC TGATTGCAGC GGTCTATCGT CAAGCCTATT TGGCAATTCA GGCGACGCGC GGCACGGTTT GGCAGATGGT AGCAATTGGC TGCTGTGGTA TCATAACCGC CATTCAATTG CATAATGTTT TCGATAATCT CCATGTGTTG AATTTTGGAA TTCATTTGAG CGCAGTGTGG GCCTTATGTG TGGTTCTGAC ACAGCGCCAA GGGTGGCGTG CATGA
|
Protein sequence | MQRHALTWRS VQPLDWLLAS LGLAGVAIIT LLPFTQAATL IICGMLLVCM LIQPAVGLSL TVATVMLQEL LSFPLGLTAT HVIGIMALGA WLLYGMAQRK IIIDTTLLVP WSLFLMALLL SAGLTEYNAV DALKQVVRWF MALLAFVVTV ATITTPKRAI GLIAVMFTVG VIEALIGIQQ YRVGAGPFAI GETVRAYGTI GKPNTFAGFL ELMWPMTLSV ALGLLWFWWQ QRQRWHYLIG SALSAGASLI ILAAVGVSFS RGAWIGIMGA VVVMLLAVDR RRALPLIALG GILLLAIISQ PELFPPVITE RISSLTNNLR IFDAGRVTVT DENFAVVERM AHWQAGANMF LAHPLLGVGP DNFNRAYPEF FVGRWSESQG HSHNYYIHIA AEAGILGFVA YLVLIAAVYR QAYLAIQATR GTVWQMVAIG CCGIITAIQL HNVFDNLHVL NFGIHLSAVW ALCVVLTQRQ GWRA
|
| |