Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4465 |
Symbol | |
ID | 5736316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5709685 |
End bp | 5711124 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281628 |
Product | O-antigen polymerase |
Protein accession | YP_001547225 |
Protein GI | 159900978 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000989096 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGCAA CGTTTGATCA GCGGCGGCGG CGTGAATTTG GCTTGATTGT TGGCGGCACA TTAGTTGGCA TGGGCTTAGG CGCTGCTGCC GCGTTTGTGC CGAGCTTTTT GGTCGTCGCT GGTTTGGTGG CGTTGTTGGT CGGGGCATGG TTTGCCCGTT CGATCCACTC GATGCTGACG GCCACCGTGT TGGTGGCAAC GCTCTTACCA TTTGGCACCT TGCCTTTCAA AGTTGGGCTA ACTCCAGCCT TGCTCGAACT AGCATTATTG GCGCTGTATG CCATGTTGGT GGTGCGCAGC TTGGCCGACC CTGAGCGTAC TTGGCGTTGG GGCAGCCTAG CCCCATGGGT GATTTTGCTG CTAGCTAGTT CGTTTTTCTC CTTCATCATT GGCTCGAATG GCTCGCCCGA TAGTTTGCTG CTGCATAATT ATTTCAAATT GCTGCTAGGG ATTTGCTTGT TTTTGGGGGT GCAAAATGCG CTTGATTCAC TAGAACAGGC GCGTTGGTGG CTGCGCTTGC TGATTTTGGC TGGCTGGGCG GCGGCATTGT TAGGCATTGG TTTGCGCTTT GCCCCCGACG CTATGGCTTT GCGCTTTTTG ACCGCACTTG CTCCGCTGGG TTATCCGGCG AGTGGTCGGG TGTTACGCTA TGTTGAAGAT GACCCCAGCG GCTTTGAGCG AGCAATTGGT ACTTCGGTTG ATCCCAATGG CTTTGGTGGG ATGATGGCCT TGCTAGGAGC GATTGCGCTT GGTCAAGCCT TGGCCCAGCG CCCAGTCTTA GGCCGTAAAT GGCTATGGCT GATCACCGCT AGTTTTGCTT TGGCTGTATT TTTGACATCC TCACGGGCTG CCTTGGGTGG CTTTATGATC GCCGGCTTAT TTTTGGCAAC CGTGCGCTAT CGCCAATTGT GGTGGCTGAT TGGCGCTGGC GGTCTTGCTG GCGCAATCGC GATTGTGGGC TTGGGCAAGG GTGGCGATTT TGTCGAGCGG ATCGTCGAAG GCATTCAATT CAAAGATCAA GCCAACCAAA TGCGCTTGGC TGAGTTTCGC AATGCAATCG CGATTATACG CGAGTATCCG GTGTTTGGGG TGGGTTTTGG TCGCGCACCC AACATTGATC TCACAACTGG TGTGAGTAGT GTCTATTTGG CGCTTGGCTC GCGTATGGGT TTGGTTGGCT TAGGCCTCTA TATTTTAACT GCACTGGCCT TTTTGGTGCT TACCACTCAG GCTGCACGCC GCTGTGAACG CTCGGTAAGC GATGCAATTA TTGGTTTGCA GGCAGCAATT TTGGCGGCGC TAGCAGTTGG TTTGCTCGAT CATTATTTCT TCAATATTGA GTTTCCGCAT ATGGGGACGC TATTTTGGGG GGTGGTTGGC TTGGCGATGG TGTTTATGCG CGAGGTAAAG AATGATCAGC TAACTTCATC ATTGAAATAA
|
Protein sequence | MFATFDQRRR REFGLIVGGT LVGMGLGAAA AFVPSFLVVA GLVALLVGAW FARSIHSMLT ATVLVATLLP FGTLPFKVGL TPALLELALL ALYAMLVVRS LADPERTWRW GSLAPWVILL LASSFFSFII GSNGSPDSLL LHNYFKLLLG ICLFLGVQNA LDSLEQARWW LRLLILAGWA AALLGIGLRF APDAMALRFL TALAPLGYPA SGRVLRYVED DPSGFERAIG TSVDPNGFGG MMALLGAIAL GQALAQRPVL GRKWLWLITA SFALAVFLTS SRAALGGFMI AGLFLATVRY RQLWWLIGAG GLAGAIAIVG LGKGGDFVER IVEGIQFKDQ ANQMRLAEFR NAIAIIREYP VFGVGFGRAP NIDLTTGVSS VYLALGSRMG LVGLGLYILT ALAFLVLTTQ AARRCERSVS DAIIGLQAAI LAALAVGLLD HYFFNIEFPH MGTLFWGVVG LAMVFMREVK NDQLTSSLK
|
| |