Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0196 |
Symbol | |
ID | 5732042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 231119 |
End bp | 232534 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277320 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_001542976 |
Protein GI | 159896729 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0256028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCGTC GCCTACTCAA AGATACTGCG CTGTATGCCC TAACTGGGGT GGCAACCAAA ATGATCGGGG CATTATTGCT GCCCTTGATC ACGCGCTTGC TTAGCCCTGA ACTCTATGGC AGCGTCGATT TGATTAGCCT TGTCGGTTTG TTTGCGATTG AGTTGGTGAC GCTCGGTAGC GATTTTGCCT TAGCCCTGTA TTTTCATGAA GCCACGATTG AGCGGCGACG TTTGGTTGGC TCGCTCTGGG TAGCACGGCT ACTTTTGGGC TTGCTGATCG CATTGATTGG CCATTTGCTT GCGCCATATT TGGCCTGGCA ATTGCTGCAA CGCAGCGATG CCCAAACAAT TTTGGCCTTG CGGCTTGGTT TGTGGGGGCA ACTCGCCAAC GGCATCATCG GTTTGTGGTT CACAACCCTG CGTCAAGAGA GCAAAGCACT GCGTTTATTT GGCCTGACGG TGCTGCGAGT AGCCTCGACC GCGTTACTGA CAATTGGCTG GATGTTGCAG AGTAGCCAGC GGTTGAGTGC CTATTTTGGG GCGATGCTGC TAGTTGATAG TTTGCTGGCG CTCGGCTTGA CCCTGCAAAT GCGTCGGCAA CTGGGCTGGC CTGATTGGCG CTTACTAAAA ACGTTGTTGG GCAAAGGTTT AGGTTTTTTG CCACGCTCGA TCTATTTCGT AGCGATGACC TTGATCAATC GCCAGATTTT GCTGCACTTT GGCTCGTTGG AGCAAATTGG CCAGTATGCA GCGGCAACCA AAATTAGCTT TATCGTATGG ATTGTGATTA GCGCTGCCAA TCAGGCTTGG TTGCCCTATA GCCTGTCGAT TGCTAGCACG CCGACCGCCA ACGCCAATTA TCGCCAATAT CTCACCAGCT ATACGATGTT GATGGGTGCT GCCACGACTG GCTTAGGCTT ATTTGCCCCT GAATTGTTGC GACTGTTGAC CACAGGCGAT TATCTGCCCG CAGCGCCAGC GGTTGGTTGG TATGCGCTGA ATTTAATGGC GATTGGCTTG TTGACGGTGG CGGCAACCGG CCTGACGATC ACCAAAGCGA CCGCTGTTTT AGGCCAAACC AGTTTATTGA CCGCTGGGTT AAATGTTGGC TTAGCGATTG TGCTTGTGCC ATGGCTGGGC TTGGTTGGGG CGGCAATTGC GGCGGCAGGC GATCAACTGA TCGCGGCGTG GTTGGTGTAT CGCGCGGCTC AAAAGCGCTA TCCGATTGAT TTTGATGGTA AGGCGGTGAT CGGTTGGCTG AGCCTGACGA TTGCCTGCGT GGCCTTGGCT AGTTGGTTGC CATTGACCTT GAGCTGGCCA TTAATTGGGC TGAAATTGCT GATTGTTGGC GTGTGGTTGG GCTGCGTTTG GCGTTGGGGT CAGCCCAAAA TGCTGCTCTC AGTACTTAAA CGATAA
|
Protein sequence | MSRRLLKDTA LYALTGVATK MIGALLLPLI TRLLSPELYG SVDLISLVGL FAIELVTLGS DFALALYFHE ATIERRRLVG SLWVARLLLG LLIALIGHLL APYLAWQLLQ RSDAQTILAL RLGLWGQLAN GIIGLWFTTL RQESKALRLF GLTVLRVAST ALLTIGWMLQ SSQRLSAYFG AMLLVDSLLA LGLTLQMRRQ LGWPDWRLLK TLLGKGLGFL PRSIYFVAMT LINRQILLHF GSLEQIGQYA AATKISFIVW IVISAANQAW LPYSLSIAST PTANANYRQY LTSYTMLMGA ATTGLGLFAP ELLRLLTTGD YLPAAPAVGW YALNLMAIGL LTVAATGLTI TKATAVLGQT SLLTAGLNVG LAIVLVPWLG LVGAAIAAAG DQLIAAWLVY RAAQKRYPID FDGKAVIGWL SLTIACVALA SWLPLTLSWP LIGLKLLIVG VWLGCVWRWG QPKMLLSVLK R
|
| |