Gene Information |
Locus tag | Haur_3355 |
Symbol | |
ID | 5735225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4232558 |
End bp | 4233829 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280502 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_001546119 |
Protein GI | 159899872 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | |
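The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a gene that ends in a single stop codon; the function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length = gene_length(4232558, 4233829)
print(length)                           # 1272
print(expected_protein_length(length))  # 423
```

Both results agree with the Gene Length (1272 bp) and Protein Length (423 aa) fields of this record.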
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAGG AATCTACGCC AACGCAAGCC AGTGTGGCCG AAGCCCCTGC CCCTGCCAAA TTGCCACGCG CTCGGCTTTT TCAGATCAGC TTTGGGGTTG GCGGGGTCAT GCTGGCGCGG GTGTTGGGGA TTTTCACCCA AATTATTTTG GCACGACGCT TGGGTGCTGA GCAATATGGC TTGTTTAACT CGCTGTATGC CTTGCTCAAC CCACTGATTA TTTTAGCCAA CCTAGGCTTG GATAATTGGC TATTGCGTCA AAGCACGGAT CGTACAAAGC TCAACAAATC GGTAAGTCGG GTGTTTGCCT TACGGGGGTT TGTGCTGGCA GGCCTGTTAT TGGCCGCTGC ACCATTTGTC AGTATGCGCA GCCCTGAATT TACCCCGCAC CTCATTGCCT TGGCTTGTGC CGGAATTATT GGCGATTTAT TGGTAGGCAC CGCTGATACC GCCTTGCGAG CCTCAATTCA AGTGCTCAAA GCCTCAAGTT TGCAAATTGT GGTTGCGCTA AGTTTGTTTG TGCCGATTTG GTTTACCCCC AACTTGCCAT TTTCCACCGT GGTGATCTAT CGTTGTATCG CGGTAGCCTT GGGCTTTGGC CTGAGTGTCT ACTACTTGCA TGGCATTTTG CGCTGGGTTT GGCAACCACG CCAATGGATC AATACAATTA GCGAGGCGCG TTTCTACTTT ATCTCCGAGG TGATGGCCTA TTTGACCTTG CGGGTTGATC TGTTTTTGGT GGCTTTGCTG CTACCAACGA TCAATTCAGG GATTTATGCG CCACCACTAG CAATTATCAA TAATACCTTT TTAGTGCCCA GTATTACCGC CCAAATCTTG CTACCGATGA TCGTCAAGCA TCCGCCGCGC TCACCACTCT ATCGCCGCGT GATTAGCTTG GCAATTGGCT TGAGCATTCT CTACGGCTTG GCATGGTTGG CGCTGCTCAC CTGGCATTCC GATTGGATTA TCAATCTGAT GTATGGGCCA CAATATGCCG ATGCCATCCC GTTGTTGCAA ATCATGGCCA TCATTCCATT GCTCAAAAGC CTGAATTTCT GCTGGGCGAC CTTTATGGTT TCCAACGATC AGCAGCCGTT ACGAGTTAAA TTGCAAACCA TCGGGGCCAC AATTAGCATT CTCGGCAATT TTGTGGTGAT TCCGTTGTAT GGTCTGCATG GCGTAGCGAT TATTAACATG ATCACCGAAA TTGCCCTGTT TATCTCGTAT GGTTATGGAG CTTGGCTGAC CTATAAGAAA GTGATGCGAT GA
|
Protein sequence | MIKESTPTQA SVAEAPAPAK LPRARLFQIS FGVGGVMLAR VLGIFTQIIL ARRLGAEQYG LFNSLYALLN PLIILANLGL DNWLLRQSTD RTKLNKSVSR VFALRGFVLA GLLLAAAPFV SMRSPEFTPH LIALACAGII GDLLVGTADT ALRASIQVLK ASSLQIVVAL SLFVPIWFTP NLPFSTVVIY RCIAVALGFG LSVYYLHGIL RWVWQPRQWI NTISEARFYF ISEVMAYLTL RVDLFLVALL LPTINSGIYA PPLAIINNTF LVPSITAQIL LPMIVKHPPR SPLYRRVISL AIGLSILYGL AWLALLTWHS DWIINLMYGP QYADAIPLLQ IMAIIPLLKS LNFCWATFMV SNDQQPLRVK LQTIGATISI LGNFVVIPLY GLHGVAIINM ITEIALFISY GYGAWLTYKK VMR
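The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the method used to generate the record: it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in their permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical. Whitespace is stripped so that the 10-base blocks shown above can be pasted in directly.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest;
# the amino-acid string follows that same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace.

    With to_stop=True, translation ends at the first stop codon,
    which is not included in the result.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATTAAGG AATCTACGCC AACGCAAGCC"))  # MIKESTPTQA
```

The output matches the first ten residues of the protein sequence listed in this record.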