Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4125 |
Symbol | |
ID | 5735986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5272330 |
End bp | 5273958 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281279 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_001546885 |
Protein GI | 159900638 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCACTAC AATTCCTAAG CCTACCAACA GTTTCATTGC GAAGCTGGGT TTTGGTAGGC TTATTCAGTT TGGGGCTGGT AGGCTTATTG GGCTTAACCT TGGCAGGCAT TCGCCGTTGG AGCAAGCCAC GCCAAGCAAC TGACGACCAT TTGAGCACGA TTGGCCGCAA CACTAGTATT CCCTTTGCTC TACAAATGGC CAGTCGCATG CTTGATTTGG TCTTTGCGAT GATTCTTTAT CGCTTTTTGG CTGCCGAAAC CGTTGGAGCC TACGACTTTG CGGCAGTGAT TGTCGTCAAT TATTTTGGCA CAATTGCCGA TTGGGGCTTA ACGGTTTTGG CAACGCACGA AATTGTGCGC CAGCCAAGCC AAGCGCCCCA AACATTTCGC ACAACGCTCT GGCTCCGTTT GCGTTTTGCA ATTTTAGCCT TGCCAATTGC CGTGATTTTT GTGCTGATCT ACAACGGCTT GGCGCAGGCT GAGATTACGG CGGTTGGCCT GACCAGTCAG CAAATTACGG TGATCACAAT TTTGATGCTG ACCTTGTTTC CGGCGGCACT TTCGGCCAGC GTCACCGCTT GGTTGCAAGG CCACGAGCGC TTGGTCGCGG CTGCGGTCGT CAATCTTTTG ACCAATATTG GGAGTGCAGC ATTTCGCTTA ACTGCCTTGA TTTTGGGCTT TGGCATTATT GGGATTGCTA GCGGAGCCTT GGCGGGAGCA TTGCTCAGCG CCCTCTTATT TTGGCTGGCG ATGCGGCGTT TCTTCCCCGA AGTAGCGTGG TTTGGCCCAA CCTTACCCGC CAAACCCTTG CTCAAAGAGG GCTACCCGCT CTTGCTCAAT AGTTTGTTGA TGACGATCTT TTTTCGTTTC GACACCATTT TGTTGAGCGC CTTCCACGGC TTTGTGGTCT CGGCAACCTA TGGCGTAGCC TATAAACTGA TTAATTTCAC CCAAATTGTG CCGCCAATTG TGGTTAACGC GATTTTCCCG ACGCTAATTC GCCGTTCCGG CGATGATCGA GCTGGAATGA GTCGGGCTTA TGCTGGCACA TTGCGTATGT TGCTGAATTT AGCGTTTGGC ATCGCCGTTG TGGCTACAAT TATCGCTGTG CCACTAACCA CATGGCTCGC CGATCGGCCT GAATATTTGC CAGGCAGCGT CTATGCCTTG ATGATTACGA TTTGGTATTT ACCAGGCAGC TATCTGAATG GCCTGACTCA ATATGTGATT ATCGCGCTGG GCAAGAAACA GGCAATTACT AAGGCTTTTG GTTTAACTGC AATGGTCAAT TTGGGCTTGA ATATTTGGTT GATTCCACGC TATAGCTATT TTGCCGCCGC CGCAATCACG ATTGTTTCTG AGCTTGTGTT ATTTTTGCCG CTCTGGCTGG TACTACGCCG CGAACAGATT AACATCAACT TGGCGAGTTT ATTTTGGCGG CCTGCGCTGG CAGCATTGCT GGCTGGTGGT ATCGGCTGGT TGTTGCTCAG CATCAATGTG TATTTGGCAG GAGTCGTAAC CGGATTAATC TATGGCGCTG GCTTATGGTT CAGCGGCAGC ATCGGCCAAA CAGAACGCGA ATTGGCTGCG CGGATGTTTA AAAAGTTACG CCCCCAAGCA TCAAGCTGA
|
Protein sequence | MALQFLSLPT VSLRSWVLVG LFSLGLVGLL GLTLAGIRRW SKPRQATDDH LSTIGRNTSI PFALQMASRM LDLVFAMILY RFLAAETVGA YDFAAVIVVN YFGTIADWGL TVLATHEIVR QPSQAPQTFR TTLWLRLRFA ILALPIAVIF VLIYNGLAQA EITAVGLTSQ QITVITILML TLFPAALSAS VTAWLQGHER LVAAAVVNLL TNIGSAAFRL TALILGFGII GIASGALAGA LLSALLFWLA MRRFFPEVAW FGPTLPAKPL LKEGYPLLLN SLLMTIFFRF DTILLSAFHG FVVSATYGVA YKLINFTQIV PPIVVNAIFP TLIRRSGDDR AGMSRAYAGT LRMLLNLAFG IAVVATIIAV PLTTWLADRP EYLPGSVYAL MITIWYLPGS YLNGLTQYVI IALGKKQAIT KAFGLTAMVN LGLNIWLIPR YSYFAAAAIT IVSELVLFLP LWLVLRREQI NINLASLFWR PALAALLAGG IGWLLLSINV YLAGVVTGLI YGAGLWFSGS IGQTERELAA RMFKKLRPQA SS
|
| |