Gene Information |
Locus tag | Haur_3579 |
Symbol | |
ID | 5735440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4499827 |
End bp | 4501332 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280728 |
Product | O-antigen polymerase |
Protein accession | YP_001546343 |
Protein GI | 159900096 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
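
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4499827, 4501332)
print(length)                     # 1506
print(protein_length_aa(length))  # 501
```

Both values match the Gene Length (1506 bp) and Protein Length (501 aa) fields of this record.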
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTGGCCA ACACACCGCA AGAAGAGCCA ACCAGCCAAC CAGATGTTTT TAACAGCAAA TGGATGAAAT TTGCCTTTGC TTTTTTAGCG ATCGCTGGGG GCGCTGGGGT TGCCGCCGCC TTGGCATTTT TCGATAACCC CTTGAAATTA GTGCTGCTGT TTGGTGGCGC TGGTGCAGCC ATGGTCACAA TGCGCAATAG CGAATGGGGC TTGCTGGCAC TGGTGTTCAT GAGCTACACC CGTTTTTCCG ATGTGATGGT GCGCAACGGA GCGCCCTCAA CCGCCCAACC ATTTTTAGCG TTGCTCTTTT TAATTATTTT CTTGCGCTGG GCACTCTACA ACCAAAAACC CGAGCCTTGG CTCAAGCCCG CCGCCTGGAT CTTTGTCTAT GGCATGGTTG GGGTCGCCAC ATTTCTCTAT GCTGATGATG TCATGCGAGT TAAGAATGGG GTCATCACCT ACTTCAAAGA TGCGATTATC GTGATCGTGG TGGTGATGAT GATGCGTTCG CCCAAGATGC TGCATCGTTC GATGTGGGCG CTCTTGTTCG CTGGCATTTT TATGGCTTCG ATTACCACCT GGCAGCAATT AACTGGTACG TTCGAGAACG ATTATTTGGG CTTTGCCAAA GCTGGCAAAA TGCAAATCGT CTCAGGCGTT GAGGATGATT ATCGGATTGC TGGGCCAATC GGCGACCCTA ATTTCTATTC ACAAGTGCTC TTGACCCTAA TTCCACTGGG CATGGATCGC ATGTGGAATG AGAAGAACAA AAAATTACGC TGGTTTGCAA TTTGGCAATT GAGCGTTTGT ATGGCCTCAA TTTTCTTCTC GTTCAGTCGT GGGGCATTTC TCTCGCTCTC GATTGCCAGC TTAATTATGT TTGTGCGCCG ACCACCCAAG CCGCTTTCGG TGATCATTAT CATCGCTTTG GGTTTTGTAA TCATCCCGAC CTTGCCAGCT TCGTATATCG CGCGGCTCGA AACGATTCCC GAGGCAATTC CCGGCTTAGC TCAAGAAGAT GTGCGCAACG AGGCTTCGTT CCGTGGCCGT TCGAGCGCCC AACAAGCGGG TTTACGCATG TTCTGGGCTA ACCCAGTTTT TGGCTTAGGT GTGGGCAATT TTGGCAATCA CTATCAAGAA TATGCCCGTG ATCTAGGACT TGATAACAGC CGTTGGGACC AAGCGCCGCA CAACATGTAC CTTGAAATTC TGACCGAAAA AGGCTTATTT GGGCTTTCGG TCTTTAGCGC AATGATGTGG GTGCTGTTCC GCGATATGAA CCGAGCACGT AAAAAGTTTC GCGAAATCAA TATGGGCGAT TTCGATGGTC TGATCTTTGG TTTCCAGGCT GGGTTGGTTG GCTATATGTT TGCCGGGATC TTCCTGCAAC TATCCTACCC ACGCTTTTTC TGGATTTTGA TCGCCATCGC CTATGCAATT CCCAATGTTG CCAATAAAGC TTATGAAGAG TATCGCGAGG CGCTACCAAA TGGCGAAACA GCCTGA
Protein sequence | MLANTPQEEP TSQPDVFNSK WMKFAFAFLA IAGGAGVAAA LAFFDNPLKL VLLFGGAGAA MVTMRNSEWG LLALVFMSYT RFSDVMVRNG APSTAQPFLA LLFLIIFLRW ALYNQKPEPW LKPAAWIFVY GMVGVATFLY ADDVMRVKNG VITYFKDAII VIVVVMMMRS PKMLHRSMWA LLFAGIFMAS ITTWQQLTGT FENDYLGFAK AGKMQIVSGV EDDYRIAGPI GDPNFYSQVL LTLIPLGMDR MWNEKNKKLR WFAIWQLSVC MASIFFSFSR GAFLSLSIAS LIMFVRRPPK PLSVIIIIAL GFVIIPTLPA SYIARLETIP EAIPGLAQED VRNEASFRGR SSAQQAGLRM FWANPVFGLG VGNFGNHYQE YARDLGLDNS RWDQAPHNMY LEILTEKGLF GLSVFSAMMW VLFRDMNRAR KKFREINMGD FDGLIFGFQA GLVGYMFAGI FLQLSYPRFF WILIAIAYAI PNVANKAYEE YREALPNGET A