Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1558 |
Symbol | |
ID | 4710777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1693726 |
End bp | 1695144 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639856022 |
Product | O-antigen polymerase |
Protein accession | YP_001003124 |
Protein GI | 121998337 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | [TIGR03097] probable O-glycosylation ligase, exosortase system type 1-associated |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.238347 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCTTC GTGACATCGT CTTCGGCACC ATCCTGATCT CCAGCCTGCC GCTGATCTTA TACAGGCCCT GGATCGGGCC GCTGATGTGG TACTGGGTGG GCTTTTTTAA TCCGCACCAG CAGGCGTGGG GGTTCTTTGC CGGTGCTCAG GTCGCACTCC CCGTCGCTAT CGCGACCCTC GCAGCCACCG CCTATACCCA GGAGAAGCGC TGGCCACCGA TGACTCGGGA GATGTGGCTG GTCTTCCTTC AGGTCATCCT GTTCACGGTG ATTACGTTCG GCTTCGCCTG GCTCCCGGAT GACGCGTTCG GGCTCTGGGA TCAGCGGATG CGCATCATCT TGATGACCGT CATCACGGTA ATACTGATCT ACGGGAAGCA GCGCGTCATG GCGCTGCTGG CCATGATCAC CCTGTCCATC GCCTACTTCG GCTTCAAGGG CGGCCCCTAT ACGCTCAGCA CCGGATTCGG GGGGATGGTA CTCGGGCCAC AGGGAACGTT CATCGGCGGC AATACCGATA TCGGCCTGGC TCTGGTGATG ATCCTGCCTC TCACCTTAAT CCTCGCCCGC CAGGTCTATC ACGGGCGTTT CGAACTCCCG ATCCGCATCC CCGGCTTCGA AACCTGGCAC CGGCTGATTG GGCTCGCCCT CTACGGCGGC TTCTGGATGA CCCTGATCTC GATCATTGGG ACCCAGTCCC GCGGTGCCTG GGTAGCCCTG GCATGCACCT GGCCGTTCAT CTTCTGGCGC CTGCGCTTCA AGTGGGCCCT GGTCGCCGCC GTCGTCCTCG CGGTTGGGGT CATCGGAGTC ACGGTCCCGG ACCGCGTGGC CCACGAGTGG CAGACCCTTG TCGAATACGA GGACGACGGG TCCGCACAGG GCCGATTCCA CGCGTGGGAT GTGGCTTGGA ACATTGGGGT GGAGCACCCG CTGACCGGTG CCGGCTTTGG TGCCCAACGC ATCGACGCCG AGCTATGGCG CTCCTACAGT AGCGATGGTG ATGGCAGCCC GCTCGCACAG CACAGCATCT ACTTCCAGAC GCTGGCCGAG AACGGATTCC TGGGGCTCGG GCTGTTCCTG GCACTACTTG GCTTTACGCT GCTCACCCTG AACCGTCTGC GTCGCGACGC CGCTCAGCAC CCGGATACGC TCTGGATCAG CGAGTGGTCG TGGGCCCTCG CCATCGGCCT GATCGGTTAC TGCGTCGCGG GGGCCTTCTT GAGCCTCGCG TACTTCGACC TGATGTACGC CTTTATCGCC CTAGCCATCA TCCTGCGCCG AGAATTTGAG GATGTCCGGG TCGCGGTACG GTACCCCAGC CCGACCACCG CCACAGCACC ACAACAAACG CCGGTGGGCG AAGTCGGCTA TCGTCCGGGT ACTCCGCCGC GTGCCCTCTA TCGACGCCCC CCTGCATAA
|
Protein sequence | MDLRDIVFGT ILISSLPLIL YRPWIGPLMW YWVGFFNPHQ QAWGFFAGAQ VALPVAIATL AATAYTQEKR WPPMTREMWL VFLQVILFTV ITFGFAWLPD DAFGLWDQRM RIILMTVITV ILIYGKQRVM ALLAMITLSI AYFGFKGGPY TLSTGFGGMV LGPQGTFIGG NTDIGLALVM ILPLTLILAR QVYHGRFELP IRIPGFETWH RLIGLALYGG FWMTLISIIG TQSRGAWVAL ACTWPFIFWR LRFKWALVAA VVLAVGVIGV TVPDRVAHEW QTLVEYEDDG SAQGRFHAWD VAWNIGVEHP LTGAGFGAQR IDAELWRSYS SDGDGSPLAQ HSIYFQTLAE NGFLGLGLFL ALLGFTLLTL NRLRRDAAQH PDTLWISEWS WALAIGLIGY CVAGAFLSLA YFDLMYAFIA LAIILRREFE DVRVAVRYPS PTTATAPQQT PVGEVGYRPG TPPRALYRRP PA
|
| |