Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4842 |
Symbol | |
ID | 3679340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 6094586 |
End bp | 6095887 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637720199 |
Product | O-antigen polymerase |
Protein accession | YP_325334 |
Protein GI | 75911038 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000342709 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0564871 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT ATCTACTACT TGCAGAAAAA GCTTTTACAG TTCTGTCTCT ACTCCATTAT TCAGGCGCTC CTCTAGTTGT GATTCTGTCA GGGGGAGCAA GTGAAGGTGA TGATTCAATA GCGGCTCCTG ATTTTGCCTT AATTCAATTC ATATTTTTAA TAATTTACTT CATTACCTTT GCCTTACTCG TCCTGCGTTG GAAAAAAGTA ATTCAATTAA TTAATAAAGA TTGGTATATA TGGCTTTTAC TCCTAGTAGC TATATTTTCC ATATTTTGGT CTGATGTACC TGCAATGACA CAAAGTCGCG TCATAGCTCT ATCAGGAACA CTTCTATTTC CTCTTTACTT AGCCAGTCGC TACACTTTAA AAGAACAATT ACATTTACTA GCATGGACTT TCGGCATAGC GATCGTAGGT AGCTTTTTAT TCGCTTTCGG CCTAAGAAAT TATGGTCGTA TGGCTGGAGT GCATTTTGGT ACATGGCGAG GAATATATAA CCATAAAAAC GTTCTTGGTA AAGTCATGGC TCCCAGTGCC ATGGTGTTCT TACTTCTGGC TCTTCAACCA GAAAAAAAGC GTTGGCTATT TTGGGGAGGA TTCATAGCAT CGATCTGGCT AATTATTGCT TCAAAAGCAT CATCACCTCT TATCAATGTT ATTACTTTAA GTCTTTCATT ATTTTTATTT CGGATTTTAA GGTGGCGATA TAACTTTATG ATCCCCGCGT TAGTGGGAAT CACAACATTA AGTACCATTG CATATATATT AATGACTACT AATGCTGAAG CTATAGCAGC ATTATTAGGC AAAGATTTAA CACTCACTGG ACGAACTAAT TTTTGGCCTT TGATCATCGA TAAAATAGCT GAACGTCCTT GGTTTGGTTA TGGATATGGT GCATTCTGGC TGGGCTGGAC TGGTCCTTCC GCCGATATTT GGTACTCCTC TGGCTGGAAA CCACCAAATA GCCACAATGG TTACTTAGAC CTTTGCCTTG AATTGGGATT AGTAGGCTTA TCACTATATG TAATTGATTA TTTACAAGGT CTACTCAAGG CATTAGCTTA TGTGAGGTCA GTTAAAACAT CAGATGGATT TTGGCCAGGA GTTTTTCTTA TGTATGTTGT GTTGTCAAAT CTCACAGAAA GTACATTACT TATCCAAAAT AATTTCTTTT TTGTGATTCA AATTTCTATA TTACTATCAC TACGAATATG TGAAGAACAA AAAACAACTC ATTCCATGAT CAAGCATAAA AAGAAGATTA ATTATTCACA GAAAACCTAC AAAATACCGT AA
|
Protein sequence | MKKYLLLAEK AFTVLSLLHY SGAPLVVILS GGASEGDDSI AAPDFALIQF IFLIIYFITF ALLVLRWKKV IQLINKDWYI WLLLLVAIFS IFWSDVPAMT QSRVIALSGT LLFPLYLASR YTLKEQLHLL AWTFGIAIVG SFLFAFGLRN YGRMAGVHFG TWRGIYNHKN VLGKVMAPSA MVFLLLALQP EKKRWLFWGG FIASIWLIIA SKASSPLINV ITLSLSLFLF RILRWRYNFM IPALVGITTL STIAYILMTT NAEAIAALLG KDLTLTGRTN FWPLIIDKIA ERPWFGYGYG AFWLGWTGPS ADIWYSSGWK PPNSHNGYLD LCLELGLVGL SLYVIDYLQG LLKALAYVRS VKTSDGFWPG VFLMYVVLSN LTESTLLIQN NFFFVIQISI LLSLRICEEQ KTTHSMIKHK KKINYSQKTY KIP
|
| |