Gene Information |
Locus tag | Avin_29910 |
Symbol | wbpO |
ID | 7761892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3094696 |
End bp | 3095973 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643805864 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_002800132 |
Protein GI | 226945059 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase |
TIGRFAM ID | [TIGR03026] nucleotide sugar dehydrogenase |
|
|
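The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the Start bp and End bp values are 1-based and inclusive (which the reported Gene Length agrees with), and the function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table.
START_BP = 3094696
END_BP = 3095973


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1278 425
```

Both results match the Gene Length (1278 bp) and Protein Length (425 aa) fields reported in the table.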
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00762868 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATACTC TGGAAGACCT GAAACTCGCC ATCATCGGAC TCGGTTACGT CGGCCTGCCG CTCGCGGTCG AATTCGGCAA GCGCCGGCCG GTCGTCGGCT TCGACATCGA CCATGCGCGC ATCGCTGCCC TCGAGGCCGG GCACGACCGG ACCCTGGAAG TCGACGACCA CGAACTGCGG GAAGCGCGCC AGCTCCGCTA CAGCGCCGAT ATCGCCAGCC TGGCCGACTG CAACTTCTAC ATCGTCACGG TGCCGACGCC GATCGACTCG CACAAGAAAC CCGACTTGCT TCCCCTGATC CGCGCCTCGG AAACCATCGG CCAGGTGCTG AAGCGCGGCG ACATCGTCGT CTACGAGTCC ACCGTCTATC CCGGCGCCAC CGAGGAAGGC TGCGTTCCCG TGCTCGAACG GGTTTCCGGC CTGGCGTTCA ACCGCGACTT CTACGCCGGC TACAGCCCGG AACGGATCAA CCCCGGCGAC CGCGAGCACC GTATCACCGG CATCCGCAAG ATCACCTCCG GCTCCACCCC CGAGGTGGCC GAACTGGTCG ACGCGCTGTA CCGGGAAATC ATCGCCGCCG GCACCTACAA GGCGGACAGC ATCCGCATCG CCGAGGCGGC CAAGGTGATC GAGAACACCC AGCGCGACCT CAACATCGCC CTGGTCAACG AACTGGCGGT GATCTTCAAC CGCATGGGCA TCGACACCGA AGCGGTGCTG CAGGCGGCCG GCACCAAATG GAACTTCCTG CCGTTCCGCC CCGGCCTGGT CGGCGGCCAC TGCATCGGCG TCGATCCCTA CTACCTGACC CACAAGGCCG AGGCCATCGG CTACCACCCG GAAATCATCC TCGCCGGCCG GCGCCTGAAC GACGGCATGG GCGGCTACGT GGTCTCTCAA CTGATCAAGG CCATGCTCAG GCGGCGCATC CAGATCGACG GCGCCCGCGC CCTGGTCATG GGCCTGACCT TCAAGGAAAA CTGCCCGGAC CTGCGCAACA CGCGCGTCGT CGACATCATC CAGGAACTGC GCCAGTACAA CATCTCGGTC GATGTCTACG ACCCCTGGGT CGACGCCGAG GAAGCCCGGC GGGCCTACGA CATCAGGCCG CTGGACTCAC CGCCGGCCAG CGCCTACGAC GGCATCATCC TCGCCGTCGC CCACCACCAG TTCCGCAGCA TGGGCGCACC GGGCATCCGC CGTTTCGGCA AGCCCGGCCA CGTCCTCTAC GACCTCAAGT ACCTGCTCAA GCCCGACGAA GCGGATCTGC GCCTATGA
|
Protein sequence | MHTLEDLKLA IIGLGYVGLP LAVEFGKRRP VVGFDIDHAR IAALEAGHDR TLEVDDHELR EARQLRYSAD IASLADCNFY IVTVPTPIDS HKKPDLLPLI RASETIGQVL KRGDIVVYES TVYPGATEEG CVPVLERVSG LAFNRDFYAG YSPERINPGD REHRITGIRK ITSGSTPEVA ELVDALYREI IAAGTYKADS IRIAEAAKVI ENTQRDLNIA LVNELAVIFN RMGIDTEAVL QAAGTKWNFL PFRPGLVGGH CIGVDPYYLT HKAEAIGYHP EIILAGRRLN DGMGGYVVSQ LIKAMLRRRI QIDGARALVM GLTFKENCPD LRNTRVVDII QELRQYNISV DVYDPWVDAE EARRAYDIRP LDSPPASAYD GIILAVAHHQ FRSMGAPGIR RFGKPGHVLY DLKYLLKPDE ADLRL
|
| |
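The gene and protein sequences above are displayed in space-separated blocks of 10. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the coding sequence. It is an illustration under stated assumptions: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand using the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not apply here). Only the first 30 nucleotides of the gene sequence are used in the demonstration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw):
    """Remove the whitespace used for 10-character display grouping."""
    return "".join(raw.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three display blocks of the gene sequence shown above.
fragment = clean_sequence("ATGCATACTC TGGAAGACCT GAAACTCGCC")
print(translate(fragment))  # prints: MHTLEDLKLA
print(round(gc_fraction(fragment), 2))
```

The translated fragment matches the first block of the protein sequence. Applying the same functions to the full gene sequence would be the natural next step for checking it against the Protein sequence and GC content fields.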