Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_44840 |
Symbol | waaG |
ID | 7763355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4539990 |
End bp | 4541114 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807335 |
Product | Lipopolysaccharide core biosynthesis protein |
Protein accession | YP_002801576 |
Protein GI | 226946503 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTGG CCTTCATCCT CTACAAGTAC TTTCCCTTCG GCGGGCTGCA GCGCGACTTC ATGCGCATCG CCCTGGAATG CCAGAAGCGC GGACACGCCA TCCGTGTCTA CAGCATGTCC TGGGAGGGCG AGACCCCGCC GGGCTTCGAG GTGCTGATCG CGCCGATCAG GGCGTTGTTC AACCACCGCC GCAACGAGAA GTTCACCGCC TGGGTGCGGG CCGACCTGGC GCGGCGTCCC GTGGACCGGG TGGTCGGCTT CAACAAGATG CCCGGGCTGG ACGTCTACTA CGCCGCCGAC CCCTGCTACG AGGACAAGGC GCAGACCCTG CGCAACCCGC TCTATCGCCT GTGGGGGCGC TACCGGCATT TCGCCGGCTA CGAGCGCGCG GTGTTCGCGC CGCAGGCGAA GACACGCATA TTGATGATCT CCGAAGTGCA GCAACCATTG TTCGTCAAGC ATTACGGCAC GCCGGCGGAG CGTTTCCACC TGCTGCCTCC GGGCATCTCC GCCGACCGCC GCGCGCCGCC CGATGCGGAC GTGATCCGCG CCGACTTCCG CCGCGAGTTC GGCCTGGCCG GGGACGACCT GCTGCTGGTG CAGATCGGCT CCGGCTTCAA GACCAAGGGG CTGGACCGCA GCCTCAAGGC GCTCGCCGCG CTGCCCGGCG CCCTGAAGAA GCGCACCCGG CTGATCGCCA TCGGCCAGGA CGATCCGCGC CCCTTCCAGT TGCAGATCAA GGCGCTCGGC CTGTCCGGGC GGGTCGAGAT TCTCAAGGGG CGCAGCGACA TCCCGCGCTT TCTGCTCGGC GCCGATCTGC TGATCCACCC GGCCTACAAC GAGAACACCG GTACCGTGCT GCTCGAGGCG CTGGTCGCCG GACTGCCGGT GCTGGTCACC GATGTCTGCG GCTATGCCCA CTACATCGCC GAGGCGGGTT GCGGGCAGGT GCTGCCCAGC CCCTTCGAGC AGGAGCGCCT GAACCGGACG CTCGCCGCGA TGCTCGAAGA CGATGGGCAG CGAGCGCTCT ATCGGCGCAA CGGCCTGGCC TACGCCGGAA CCGCCGACCT CTATTCCATG CCGCAGCGGG CCGCCGATCT GATCCTGACG GAGCAGGGCG CGTGA
|
Protein sequence | MQLAFILYKY FPFGGLQRDF MRIALECQKR GHAIRVYSMS WEGETPPGFE VLIAPIRALF NHRRNEKFTA WVRADLARRP VDRVVGFNKM PGLDVYYAAD PCYEDKAQTL RNPLYRLWGR YRHFAGYERA VFAPQAKTRI LMISEVQQPL FVKHYGTPAE RFHLLPPGIS ADRRAPPDAD VIRADFRREF GLAGDDLLLV QIGSGFKTKG LDRSLKALAA LPGALKKRTR LIAIGQDDPR PFQLQIKALG LSGRVEILKG RSDIPRFLLG ADLLIHPAYN ENTGTVLLEA LVAGLPVLVT DVCGYAHYIA EAGCGQVLPS PFEQERLNRT LAAMLEDDGQ RALYRRNGLA YAGTADLYSM PQRAADLILT EQGA
|
| |