Gene Information |
Locus tag | Avin_10940 |
Symbol | algK |
ID | 7760038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1045145 |
End bp | 1046518 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643803998 |
Product | Alginate biosynthetic protein AlgK |
Protein accession | YP_002798300 |
Protein GI | 226943227 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
| |
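The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical (not part of any database API), and it assumes the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon. Both assumptions agree with the values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1045145, 1046518)
print(length)                  # 1374
print(protein_length(length))  # 457
```

Because the gene is not spliced, a single subtraction over the replicon coordinates is enough. The strand only affects the reading direction, not the length.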
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.665621 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACCTGA CCAAGCCCCT GCTGCTCTCG GCGCTGGCCG GCCTGACCGC CTGCGCCAAC CTCGACCTGC CCGACCAGCG TCTGGCGAAG GAAGCCCTGC AACGCGGCGA CACCCAGACC GCCGAGCGGC ATTTCCGTCA ACTCGCCGAC ATGGGCTTCA CCGAGGCCCA GCTCGGCCTG GCCGACATGC AGTTGGCCAG CGGTGATCCC GAGCAGCTGC GCAAGGCGGA ACAGACCTAC CGCATGGCGC TGGACGCCTC GCCGCGGGCC AAGGCCCGCC TGGGCAAGCT GCTGGCCTAC AAGCCGACCA GCAGCGAGGC GGAAAAGCGC GAGGCCGCCC AGTTGCTCAG CGATGCCTTC GCCGCCGGCG AGGATGGCGT GCTGCTGCCC CTGGCGATGC TTTACTTGAA GAACCCGCAG ACGTTCCCCG ACGTCAGCCT GCAGCAACGC ATCGATCAGT GGCGCGCCGC CGGACATCCC CAGGCGGACA TCGCACAGAT CGTGGTCTAC CGCACCCAGG GCACCTACGA CCAGCACCTG GACGACATCG AGCGGATCTG CCAGCAGCGC CTCGCCGAGC ACAGCGACTG CTACGTCGAG CTAGCCACCG TCTATCTCAA GCGCGACCAG AACGACCGCC TGCAGGCGCT GGTGCAACAG CTCATGGCCG CTCATCGCGC GGGCGGAGTG TCAGCGCAAC TGGTGACGGA AGTCGCCGGC GTGCTCTCCA ACCCGCTGCT CGGCCAGTCG AACGAAAAGA CCGCCCAGAC CATGCTCGAG GAAATCGCCC CGACCTACCC CGCCGCCTGG GTCAGTCTGG CGCGTCTGAT CTACGACTTT CCCGGCACCG GCGACACCGA CCAGATGCTC GACTACCTCG AGCGTGGCCG TGCCGCCGCC CAGCCGGCAG CCGATCTGTT GCTCGGTCGG CTCTACTACG AAGGCAAGCT GCTTCCACAG GACCCGTTCA AGGCCGAGGA ATATTTCATC AAGGCCCGCG CCACGGAAAA CAGCGCCCAC TACTACCTGG GTCAGATCTA TCGCCGCGGC TTCCTCGGCG AGGTCTACCC GCAGAAGGCC GTCGACAGCC TGCTGACCGC CGCCCGCGGC GGCCAGGCGA GCGCCGACTA CGCCCTCGCG CAGCTGTACT CCCAGGGTCG CGGCATCCGC ATCGACCTGG CCAACGCCTA CGTCTTCGCA CGTCTCGCCG TTTTGCAAGG CCGCCCCGAT TCCGAGCCCC TGCTTCAGGA AATAGAGGCT AATCTCGCTC CTGCAGAACG TACCCGGGGT GAGCAGATGC TGCACGCGGA ACAGCAGGCT CGTTACGGCG TGTGGCAGAC CTCGACGCAG CTGCAAGCCA TGCAAAATCA ATAG
|
Protein sequence | MNLTKPLLLS ALAGLTACAN LDLPDQRLAK EALQRGDTQT AERHFRQLAD MGFTEAQLGL ADMQLASGDP EQLRKAEQTY RMALDASPRA KARLGKLLAY KPTSSEAEKR EAAQLLSDAF AAGEDGVLLP LAMLYLKNPQ TFPDVSLQQR IDQWRAAGHP QADIAQIVVY RTQGTYDQHL DDIERICQQR LAEHSDCYVE LATVYLKRDQ NDRLQALVQQ LMAAHRAGGV SAQLVTEVAG VLSNPLLGQS NEKTAQTMLE EIAPTYPAAW VSLARLIYDF PGTGDTDQML DYLERGRAAA QPAADLLLGR LYYEGKLLPQ DPFKAEEYFI KARATENSAH YYLGQIYRRG FLGEVYPQKA VDSLLTAARG GQASADYALA QLYSQGRGIR IDLANAYVFA RLAVLQGRPD SEPLLQEIEA NLAPAERTRG EQMLHAEQQA RYGVWQTSTQ LQAMQNQ
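The gene sequence begins with GTG, yet the protein begins with M. This is expected under translation table 11, where GTG is one of the alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch of that behaviour, not the pipeline that produced this record: `translate_cds` is a hypothetical helper, the codon table is built from the standard amino-acid ordering that table 11 shares with the standard code, and the start-codon set is the one listed for table 11.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
# Table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading the initiator codon as M."""
    seq = "".join(seq.split()).upper()  # drop the 10-base grouping spaces
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate_cds("GTGAACCTGA CCAAGCCCCT GCTGCTCTCG"))  # MNLTKPLLLS
```

The sequence is stored in the coding orientation, so no reverse complement is needed here even though the gene lies on the minus strand of the replicon.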