Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_20590 |
Symbol | dxs-2 |
ID | 7760985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2048876 |
End bp | 2050777 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643804956 |
Product | 1-deoxy-D-xylulose-5-phosphate synthase |
Protein accession | YP_002799237 |
Protein GI | 226944164 |
COG category | [H] Coenzyme transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1154] Deoxyxylulose-5-phosphate synthase |
TIGRFAM ID | [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAAGA CGTTCCACGA GATTCCCCGC GAGCGCCCCG CCACCCCCCT GCTGGACCGC GCCGCGACGC CGGACCGGCT GCGCCAGCTC GGCGAGGCGG AACTCGAAGT ACTGGCCGAC GAACTGCGCC AGGAACTGCT CTACACCGTC GGGCAGACCG GCGGGCACTT CGGCGCCGGG CTCGGCGTGA TCGAGCTGAC CATCGCCCTG CACTACGTCT TCGACACCCC CGGGGACCGC CTGGTGTGGG ACGTCGGCCA CCAGGCCTAC CCGCACAAGA TTCTCACCGG CCGCCGCGAA CGCATGCTCA CGTTGCGCCA GAAGGACGGC ATCGCCGCCT TCCCGCGCCG CTCGGAAAGC CCCTACGACA CCTTCGGCGT CGGCCACTCC AGCACCTCCA TCGGCGCCGC GCTGGGCATG GCCATCGCCG CGCGGCTGAA GGGCGAGCGG CGCCGCTGCA TCGCCGTGAT CGGCGACGGC GCGCTGACCG CCGGGATGGC CTTCGAGGCG CTCAACCACG CCTCGGACGT GCAGGCCGAC ATGCTGGTGG TGCTCAACGA CAACGACATG TCGATCTCCA AGAACGTCGG CGGGCTGTCC AACTACCTGG CCAAGATCCT CTCCAGCCGC ACCTACGCGA GCATGCGCGA GGGCAGCAAG AACATCCTCT CGCGCCTGCC CGGCGCCTGG GAGATCGCCC GGCGCACCGA GGAATACGCC AAGGGCATGC TGGTGCCCGG CACCCTGTTC GAGGAGCTCG GCTGGAACTA CATCGGCCCG ATCGACGGCC ACGACCTGCC GACCCTGCTC GCCACCCTGC GCAACATGCG CGACCTCAAG GGGCCGCAGC TCCTCCACGT GGTGACCAAG AAGGGCAAGG GCTTCGCCCC GGCCGAGGCC GATCCCATCG GCTACCACGC GATCACCAAG CTGGAGCCCG AGGGCAGCGC CCCGCGCCAG CCCGGCCCTC CCCGATACTC CAGCGTCTTC GGCCGCTGGC TGTGCGACAT GGCCGCCGCC GATCCGCGCC TGGCCGGCAT CACCCCGGCG ATGAAGGAGG GCTCCGACCT GGTCGCCTTC AGCCAGCGCT TCCCGGAGCG CTACTTCGAC GTGGCCATCG CCGAGCAGCA CGCCGTGACC CTCGCCGCCG GCATGGCCTG CGACGGCCTC AAGCCGGTGG TGGCGATCTA CTCGACCTTC CTGCAGCGCG CCTACGACCA GTTGATCCAC GACGTCGCGG TGCAGAACCT CGACGTGCTG TTCGCCATCG ACCGCGCCGG CCTGGTCGGC GAGGACGGCC CGACCCACGC CGGCAGCTTC GACCTCTCCT ACCTGCGCTG CATCCCCGGC ATGCTGGTGA TGACCCCCAG CGACGAGAAC GAGCTGCGCC GGCTGCTCAC CACCGGCTAC CTGTTCGAAG GCCCGGCGGC GGTGCGCTAC CCGCGCGGCA GCGGACCGAA CGCGGCCCTC GAGCCGGGCC TGGAGCCCTT GCCGATCGGC AAGGGCGTGC TGCGCCGCCG GAGCGGGAAG AGCGACGGCC CGCGGGTCGC CCTGCTGGTG TTCGGCGTGC AGGTGGCCGA GGCGCTGCGG GTGGCCGGGA AGCTGGACGC CACGGTGGCC GACATGCGCT TCGTCAAGCC GCTGGACGAG GAACTGGTAC GCGAACTGGC CGCCGGCCAC GAGCTGCTGG TGACCGTCGA GGAGAACAGC ATCATGGGCG GCGCCGGCAG CGCGGTGGCG GAATACCTCG CCGAGGCCGG CCTCCTCAGG CCGCTCCTCC ACCTGGGCCT GCCGGACTAT TACGTGGAAC ACGCCCGGCC CGCGGAAATG CTCGCCGAAT GCGGCCTGGA CGCCGCTGGC ATCGAGGCGG CGGTGTGCGA ACGGCTGAAT ATGCTGAGCT GA
|
Protein sequence | MPKTFHEIPR ERPATPLLDR AATPDRLRQL GEAELEVLAD ELRQELLYTV GQTGGHFGAG LGVIELTIAL HYVFDTPGDR LVWDVGHQAY PHKILTGRRE RMLTLRQKDG IAAFPRRSES PYDTFGVGHS STSIGAALGM AIAARLKGER RRCIAVIGDG ALTAGMAFEA LNHASDVQAD MLVVLNDNDM SISKNVGGLS NYLAKILSSR TYASMREGSK NILSRLPGAW EIARRTEEYA KGMLVPGTLF EELGWNYIGP IDGHDLPTLL ATLRNMRDLK GPQLLHVVTK KGKGFAPAEA DPIGYHAITK LEPEGSAPRQ PGPPRYSSVF GRWLCDMAAA DPRLAGITPA MKEGSDLVAF SQRFPERYFD VAIAEQHAVT LAAGMACDGL KPVVAIYSTF LQRAYDQLIH DVAVQNLDVL FAIDRAGLVG EDGPTHAGSF DLSYLRCIPG MLVMTPSDEN ELRRLLTTGY LFEGPAAVRY PRGSGPNAAL EPGLEPLPIG KGVLRRRSGK SDGPRVALLV FGVQVAEALR VAGKLDATVA DMRFVKPLDE ELVRELAAGH ELLVTVEENS IMGGAGSAVA EYLAEAGLLR PLLHLGLPDY YVEHARPAEM LAECGLDAAG IEAAVCERLN MLS
|
| |