Gene Information |
Locus tag | Dshi_1897 |
Symbol | algA |
ID | 5712890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1978068 |
End bp | 1979498 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641267822 |
Product | alginate biosynthesis protein |
Protein accession | YP_001533240 |
Protein GI | 159044446 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0662] Mannose-6-phosphate isomerase; [COG0836] Mannose-1-phosphate guanylyltransferase |
TIGRFAM ID | [TIGR01479] mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase |
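The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1978068
END_BP = 1979498
GENE_LENGTH_BP = 1431
PROTEIN_LENGTH_AA = 476


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Both checks pass for the values listed in this record.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, the reported 1431 bp and 476 aa are consistent with the listed coordinates.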
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0297962 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATCC CCGTCATCCT CGCCGGAGGC TCCGGCAGCC GCCTGTGGCC TGCCTCGCGC AAGAGCTACC CCAAGCAATT CACCGAGCTG GTCGGTGCCC GCAGCCTGTT CCAGGATACG CTTGCCCGGC TGCAGGGCCC GCACTTCGCC GCGCCGACCA TCATCACCGG CGACGATTTC CGCTTCATCA CCGCCGAGCA GTTGGACGAT GCCGGGGTTA CCGGCGCCGA CATCCTGCTG GAGCCTGCGG GCCGCAACAC CGCGCCGGCG ATCCTCGCGG CTGCCCTGCG CCACGAGGCA ACGCCGGACG CGGTCCTGCT GGTCTCCCCC TCGGATCACC GGATCGCTGA CGGCGCCGCG TTTCTCGATG CGGTCGCCGC AGGGAAAGCG GCGGCAGAGG AGGGGCATCT GGTGACTTTC GGCGTCACGC CCATCGCGGC CGAGACCGGT TACGGGTATC TCGAGTTGTC GGGCACGCCG GTGCCCGGTC AGCCGCAAGT TCTCAAGTCC TTTGTGGAAA AGCCTGACGC GGCGCAGGCG GCGCAGCTTC TGGCGGCGGG GCGGCACCTT TGGAACGCGG GCATCTTCAT GTTCAAGGTT GGGACCATAA TCGATGCGTT TGAACGCCTG GCCCCGCGGT TGGTGATGCC TGTGCGCGCG GCGATGGCCG CGGGCGAGGA TGACTTGTGC TTCTATCGCC TCGGTGCGCA GGCCTACGCG CGCTGCGAGG ACATTTCCAT CGACTACGCG ATCATGGAGG CGGCCGAGGC CCTGCGGGTG ATCCCGGTTT CCTGCGGCTG GACCGACCTG GGCTCCTGGC GGTCGGTGCA CGGCGCGTCG GACCAGGACA CAGAGGGTAA CACGGTGCAG GGCAGCGCCT TGCAGATCGA CTGCCGCAAC AGTCTGCTCA AATCGACCGC ACCCGGCACC AGACTGGTCG GGCTGGGGCT GCAGAACATC GCCGCCATCG CCACCGATGA TGCGATCCTC GTGGCCAATC TCGACGACTC CGAGCGGGTG AAAGAGGTGG TCGCCGCCCT CAAGGTGCAG GGCGCCTCCC AGGCCGAGAG CTTCCGCCGC TGTCACCGGC CCTGGGGCTA TTTCGAGACG CTGTCCCTGG GCGAGCGGTT CCAGGTCAAG CGCATCATGG TCAAGCCGGG GGCCGCGCTC AGTCTGCAAA GCCATTTTCA CCGGGCGGAG CATTGGGTCG TTGTCGAAGG CAGCGCCCAT GTGACCGTGG ACCGCGATGT CTCGCTAATC AGCGAGAACC AGTCGGTCTA CATCCCGCTC GGCGCGGTCC ACCGGCTGGA GAATCGCGGA AAGGTGCCGC TGAACCTGAT CGAGGTGCAG TCGGGGGCGT ATCTTGGCGA GGACGACATC GTCCGCTACG AGGATGTTTA CGCCCGCGCC CCCAAGCAAA ACGTCGCCTG A
|
Protein sequence | MIIPVILAGG SGSRLWPASR KSYPKQFTEL VGARSLFQDT LARLQGPHFA APTIITGDDF RFITAEQLDD AGVTGADILL EPAGRNTAPA ILAAALRHEA TPDAVLLVSP SDHRIADGAA FLDAVAAGKA AAEEGHLVTF GVTPIAAETG YGYLELSGTP VPGQPQVLKS FVEKPDAAQA AQLLAAGRHL WNAGIFMFKV GTIIDAFERL APRLVMPVRA AMAAGEDDLC FYRLGAQAYA RCEDISIDYA IMEAAEALRV IPVSCGWTDL GSWRSVHGAS DQDTEGNTVQ GSALQIDCRN SLLKSTAPGT RLVGLGLQNI AAIATDDAIL VANLDDSERV KEVVAALKVQ GASQAESFRR CHRPWGYFET LSLGERFQVK RIMVKPGAAL SLQSHFHRAE HWVVVEGSAH VTVDRDVSLI SENQSVYIPL GAVHRLENRG KVPLNLIEVQ SGAYLGEDDI VRYEDVYARA PKQNVA
|
| |
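The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to normalize such a block-formatted sequence and compute its GC fraction. The helper names (`clean_sequence`, `gc_fraction`, `reverse_complement`) are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record. Because the record lists the strand as `-` while the displayed gene sequence begins with ATG, the sequence appears to be given in coding orientation; `reverse_complement` would be the step needed to map it back to the forward strand of the replicon, under that assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Complement table for unambiguous DNA bases only (an assumption of this sketch).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a cleaned DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two ten-base blocks of the gene sequence in this record.
prefix = clean_sequence("ATGATTATCC CCGTCATCCT")
print(len(prefix))                     # number of bases after cleaning
print(gc_fraction(prefix))             # GC fraction of this short prefix only
print(reverse_complement(prefix[:10])) # forward-strand view of the first block
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field, which the record reports rounded to a whole percent.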