Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1288 |
Symbol | |
ID | 5898743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1353581 |
End bp | 1355317 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641561773 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001682916 |
Protein GI | 167645253 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0135195 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCAGG GAAACGGGGG CGGCAGGTTC CGCAGCGGTC ACAGCTACGG CAAGCTGGAT CGCGACGGGT TTATCCATCG CAGCTGGATG AAGAGCCAGG GCCTTCCGGA CGACGTGTTC GACGGCCGGC CGGTGATCGG CATCTGCAAC ACGTGGTCCG AGATCACCCC CTGCAACGCG GGCCTGCGCG ACATCGCCGA GCACGTGAAG CGGGGCGTCT GGGAGGCCGG CGGCCTGCCG CTGGAGTTCC CGGCCATCTC GCTGGGCGAG ACCCAGATGC GGCCGACGGC CATGCTGTTT CGCAACCTGC TGGCCATGGA CGTCGAGGAG TCGATCCGCG GCAATCCCAT CGACGGCGTC GTGCTGCTGG GCGGCTGCGA CAAGACCACG CCGGGCCAGA TGATGGGCGC GGCCAGCGTC GACCTGCCCA CCATCGTCGT CTCGTCGGGA CCCATGCTGA ACGGCAAGTT CCGCGGCAAG GACATCGGCT CGGGCACCGA CGTGTGGAAG TTCTCCGAGG CCGTCCGGGC CGGCGAGATG ACCCTGCCGG ACTTCATGTC GGCCGAGAGC GGCATGAGCC GCTCGCCCGG CACCTGCATG ACCATGGGCA CCGCCTCGAC CATGGCCGCG ATCGTCGAGG CCATGGGCAT GAGCCTGCCC TACAACGCCT CGATTCCCGC CGTCGACGCC CGCCGCAAGG CGATGTCGCA CGAGACCGGC CGGACCATCG TGCGCATGGT GCATGACGGG CGGACCATGT CGCAGGTCTG CACGCGCGCC GCCTTCGAGA ACGCCCTGCG CGTCCACGCC GCGATCGGCG GCTCGACCAA CGCCGTGGTC CACCTGCTGG CCCTGGCCGG CCGCCTCGGC GTCGAGTTGA CGCTGGAGGA CTTCGACCAT CTGTCGCGCG ACGTGCCGCT GCTGGTCGAC CTGCAGCCGT CGGGCCGCTT CCTGATGGAG GACCTGCACT ATGCCGGCGG CCTGCCGGCT GTGATGAAGC AGATGTCGCC GTTCCTGAAC CCCGAGGCCC AGACCGTCTC GGGCGTGCGG ATCGGCGAAC AGTACGAGAC GGCCGAGGTG TTCAACGCCG AGGTCATCCG CAGCGTCGAG GCGCCCGTGA AACCCGACAG CGGCATCTGG GTGCTGCGCG GCAACCTGGC CCCCGGCGGG GCGGTGATGA AGCCCAGCGC CGCCAGCCCA GAACTGCTGA GCCACAAGGG CAAGGCCGTG GTGTTCGAGA CCATCGAGGA CTTCAAGGCT CGCATCGACG ATCCCGACCT CGACGTCGAC GCCAGCTCGA TCCTGGTGCT CAAGGGCTGC GGCCCCAAGG GCTATCCGGG CATGCCGGAA GTGGGCAACA TGCCGCTGCC GACCAAGCTG CTGGAACAGG GCGTCAAGGA CATGGTCCGC ATCAGCGACG CCCGGATGAG CGGCACGGCC TTCGGCACCG TCATCCTGCA CGTCTCGCCC GAGTCCGACG CTGGCGGCCC GCTGGCCGTG GTCCAGAACG GCGACGAGAT CGAACTGAAC GGCCCAACCC GGTCGCTGAA CCTGCTGATC TCCGACGCGG AGCTGGAAGC CCGTCTGGCC GTCTGGCGCG CCAATCCGCC GGCGCCCAAG GCCACGCGCG GCTACGCCAA GCTGTACATC GACCACGTGC TGGGCGCCGA CAAGGGCGCG GACCTGGACT TCCTGGTCGG CTCCAGCGGC TCGGTCGTCA CGCGAGAATC TCACTGA
|
Protein sequence | MGQGNGGGRF RSGHSYGKLD RDGFIHRSWM KSQGLPDDVF DGRPVIGICN TWSEITPCNA GLRDIAEHVK RGVWEAGGLP LEFPAISLGE TQMRPTAMLF RNLLAMDVEE SIRGNPIDGV VLLGGCDKTT PGQMMGAASV DLPTIVVSSG PMLNGKFRGK DIGSGTDVWK FSEAVRAGEM TLPDFMSAES GMSRSPGTCM TMGTASTMAA IVEAMGMSLP YNASIPAVDA RRKAMSHETG RTIVRMVHDG RTMSQVCTRA AFENALRVHA AIGGSTNAVV HLLALAGRLG VELTLEDFDH LSRDVPLLVD LQPSGRFLME DLHYAGGLPA VMKQMSPFLN PEAQTVSGVR IGEQYETAEV FNAEVIRSVE APVKPDSGIW VLRGNLAPGG AVMKPSAASP ELLSHKGKAV VFETIEDFKA RIDDPDLDVD ASSILVLKGC GPKGYPGMPE VGNMPLPTKL LEQGVKDMVR ISDARMSGTA FGTVILHVSP ESDAGGPLAV VQNGDEIELN GPTRSLNLLI SDAELEARLA VWRANPPAPK ATRGYAKLYI DHVLGADKGA DLDFLVGSSG SVVTRESH
|
| |