Gene Information |
Locus tag | Caul_3131 |
Symbol | |
ID | 5900586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3393664 |
End bp | 3395394 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641563634 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001684756 |
Protein GI | 167647093 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
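The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that does not contribute to the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3393664
end_bp = 3395394

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the record: 1731 bp and 576 aa.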
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.696259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.295178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAGA AACCCGACGG GACCTGGGAC AAGTCGCAAC TGCCCAGCCG GCATGTAACC GAAGGGCCGG CCCGCGCGCC GCACCGCTCT TATTATTACG CCATGGGTCT TGGCACGCGT GAGATCGCCC AGCCGTTCGT CGGCGTCGCC TCGTGCTGGA ACGAGGCCGC GCCCTGCAAC ACCGCCCTGA TGCGCCAGGC CAACGCCGTG GCCAAGGGCG TCAAGGCGGC CGGCGGCACC CCGCGCGAGT TCTGCACCAT CACCGTCACC GACGGCATCG CCATGGGCCA CGAGGGCATG CGTTCGTCCC TGGTCAGCCG CGACGTGATC GCCGACTCCG TCGAGCTGAC CATGCGCGGC CACGGCTATG ACGCGCTCGT GGGCGTCGCC GGGTGCGACA AGAGCCTGCC GGGCATGATG ATGGCCATGC TGCGCCTCAA CGTGCCCAGC GTGTTCCTGT ACGGCGGCTC GATCCTCCCG GGACGCTTCC AGGGCCGCGA CATCACCGTG ATGGACGTCT TCGAGGGCGT CGGCGCCTAT GCCGCCGGGA CCATGGACGC CAAGACCCTG TGCGAGCTGG AGCAGCACGC CTGCCCGTCG GACGGCGCCT GCGGCGGCCA GTTCACGGCC AACACCATGG CCTGCGTGTC GGAAGCCATC GGCCTGGCCC TGCCGCTGTC CTCGGCCCTG CCGGCCCCGT ACCTGGACCG CGACCAGTAC GCGGTGGCCT CGGGCGAGGC GGTGATGCGG CTGATCGAGC AGAACATCCG CCCGCGCGAT ATCTGCACCC GCAAGGCCTT CGAGAACGCC GCCGTCGTCG TCGCGGCCAC CGGCGGTTCG ACCAATGGCG CGCTGCACCT GCCGGCCATG GCCCACGAGT GCGGCATCGA GTTCACCCTC AAGGACGTGG CCGAGATCGC CGCCCGCACG CCCTATATCG CCGACCTCAA GCCCGGCGGT CGCTACGTGG CCAAGGACAT GGGCGAGGCC GGCGGCGTGC CGATGCTGCT GCGCACCCTG CTGGACGCCG GCCTGCTGCA CGGCGACGTC ATGACCGTCA CCGGCAAGAC CCTGGCCGAG AACCTGGCCG ATGTGGTCTG GCGTGAGGAC CAGGACGTGA TCCGCCCGGT CTCCAATCCG CTGTCGCCGA CTGGCGGCGT GGTCGGCCTG TGGGGCTCGC TGGCGCCCGA GGGCGGCATC GTCAAGGTGG CCGGCCTCAA GCACCAGGTG CACCGCGGCC CGGCCCGGGT GTTCGACGGC GAGGCGGCCT GTTTCGAAGC GGTGTCGAAC CGCGACTACA AGGCAGGCGA CGTCCTGGTC ATCCGCTACG AAGGTCCGCG CGGCGGGCCG GGCATGCGCG AGATGCTGTC GACGACCGCC GCGATCTACG GCCAGGGCGT GGAGAACATC GCCCTGATCA CCGACGGCCG CTTCTCGGGC GCCACGCGCG GCCTGTGCAT CGGCCACGTG GGTCCCGAGG CCGCCGTGGG CGGTCCGATC GCCCTGGTGC AGGACGGCGA CATCATCAGC ATCGACGCCA CCAAGGGGAC GATCGAGCTT GAGGTCGAGG CCGAGGAACT GGCGCGCCGC AAGGCCGCCT GGAAGCCGCG CGGCCACGAC TACAACAGCG GCGCGATCTG GAAGTTCGCC CAACTGGTCG GTCCAGCCTA TCTTGGCGCC ACGACCCATC CGGGCGCGGC CAAGGAGACG CACGTCTACG CGGACATCTG A
|
Protein sequence | MTKKPDGTWD KSQLPSRHVT EGPARAPHRS YYYAMGLGTR EIAQPFVGVA SCWNEAAPCN TALMRQANAV AKGVKAAGGT PREFCTITVT DGIAMGHEGM RSSLVSRDVI ADSVELTMRG HGYDALVGVA GCDKSLPGMM MAMLRLNVPS VFLYGGSILP GRFQGRDITV MDVFEGVGAY AAGTMDAKTL CELEQHACPS DGACGGQFTA NTMACVSEAI GLALPLSSAL PAPYLDRDQY AVASGEAVMR LIEQNIRPRD ICTRKAFENA AVVVAATGGS TNGALHLPAM AHECGIEFTL KDVAEIAART PYIADLKPGG RYVAKDMGEA GGVPMLLRTL LDAGLLHGDV MTVTGKTLAE NLADVVWRED QDVIRPVSNP LSPTGGVVGL WGSLAPEGGI VKVAGLKHQV HRGPARVFDG EAACFEAVSN RDYKAGDVLV IRYEGPRGGP GMREMLSTTA AIYGQGVENI ALITDGRFSG ATRGLCIGHV GPEAAVGGPI ALVQDGDIIS IDATKGTIEL EVEAEELARR KAAWKPRGHD YNSGAIWKFA QLVGPAYLGA TTHPGAAKET HVYADI
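As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first three 10-base blocks and the last five codons of the gene sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The helper name `translate` and the variable names are illustrative and not part of the record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Amino-acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # The record prints the sequence in 10-base blocks, so drop whitespace first.
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 15 bases of the gene sequence, copied from the record.
first_block = "ATGACCAAGA AACCCGACGG GACCTGGGAC"
last_codons = "TACG CGGACATCTG A"

print(translate(first_block))  # MTKKPDGTWD
print(translate(last_codons))  # YADI*
```

The two outputs match the first ten and last four residues of the listed protein sequence, followed by the stop codon.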