Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4003 |
Symbol | |
ID | 5901465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4332663 |
End bp | 4334456 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564524 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001685626 |
Protein GI | 167647963 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCTCCG CTAATACGCC ATCTGGCAGA CCCCCCCGCC GGTTCCGCTC GCGCGACTGG TTCGACAATC CCGACCATAT CGACATGACC GCCCTCTATC TGGAGCGGTT CATGAACTAC GGGATCACGC CCGAGGAGCT GCGCAGCGGC AAGCCGATCA TCGGCATCGC CCAGACCGGC AGCGACATCT CGCCGTGCAA CCGCATCCAC CTGGACCTGG TGACCCGGAT CCGAGACGGC ATCCGCGACG CCGGCGGCAT TCCGATGGAG TTCCCGGTCC ATCCGATCTT CGAGAACTGC CGTCGTCCGA CGGCGGCCCT GGATCGCAAC CTGTCCTATC TGGGCCTGGT CGAGGTGCTG CACGGCTATC CGATCGACGC CGTGGTGCTG ACCACCGGCT GCGACAAGAC CACCCCGGCC GGCATCATGG CCGCCACCAC GGTCAATATC CCGGCGATCG TGCTGTCGGG CGGGCCGATG CTGGACGGCT GGCATGACGG CGAGCTGGTC GGCTCGGGCA CGGTGATCTG GCGCTCGCGG CGCAAGCTGG CGGCGGGCGA GATCAACGAG GAGGAGTTCA TCCAGCGCGC TTCCGACAGC GCCCCCTCGG CCGGCCATTG CAACACCATG GGCACGGCCT CGACCATGAA CGCCGTAGCC GAGGCGCTGG GCCTGTCGCT GACCGGCTGC GCGGCCATCC CCGCCCCGTA CCGCGAGCGC GGCCAGATGG CCTACAAGAC CGGCCAGCGG ATCGTCGACC TGGCCTATGA GGACGTAAAG CCCCTCGACA TCCTGACCAA GAAAGCCTTC GAGAACGCCA TCGCCCTGGT GGCGGCGGCC GGCGGCTCGA CCAACGCCCA GCCGCACATC GTGGCCATGG CCCGCCACGC CGGCCTCGAC ATCACCGCCG ACGACTGGCG CGCGGCCTAT GACATCCCGC TGATCCTCAA CATGCAGCCG GCCGGCAAGT ACCTGGGCGA GCGCTTCCAC CGGGCCGGCG GCGCGCCGGC CGTGCTGTGG GAACTGCTGC AGGCCGGACG CCTGCACGGC GACGTCATGA CCGTCACCGG CAAGACGATG GGCGAGAACC TGGAAGGCCG CGAGACCAAG GACCGCGAGG TGGTCTTCCC CTACGGCCAG CCGATGAGCG AGCGCGCCGG CTTCCTGGTG CTGAAGGGCA ACCTCTTCGA CTTCGCGATC ATGAAGACCA GCGTGATCAG CCAGGAGTTC CGCCAGCGCT ACCTGTCGGA GCCGGGCAAG GAAGACAGCT TCGAGGCCCG CGCCGTGGTG TTCGACGGCT CGGACGACTA CCACGCCCGC ATCAACGACC CGTCGCTGAA CATCGACGAG CGCACCATCC TTGTGATCCG CGGCGCGGGT CCGATCGGCT GGCCGGGTTC GGCCGAGGTG GTCAACATGC AGCCGCCGGA CGCCCTGCTC AAGCGCGGGA TCATGAGCCT GCCCACCCTG GGCGATGGCC GCCAGTCGGG CACCGCCGAC AGCCCCTCGA TCCTCAACGC CTCGCCCGAG AGCGCGATCG GCGGCGGCCT GTCGTGGCTG CGCACCGGCG ACATGATCCG CATCGATCTC AACACCGGGC GCTGCGACGC CCTGGTCGAC GAGGCGACGA TCGCCGAGCG TCGCAAGGAG GGCGTCCCGC CCGTGCCGGC GACCATGACC CCCTGGCAGG AGATCTACCG CGCCCACACG GGCCAGCTGG AGACCGGCGG GGTGCTGGAG TTCGCGGTCA AGTATCAGGA CCTGGCGAGC AAGCTGCCTC GGCACAATCA CTGA
|
Protein sequence | MTSANTPSGR PPRRFRSRDW FDNPDHIDMT ALYLERFMNY GITPEELRSG KPIIGIAQTG SDISPCNRIH LDLVTRIRDG IRDAGGIPME FPVHPIFENC RRPTAALDRN LSYLGLVEVL HGYPIDAVVL TTGCDKTTPA GIMAATTVNI PAIVLSGGPM LDGWHDGELV GSGTVIWRSR RKLAAGEINE EEFIQRASDS APSAGHCNTM GTASTMNAVA EALGLSLTGC AAIPAPYRER GQMAYKTGQR IVDLAYEDVK PLDILTKKAF ENAIALVAAA GGSTNAQPHI VAMARHAGLD ITADDWRAAY DIPLILNMQP AGKYLGERFH RAGGAPAVLW ELLQAGRLHG DVMTVTGKTM GENLEGRETK DREVVFPYGQ PMSERAGFLV LKGNLFDFAI MKTSVISQEF RQRYLSEPGK EDSFEARAVV FDGSDDYHAR INDPSLNIDE RTILVIRGAG PIGWPGSAEV VNMQPPDALL KRGIMSLPTL GDGRQSGTAD SPSILNASPE SAIGGGLSWL RTGDMIRIDL NTGRCDALVD EATIAERRKE GVPPVPATMT PWQEIYRAHT GQLETGGVLE FAVKYQDLAS KLPRHNH
|
| |