Gene Caul_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4003 
Symbol 
ID5901465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4332663 
End bp4334456 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content69% 
IMG OID641564524 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001685626 
Protein GI167647963 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCTCCG CTAATACGCC ATCTGGCAGA CCCCCCCGCC GGTTCCGCTC GCGCGACTGG 
TTCGACAATC CCGACCATAT CGACATGACC GCCCTCTATC TGGAGCGGTT CATGAACTAC
GGGATCACGC CCGAGGAGCT GCGCAGCGGC AAGCCGATCA TCGGCATCGC CCAGACCGGC
AGCGACATCT CGCCGTGCAA CCGCATCCAC CTGGACCTGG TGACCCGGAT CCGAGACGGC
ATCCGCGACG CCGGCGGCAT TCCGATGGAG TTCCCGGTCC ATCCGATCTT CGAGAACTGC
CGTCGTCCGA CGGCGGCCCT GGATCGCAAC CTGTCCTATC TGGGCCTGGT CGAGGTGCTG
CACGGCTATC CGATCGACGC CGTGGTGCTG ACCACCGGCT GCGACAAGAC CACCCCGGCC
GGCATCATGG CCGCCACCAC GGTCAATATC CCGGCGATCG TGCTGTCGGG CGGGCCGATG
CTGGACGGCT GGCATGACGG CGAGCTGGTC GGCTCGGGCA CGGTGATCTG GCGCTCGCGG
CGCAAGCTGG CGGCGGGCGA GATCAACGAG GAGGAGTTCA TCCAGCGCGC TTCCGACAGC
GCCCCCTCGG CCGGCCATTG CAACACCATG GGCACGGCCT CGACCATGAA CGCCGTAGCC
GAGGCGCTGG GCCTGTCGCT GACCGGCTGC GCGGCCATCC CCGCCCCGTA CCGCGAGCGC
GGCCAGATGG CCTACAAGAC CGGCCAGCGG ATCGTCGACC TGGCCTATGA GGACGTAAAG
CCCCTCGACA TCCTGACCAA GAAAGCCTTC GAGAACGCCA TCGCCCTGGT GGCGGCGGCC
GGCGGCTCGA CCAACGCCCA GCCGCACATC GTGGCCATGG CCCGCCACGC CGGCCTCGAC
ATCACCGCCG ACGACTGGCG CGCGGCCTAT GACATCCCGC TGATCCTCAA CATGCAGCCG
GCCGGCAAGT ACCTGGGCGA GCGCTTCCAC CGGGCCGGCG GCGCGCCGGC CGTGCTGTGG
GAACTGCTGC AGGCCGGACG CCTGCACGGC GACGTCATGA CCGTCACCGG CAAGACGATG
GGCGAGAACC TGGAAGGCCG CGAGACCAAG GACCGCGAGG TGGTCTTCCC CTACGGCCAG
CCGATGAGCG AGCGCGCCGG CTTCCTGGTG CTGAAGGGCA ACCTCTTCGA CTTCGCGATC
ATGAAGACCA GCGTGATCAG CCAGGAGTTC CGCCAGCGCT ACCTGTCGGA GCCGGGCAAG
GAAGACAGCT TCGAGGCCCG CGCCGTGGTG TTCGACGGCT CGGACGACTA CCACGCCCGC
ATCAACGACC CGTCGCTGAA CATCGACGAG CGCACCATCC TTGTGATCCG CGGCGCGGGT
CCGATCGGCT GGCCGGGTTC GGCCGAGGTG GTCAACATGC AGCCGCCGGA CGCCCTGCTC
AAGCGCGGGA TCATGAGCCT GCCCACCCTG GGCGATGGCC GCCAGTCGGG CACCGCCGAC
AGCCCCTCGA TCCTCAACGC CTCGCCCGAG AGCGCGATCG GCGGCGGCCT GTCGTGGCTG
CGCACCGGCG ACATGATCCG CATCGATCTC AACACCGGGC GCTGCGACGC CCTGGTCGAC
GAGGCGACGA TCGCCGAGCG TCGCAAGGAG GGCGTCCCGC CCGTGCCGGC GACCATGACC
CCCTGGCAGG AGATCTACCG CGCCCACACG GGCCAGCTGG AGACCGGCGG GGTGCTGGAG
TTCGCGGTCA AGTATCAGGA CCTGGCGAGC AAGCTGCCTC GGCACAATCA CTGA
 
Protein sequence
MTSANTPSGR PPRRFRSRDW FDNPDHIDMT ALYLERFMNY GITPEELRSG KPIIGIAQTG 
SDISPCNRIH LDLVTRIRDG IRDAGGIPME FPVHPIFENC RRPTAALDRN LSYLGLVEVL
HGYPIDAVVL TTGCDKTTPA GIMAATTVNI PAIVLSGGPM LDGWHDGELV GSGTVIWRSR
RKLAAGEINE EEFIQRASDS APSAGHCNTM GTASTMNAVA EALGLSLTGC AAIPAPYRER
GQMAYKTGQR IVDLAYEDVK PLDILTKKAF ENAIALVAAA GGSTNAQPHI VAMARHAGLD
ITADDWRAAY DIPLILNMQP AGKYLGERFH RAGGAPAVLW ELLQAGRLHG DVMTVTGKTM
GENLEGRETK DREVVFPYGQ PMSERAGFLV LKGNLFDFAI MKTSVISQEF RQRYLSEPGK
EDSFEARAVV FDGSDDYHAR INDPSLNIDE RTILVIRGAG PIGWPGSAEV VNMQPPDALL
KRGIMSLPTL GDGRQSGTAD SPSILNASPE SAIGGGLSWL RTGDMIRIDL NTGRCDALVD
EATIAERRKE GVPPVPATMT PWQEIYRAHT GQLETGGVLE FAVKYQDLAS KLPRHNH