Gene Caul_3131 details


Gene Information

Locus tag: Caul_3131
Symbol:
ID: 5900586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3393664
End bp: 3395394
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 70%
IMG OID: 641563634
Product: dihydroxy-acid dehydratase
Protein accession: YP_001684756
Protein GI: 167647093
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
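
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3393664
end_bp = 3395394
gene_length_bp = 1731
protein_length_aa = 576

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop
# codon, which does not contribute a residue to the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a multiple of 3:", span_bp % 3 == 0)
print("codons minus stop matches protein length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with one another: 1731 bp is 577 codons, and dropping the stop codon leaves 576 residues.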


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.696259
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.295178
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGA AACCCGACGG GACCTGGGAC AAGTCGCAAC TGCCCAGCCG GCATGTAACC 
GAAGGGCCGG CCCGCGCGCC GCACCGCTCT TATTATTACG CCATGGGTCT TGGCACGCGT
GAGATCGCCC AGCCGTTCGT CGGCGTCGCC TCGTGCTGGA ACGAGGCCGC GCCCTGCAAC
ACCGCCCTGA TGCGCCAGGC CAACGCCGTG GCCAAGGGCG TCAAGGCGGC CGGCGGCACC
CCGCGCGAGT TCTGCACCAT CACCGTCACC GACGGCATCG CCATGGGCCA CGAGGGCATG
CGTTCGTCCC TGGTCAGCCG CGACGTGATC GCCGACTCCG TCGAGCTGAC CATGCGCGGC
CACGGCTATG ACGCGCTCGT GGGCGTCGCC GGGTGCGACA AGAGCCTGCC GGGCATGATG
ATGGCCATGC TGCGCCTCAA CGTGCCCAGC GTGTTCCTGT ACGGCGGCTC GATCCTCCCG
GGACGCTTCC AGGGCCGCGA CATCACCGTG ATGGACGTCT TCGAGGGCGT CGGCGCCTAT
GCCGCCGGGA CCATGGACGC CAAGACCCTG TGCGAGCTGG AGCAGCACGC CTGCCCGTCG
GACGGCGCCT GCGGCGGCCA GTTCACGGCC AACACCATGG CCTGCGTGTC GGAAGCCATC
GGCCTGGCCC TGCCGCTGTC CTCGGCCCTG CCGGCCCCGT ACCTGGACCG CGACCAGTAC
GCGGTGGCCT CGGGCGAGGC GGTGATGCGG CTGATCGAGC AGAACATCCG CCCGCGCGAT
ATCTGCACCC GCAAGGCCTT CGAGAACGCC GCCGTCGTCG TCGCGGCCAC CGGCGGTTCG
ACCAATGGCG CGCTGCACCT GCCGGCCATG GCCCACGAGT GCGGCATCGA GTTCACCCTC
AAGGACGTGG CCGAGATCGC CGCCCGCACG CCCTATATCG CCGACCTCAA GCCCGGCGGT
CGCTACGTGG CCAAGGACAT GGGCGAGGCC GGCGGCGTGC CGATGCTGCT GCGCACCCTG
CTGGACGCCG GCCTGCTGCA CGGCGACGTC ATGACCGTCA CCGGCAAGAC CCTGGCCGAG
AACCTGGCCG ATGTGGTCTG GCGTGAGGAC CAGGACGTGA TCCGCCCGGT CTCCAATCCG
CTGTCGCCGA CTGGCGGCGT GGTCGGCCTG TGGGGCTCGC TGGCGCCCGA GGGCGGCATC
GTCAAGGTGG CCGGCCTCAA GCACCAGGTG CACCGCGGCC CGGCCCGGGT GTTCGACGGC
GAGGCGGCCT GTTTCGAAGC GGTGTCGAAC CGCGACTACA AGGCAGGCGA CGTCCTGGTC
ATCCGCTACG AAGGTCCGCG CGGCGGGCCG GGCATGCGCG AGATGCTGTC GACGACCGCC
GCGATCTACG GCCAGGGCGT GGAGAACATC GCCCTGATCA CCGACGGCCG CTTCTCGGGC
GCCACGCGCG GCCTGTGCAT CGGCCACGTG GGTCCCGAGG CCGCCGTGGG CGGTCCGATC
GCCCTGGTGC AGGACGGCGA CATCATCAGC ATCGACGCCA CCAAGGGGAC GATCGAGCTT
GAGGTCGAGG CCGAGGAACT GGCGCGCCGC AAGGCCGCCT GGAAGCCGCG CGGCCACGAC
TACAACAGCG GCGCGATCTG GAAGTTCGCC CAACTGGTCG GTCCAGCCTA TCTTGGCGCC
ACGACCCATC CGGGCGCGGC CAAGGAGACG CACGTCTACG CGGACATCTG A
 
Protein sequence
MTKKPDGTWD KSQLPSRHVT EGPARAPHRS YYYAMGLGTR EIAQPFVGVA SCWNEAAPCN 
TALMRQANAV AKGVKAAGGT PREFCTITVT DGIAMGHEGM RSSLVSRDVI ADSVELTMRG
HGYDALVGVA GCDKSLPGMM MAMLRLNVPS VFLYGGSILP GRFQGRDITV MDVFEGVGAY
AAGTMDAKTL CELEQHACPS DGACGGQFTA NTMACVSEAI GLALPLSSAL PAPYLDRDQY
AVASGEAVMR LIEQNIRPRD ICTRKAFENA AVVVAATGGS TNGALHLPAM AHECGIEFTL
KDVAEIAART PYIADLKPGG RYVAKDMGEA GGVPMLLRTL LDAGLLHGDV MTVTGKTLAE
NLADVVWRED QDVIRPVSNP LSPTGGVVGL WGSLAPEGGI VKVAGLKHQV HRGPARVFDG
EAACFEAVSN RDYKAGDVLV IRYEGPRGGP GMREMLSTTA AIYGQGVENI ALITDGRFSG
ATRGLCIGHV GPEAAVGGPI ALVQDGDIIS IDATKGTIEL EVEAEELARR KAAWKPRGHD
YNSGAIWKFA QLVGPAYLGA TTHPGAAKET HVYADI
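
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration rather than the method used to produce the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons). Only the first and last lines of the gene sequence are used as sample input; pasting in all lines would allow a comparison against the full protein and the reported GC content.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Translation table 11 uses these same assignments; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACCAAGA AACCCGACGG GACCTGGGAC AAGTCGCAAC TGCCCAGCCG GCATGTAACC"
last_line = "ACGACCCATC CGGGCGCGGC CAAGGAGACG CACGTCTACG CGGACATCTG A"

print(translate(first_line))  # MTKKPDGTWDKSQLPSRHVT
print(translate(last_line))   # TTHPGAAKETHVYADI
print(round(gc_fraction(first_line), 2))
```

The two translated fragments match the start and end of the protein sequence listed above, and the last codon of the gene (TGA) is the stop codon that the translation drops.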