Gene Caul_1288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1288 
Symbol 
ID5898743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1353581 
End bp1355317 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content69% 
IMG OID641561773 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001682916 
Protein GI167645253 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0135195 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCAGG GAAACGGGGG CGGCAGGTTC CGCAGCGGTC ACAGCTACGG CAAGCTGGAT 
CGCGACGGGT TTATCCATCG CAGCTGGATG AAGAGCCAGG GCCTTCCGGA CGACGTGTTC
GACGGCCGGC CGGTGATCGG CATCTGCAAC ACGTGGTCCG AGATCACCCC CTGCAACGCG
GGCCTGCGCG ACATCGCCGA GCACGTGAAG CGGGGCGTCT GGGAGGCCGG CGGCCTGCCG
CTGGAGTTCC CGGCCATCTC GCTGGGCGAG ACCCAGATGC GGCCGACGGC CATGCTGTTT
CGCAACCTGC TGGCCATGGA CGTCGAGGAG TCGATCCGCG GCAATCCCAT CGACGGCGTC
GTGCTGCTGG GCGGCTGCGA CAAGACCACG CCGGGCCAGA TGATGGGCGC GGCCAGCGTC
GACCTGCCCA CCATCGTCGT CTCGTCGGGA CCCATGCTGA ACGGCAAGTT CCGCGGCAAG
GACATCGGCT CGGGCACCGA CGTGTGGAAG TTCTCCGAGG CCGTCCGGGC CGGCGAGATG
ACCCTGCCGG ACTTCATGTC GGCCGAGAGC GGCATGAGCC GCTCGCCCGG CACCTGCATG
ACCATGGGCA CCGCCTCGAC CATGGCCGCG ATCGTCGAGG CCATGGGCAT GAGCCTGCCC
TACAACGCCT CGATTCCCGC CGTCGACGCC CGCCGCAAGG CGATGTCGCA CGAGACCGGC
CGGACCATCG TGCGCATGGT GCATGACGGG CGGACCATGT CGCAGGTCTG CACGCGCGCC
GCCTTCGAGA ACGCCCTGCG CGTCCACGCC GCGATCGGCG GCTCGACCAA CGCCGTGGTC
CACCTGCTGG CCCTGGCCGG CCGCCTCGGC GTCGAGTTGA CGCTGGAGGA CTTCGACCAT
CTGTCGCGCG ACGTGCCGCT GCTGGTCGAC CTGCAGCCGT CGGGCCGCTT CCTGATGGAG
GACCTGCACT ATGCCGGCGG CCTGCCGGCT GTGATGAAGC AGATGTCGCC GTTCCTGAAC
CCCGAGGCCC AGACCGTCTC GGGCGTGCGG ATCGGCGAAC AGTACGAGAC GGCCGAGGTG
TTCAACGCCG AGGTCATCCG CAGCGTCGAG GCGCCCGTGA AACCCGACAG CGGCATCTGG
GTGCTGCGCG GCAACCTGGC CCCCGGCGGG GCGGTGATGA AGCCCAGCGC CGCCAGCCCA
GAACTGCTGA GCCACAAGGG CAAGGCCGTG GTGTTCGAGA CCATCGAGGA CTTCAAGGCT
CGCATCGACG ATCCCGACCT CGACGTCGAC GCCAGCTCGA TCCTGGTGCT CAAGGGCTGC
GGCCCCAAGG GCTATCCGGG CATGCCGGAA GTGGGCAACA TGCCGCTGCC GACCAAGCTG
CTGGAACAGG GCGTCAAGGA CATGGTCCGC ATCAGCGACG CCCGGATGAG CGGCACGGCC
TTCGGCACCG TCATCCTGCA CGTCTCGCCC GAGTCCGACG CTGGCGGCCC GCTGGCCGTG
GTCCAGAACG GCGACGAGAT CGAACTGAAC GGCCCAACCC GGTCGCTGAA CCTGCTGATC
TCCGACGCGG AGCTGGAAGC CCGTCTGGCC GTCTGGCGCG CCAATCCGCC GGCGCCCAAG
GCCACGCGCG GCTACGCCAA GCTGTACATC GACCACGTGC TGGGCGCCGA CAAGGGCGCG
GACCTGGACT TCCTGGTCGG CTCCAGCGGC TCGGTCGTCA CGCGAGAATC TCACTGA
 
Protein sequence
MGQGNGGGRF RSGHSYGKLD RDGFIHRSWM KSQGLPDDVF DGRPVIGICN TWSEITPCNA 
GLRDIAEHVK RGVWEAGGLP LEFPAISLGE TQMRPTAMLF RNLLAMDVEE SIRGNPIDGV
VLLGGCDKTT PGQMMGAASV DLPTIVVSSG PMLNGKFRGK DIGSGTDVWK FSEAVRAGEM
TLPDFMSAES GMSRSPGTCM TMGTASTMAA IVEAMGMSLP YNASIPAVDA RRKAMSHETG
RTIVRMVHDG RTMSQVCTRA AFENALRVHA AIGGSTNAVV HLLALAGRLG VELTLEDFDH
LSRDVPLLVD LQPSGRFLME DLHYAGGLPA VMKQMSPFLN PEAQTVSGVR IGEQYETAEV
FNAEVIRSVE APVKPDSGIW VLRGNLAPGG AVMKPSAASP ELLSHKGKAV VFETIEDFKA
RIDDPDLDVD ASSILVLKGC GPKGYPGMPE VGNMPLPTKL LEQGVKDMVR ISDARMSGTA
FGTVILHVSP ESDAGGPLAV VQNGDEIELN GPTRSLNLLI SDAELEARLA VWRANPPAPK
ATRGYAKLYI DHVLGADKGA DLDFLVGSSG SVVTRESH