Gene Caul_1440 details


Gene Information

Locus tag: Caul_1440
Symbol:
ID: 5898895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1529629
End bp: 1531455
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 68%
IMG OID: 641561927
Product: phosphogluconate dehydratase
Protein accession: YP_001683068
Protein GI: 167645405
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
            [TIGR01196] 6-phosphogluconate dehydratase
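
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1529629
END_BP = 1531455


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming a complete CDS with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1827
print(protein_length(length))  # 608
```

Under those assumptions both results agree with the Gene Length (1827 bp) and Protein Length (608 aa) fields.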


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.683477
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTCA ACCCCGTCGC TCTTCATCCT GTGATCGCCG AAGTCACCGC CCGGATCATC 
GAGCGCAGCC GCGACAGCCG CGCGACCTAC CTGGCCAATC TCGACGCGGC CGCGGCCGCC
CAGCCGGGAC GGGCAAAACT CAGCTGCGCC AACTGGGCCC ACGCCTTCGC CGCCTCGCCG
TCGGTCGACA AGGTTCGCGC CCTCGATCCG AACGCCCCCA ACCTGGGCAT CGTCTCGGCC
TATAACGACA TGTTGTCGGC CCACCAGCCG CTGGAGGAGT ACCCGGCGCT GATCAAGGCG
GCCGCGCGCG AGGTCGGGGC CACGGCGCAA TTCGCCGGCG GCGTGCCGGC CATGTGCGAC
GGCGTCACCC AGGGGCGTCC GGGCATGGAG CTGTCGCTGT TCTCGCGCGA CGTGATCGCC
ATGGCCACGG GCATCGCCCT GACCCACGAC GCCTTCGACG GCGCGCTCTA CCTGGGCGTC
TGCGACAAGA TCGTGCCGGG CCTGCTGATC GGCGCCCTGA CCTTCAGCCA CCTGCCGGCG
ATGTTCGTGC CGGCCGGTCC GATGACTTCG GGCCTGCCCA ATTCGGAGAA GGCCCGCATC
CGCGCCCTCT ACGCCGAGGG CAAGGTCGGT CGCGAGGAAC TGCTGGCCGC CGAGAGCGCC
AGCTATCACG GCCCTGGCAC CTGCACCTTC TATGGCACGG CCAACACCAA CCAGATGCTG
ATGGAGCTGA TGGGCCTGCA CCTGCCGGGC TCGGCCTTCG TCCATCCCCA CACCCCGCTG
CGCAGCGCCC TGGTCAAGGA AGCCGCCAAA CGCGTCGCCG CCATCACCCA CAAGGGCAAT
GAGTGGATTC CGGTCGGCCG GGTGATCGAC GAGAAGGCCG TGGTCAACGG CGTGGTCGGC
CTGATGGCCA CCGGCGGCTC GACCAACCTG GCCCTGCACC TGGTCGCCAT GGCCCACGCG
GCCGGCATCA TCCTGACCCT TGAAGACCTG GACGACATCT CGAAGAACAC GCCGCTGCTG
GCCAAGGTCT ATCCAAACGG CTCGGCCGAC GTGAACCAGT TCCACGCCGC CGGCGGCATC
CATTTCGTGG TCAGGGAGCT GTTGAAGGCC GGCTTGGTCC ACGAGGACGT CCTGACCGTC
GTCGGCCCCG GCCTGTCGCG CTACACCCAG GAGCCGGTCC TGATCGACGG CGAGCTGGCT
TGGCGGGAGG GCGCCGAGCA GTCGCTGGAT CTCAACATCC TGCGCCCGGC CTCAGACCCG
TTCAGCCCGG AAGGTGGTCT GCGCCTACTG ACCGGAAATC TGGGTCGCGG GGTGATCAAG
GTCTCGGCCG TCAAGCCCGA GCATCAGGTG ATCACCGCCC CGGCGGCCGT GTTCCAGGAA
CAGGAAGACT TCATCGCCGC CTTCAAGCGC GGCGAGCTCG ACCGCGATGT CGTGGTGGTG
GTGCGCTTCC AAGGCCCGTC GGCCAACGGC ATGCCTGAAC TGCACAACCT GTCGCCCTCG
ATCTCGGTGT TGCTGGACCG CGGCTTCAAG GTCGCCCTGG TCACCGACGG CCGGATGTCG
GGCGCGTCGG GCAAGACCCC GGCCGCCATC CACCTGACCC CGGAAGCGGC CAAGGGCGGC
GCCCTGGCCT ATGTCGAGGA CGGCGACGTC ATCTCTCTGA ACGCCCATAC GGGCGAACTG
AAAATCTTGG TGGACGAGGC GACCCTGCGC GCCCGCACGC CGGCAAAAGT CCCGGCGTCC
AAGCCGGGCT ATGGACGGGA ACTCTTCGGC CTGCTGCGTT CGGGGGTCGG CGCGTCCGAC
CACGGCGCAT CGGTGCTGTT TGCTTAG
 
Protein sequence
MAVNPVALHP VIAEVTARII ERSRDSRATY LANLDAAAAA QPGRAKLSCA NWAHAFAASP 
SVDKVRALDP NAPNLGIVSA YNDMLSAHQP LEEYPALIKA AAREVGATAQ FAGGVPAMCD
GVTQGRPGME LSLFSRDVIA MATGIALTHD AFDGALYLGV CDKIVPGLLI GALTFSHLPA
MFVPAGPMTS GLPNSEKARI RALYAEGKVG REELLAAESA SYHGPGTCTF YGTANTNQML
MELMGLHLPG SAFVHPHTPL RSALVKEAAK RVAAITHKGN EWIPVGRVID EKAVVNGVVG
LMATGGSTNL ALHLVAMAHA AGIILTLEDL DDISKNTPLL AKVYPNGSAD VNQFHAAGGI
HFVVRELLKA GLVHEDVLTV VGPGLSRYTQ EPVLIDGELA WREGAEQSLD LNILRPASDP
FSPEGGLRLL TGNLGRGVIK VSAVKPEHQV ITAPAAVFQE QEDFIAAFKR GELDRDVVVV
VRFQGPSANG MPELHNLSPS ISVLLDRGFK VALVTDGRMS GASGKTPAAI HLTPEAAKGG
ALAYVEDGDV ISLNAHTGEL KILVDEATLR ARTPAKVPAS KPGYGRELFG LLRSGVGASD
HGASVLFA
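
The sequences above are printed in 10-character blocks, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the protein sequence. The helper names (`strip_blocks`, `gc_percent`, `translate`) are hypothetical, and the codon table is built from the standard genetic code, whose codon-to-amino-acid assignments are assumed here to be the same as those of translation table 11.

```python
# Standard codon assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def strip_blocks(text: str) -> str:
    """Remove the spaces and line breaks between 10-character blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCCGTCA ACCCCGTCGC TCTTCATCCT GTGATCGCCG AAGTCACCGC CCGGATCATC"
last_line = "CACGGCGCAT CGGTGCTGTT TGCTTAG"

print(translate(strip_blocks(first_line)))  # MAVNPVALHPVIAEVTARII
print(translate(strip_blocks(last_line)))   # HGASVLFA
```

The two outputs match the first 20 and last 8 residues of the protein sequence. The last line can be translated on its own because the 1800 bases before it are a whole number of codons. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.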