Gene Caul_2551 details


Gene Information

Locus tag: Caul_2551
Symbol:
ID: 5900006
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2771776
End bp: 2774004
Gene Length: 2229 bp
Protein Length: 742 aa
Translation table: 11
GC content: 68%
IMG OID: 641563042
Product: polyphosphate kinase
Protein accession: YP_001684176
Protein GI: 167646513
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0855] Polyphosphate kinase
TIGRFAM ID:
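
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2771776, 2774004)
print(length)                     # 2229
print(protein_length_aa(length))  # 742
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.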


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.183316
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG TCGTCGCCCT TGACCTGCCC GCCGCGCCGA CGCCGGCCGC GCCGGACCTG 
GCGGGCCTGC GGCTGGACGA CGAGCTGCTG GCCTCGCCCG AGCGGTTCTT CAACCGGGAA
ACCTCGTGGC TGGCCTTCAA CCAGCGGGTG CTGGAGGAGA GCGGCAATCC GCGCCACCCG
CTGCTGGAGC GGCTGCGGTT CCTGTCGATC TCGGCCAACA ATCTCGACGA GTTCTACATG
GTCCGCGTGG CCGGGCTGAA GGGCCAGGTG CGCGAGGGCG TGCGGGTGGT CAGCCAGGAC
GGGCTGACGC CGGGCGAGCA GCTGGCGCGG ATCAACGCCT CGGCCGCCGA GTTGATGGCC
GAGCAGCAGA AGATCTGGCG CGAGGTGCGG GCCGAGCTGT GGGCCGAAGG CCTCAAGCTG
CTGGACGCCA AGGACATCGT CGGCGCCGAC CGCGAGCGGG CCGAGGAGCT GTTCCTGACC
CGGATGTTCC CGGTGCTGAC GCCCCTGGCC ATCGATCCGG CCCACCCCTT CCCGTTCATC
CCGAACCTGG GCTTCTGCCT GGCGCTGAAG CTGCGGCGGA TCGTCGACGA CAAGCACCTC
TACGCCCTGG TGCCGGTGCC CAGCCAGGTG CAGCGCTTCT GGGAGCTGTC GCAGGCGGCC
GGGACCAGGA CCCGCAAGCG CGAGCGGCGG ATCGTGGCGC TGGAAAGCTT CATCATCCTG
TTCCTGGGGC ACCTGTTCCC CGGCTACGAG GTCGAGGGCC GCGGCCTGTT CCGCCTGATC
CGCGACAGCG ACCTGGAGAT CGAGGAAGAG GCCGAAGACC TGGTGCGCGA GTTCGAGGCG
CGGCTGAAGA AGCGCCGCCT GGGCCGCGTG GTGCGGGTCA AGATCCAGAC CACCATGCCG
GCCGACCTGC GCGACTTCAT CATCGAGGGC CTGCGCGCCG AGCCCGAGGA CGTGATCATC
GTCGACGGCA AGCTGGGCCT GGCCCAGATG GCCGAGCTGA TCCCGCCCGA CCGTCCGGAC
CTGAAGTTCA AGCCCTACAA CGCCCGCTTC CCCGAGCGGG TGCGCGACCA CGGCGGCGAC
TGCTTCGCGG CGATCCGCGA GAAGGACATC CTGGTCCACC ACCCGTTCGA GAGCTTCGAC
GTGGTGGTGC AGTTCATCCG CCAGGCGGCG CGCGACCCGG CCGTGCTGGC CATCAAGCAG
ACGCTGTATC GCACCAGCAA GGACAGCCCG ATCGTCGCGG CCCTGATCGA GGCGGCCGAC
AACGGCAAGA ACGTCACCGC CCTCGTGGAG ATCAAGGCCA GGTTCGACGA AGAAAACAAT
TTGAAGTGGG CTCGAGATCT GGAGCGCGCC GGCGTCCACG TGGTGTTCGG CTTCGTCGAC
TGGAAGACCC ACGCCAAGCT GTCGGTGGTG GTGCGGCGCG AGGGCGAGGC CCTGCGCACC
TATTGCCACT TCGGCACCGG AAACTATCAC CCGCAGACGG CGAAGGTGTA CACCGACCTG
TCGCTGTTCA CCTGCGATCC GGCCCTGGGG CGCGACGGCG GCAAGCTGTT CAACTTCATC
ACCGGCTACG CTCAGCCGCA CGGGCTGGAA AAGCTGAGCT TCTCGCCCGA GACGCTGAAG
CCGGACCTGC TGAGGATGAT CGCCCACGAG GCCCGCAACG CCCGCGACGG CAAGCCGGCG
GCGATCTGGG CCAAGATGAA CGCCGTGGTC GACCCGCAGA TCATCGACGC CCTCTACAGC
GCCAGCCAGG ACGGCGTGCA GATCGACCTG GTCGTGCGCG GCATCTGCTG CCTGCGCCCG
GGCATCAAGG GGCTGTCGGA GAACATCCGG GTCAAGAGCA TCGTCGGACG GTTCCTGGAG
CACGCCCGCG TCGTGGCCTT CGCCAACGGC GCCCCGATGC CCAGCGCCCA GACCCGGCTG
TTCATCAGCT CGGCCGACTG GATGCCGCGC AACCTCGACC GCCGGGTCGA GAGCCTGGTT
CCGCTGGAGA ACCCCACCGT GCACCAGCAG GTGCTCAACC AGATCATGGT CGCCAACCTC
AACGACGAGG CCCAGAGCTG GAACCTGGAT GGAGAAGGAC GTTACGCCCG CGACCCGGCC
TGGGACCGCA AGGGCGCCTT CTCGGCCCAC GAATACTTCA TGACCAATCC CAGCCTGTCT
GGCCGGGGCC ACAAGGTGAA GGACCTGCCG CAGGCCTTCG ACCACGTGGG ACCGCGCAAG
CGGGGATGA
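
The gene sequence is printed in groups of ten bases, so the spacing has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used for illustration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, copied with its grouping.
first_line = "ATGACCGACG TCGTCGCCCT TGACCTGCCC GCCGCGCCGA CGCCGGCCGC GCCGGACCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full sequence block to the same two functions would give the length and GC content of the whole gene.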
 
Protein sequence
MTDVVALDLP AAPTPAAPDL AGLRLDDELL ASPERFFNRE TSWLAFNQRV LEESGNPRHP 
LLERLRFLSI SANNLDEFYM VRVAGLKGQV REGVRVVSQD GLTPGEQLAR INASAAELMA
EQQKIWREVR AELWAEGLKL LDAKDIVGAD RERAEELFLT RMFPVLTPLA IDPAHPFPFI
PNLGFCLALK LRRIVDDKHL YALVPVPSQV QRFWELSQAA GTRTRKRERR IVALESFIIL
FLGHLFPGYE VEGRGLFRLI RDSDLEIEEE AEDLVREFEA RLKKRRLGRV VRVKIQTTMP
ADLRDFIIEG LRAEPEDVII VDGKLGLAQM AELIPPDRPD LKFKPYNARF PERVRDHGGD
CFAAIREKDI LVHHPFESFD VVVQFIRQAA RDPAVLAIKQ TLYRTSKDSP IVAALIEAAD
NGKNVTALVE IKARFDEENN LKWARDLERA GVHVVFGFVD WKTHAKLSVV VRREGEALRT
YCHFGTGNYH PQTAKVYTDL SLFTCDPALG RDGGKLFNFI TGYAQPHGLE KLSFSPETLK
PDLLRMIAHE ARNARDGKPA AIWAKMNAVV DPQIIDALYS ASQDGVQIDL VVRGICCLRP
GIKGLSENIR VKSIVGRFLE HARVVAFANG APMPSAQTRL FISSADWMPR NLDRRVESLV
PLENPTVHQQ VLNQIMVANL NDEAQSWNLD GEGRYARDPA WDRKGAFSAH EYFMTNPSLS
GRGHKVKDLP QAFDHVGPRK RG
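
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The following is a minimal sketch of that step, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the CDS begins with ATG, as this one does. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; the amino-acid assignments
# are shared by the standard code and bacterial translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGACCGACG TCGTCGCCCT TGACCTGCCC"))  # MTDVVALDLP
print(translate("CGGGGATGA"))                         # RG
```

The two outputs match the first ten and last two residues of the protein sequence listed above.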