Gene Caul_2210 details


Gene Information

Locus tag: Caul_2210
Symbol:
ID: 5899665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2406409
End bp: 2407536
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 69%
IMG OID: 641562702
Product: Ppx/GppA phosphatase
Protein accession: YP_001683836
Protein GI: 167646173
COG category: [F] Nucleotide transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0248] Exopolyphosphatase
TIGRFAM ID:
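
The coordinate and length fields in this record are mutually consistent, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are illustrative only and do not come from the source database.

```python
# Values copied from the Gene Information record.
START_BP = 2406409
END_BP = 2407536
STOP_CODON_BP = 3  # assumption: the CDS length includes one stop codon


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS that ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len_bp - STOP_CODON_BP) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1128
print(protein_length(length_bp))  # 375
```

Both results agree with the Gene Length (1128 bp) and Protein Length (375 aa) fields.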


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0781544
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.173268
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGAGG CGCCGAGAGC GCCCGGCCCG CCTCAGCCCT CGGGCTCTGG CCGGGGCCGG 
CCGCCAGGCG AGCAGTCGTG CTATGCGGCG CTCGACCTGG GTACGAACAA CTGCCGCCTG
CTGGTGGCGA CGCCTTCGCC GCGCGGTTTC CGTGTCGTCG AGGCCTATTC CCGGATCGTG
CGGCTGGGCG AGGGCCTGTC CCAGACCGGC CAGCTTTCCG ACGAGGCCAT GGATCGCTCG
ATGGCGGCCC TGAAAGTTTG CGCCGAGAAG ATCCGTCGCC GCAAGGTCGT CCGCGTGAAG
GCCGTCGCCA CCCAGGCCTG CCGTGGCGCG ACCAATGGTC CCGATTTCGT GCGTCGCGTG
GCCGAGGAGA CCGGCCTTCG CTTGCAGATC ATTACGCCCA AGGAAGAGGC CCAGCTGTCG
GTGGCCGGCT GCATGAGCCT GTTCGACCGC GAGGCCGAGG CGGCGTTGGT GGTCGATGTC
GGCGGCGGAT CGACGGAATT GTCCTGGGTG GACCTCCAGG CTGGCGGCCT CGACGCCAAA
CCCCGGCAGT TCGCGGCCTG GCGGTTGCCG ATCAAGGCCT GGCTGTCGAT CCCGATCGGC
GTCGTTACTC TGGCCGAGCG CTTCCCCGAG GGCGAACGGG TCGAGGAGGG TTGGTTCCGA
GCCATGGTCG ACGCGGTCAA GGCGCGGATC GAGGAATTCC CGCACGCCGA GCCGATGCGT
CGGCTGTTCG AGAACGGCCG GGCTCACATG GTCGGCACCT CGGGCGCGAT CACCAGCCTG
GCTGGCCTGC ACCTGGGGCT GCCGCGCTAT GACCGCAACG TCGTCGATGG CCTGTGGATG
CAGCGGCACG AGTGCGAGGC GGCGGCCGAC CGGCTGCTGA CCCTGACCGC CAAGGAGAGA
GCTCTGGAGC CCTGCATCGG GGCTGATCGC GCCGATCTGG TGTTGGCCGG GGCCGCGATC
CTGCAGGCTG TTCAGGAGCT TTGGCCGTGT TCGCGCGTGC GTGTCGCTGA TCGGGGGCTT
AGAGAAGGTC TCTTGATGTC CCTGATGTCC GATCAGCAGA AGAAGCCGCG CCGACGTCGG
CGAGGCGGCT CCGGTCGGGC GTCCCGTCCC GCAACGGTGA ACGCATGA
 
Protein sequence
MSEAPRAPGP PQPSGSGRGR PPGEQSCYAA LDLGTNNCRL LVATPSPRGF RVVEAYSRIV 
RLGEGLSQTG QLSDEAMDRS MAALKVCAEK IRRRKVVRVK AVATQACRGA TNGPDFVRRV
AEETGLRLQI ITPKEEAQLS VAGCMSLFDR EAEAALVVDV GGGSTELSWV DLQAGGLDAK
PRQFAAWRLP IKAWLSIPIG VVTLAERFPE GERVEEGWFR AMVDAVKARI EEFPHAEPMR
RLFENGRAHM VGTSGAITSL AGLHLGLPRY DRNVVDGLWM QRHECEAAAD RLLTLTAKER
ALEPCIGADR ADLVLAGAAI LQAVQELWPC SRVRVADRGL REGLLMSLMS DQQKKPRRRR
RGGSGRASRP ATVNA
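
The sequences are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the code below strips that grouping, computes a GC fraction, and translates DNA to protein. It relies on one fact and one assumption: the codon-to-amino-acid assignments of translation table 11 match the standard code (the tables differ only in permitted start codons), and the gene starts with ATG, which is the case here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGTCGGAGG CGCCGAGAGC GCCCGGCCCG CCTCAGCCCT CGGGCTCTGG CCGGGGCCGG"
print(translate(first_line))  # MSEAPRAPGPPQPSGSGRGR
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.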