Gene Caul_2549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2549 
Symbol 
ID5900004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2769734 
End bp2771221 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content73% 
IMG OID641563040 
ProductPpx/GppA phosphatase 
Protein accessionYP_001684174 
Protein GI167646511 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTTG CGGCGCCGCG CGACGCGGCC GTTATCGATA TCGGGTCCAA CTCGGTGCGC 
CTCGTGGTGT ACCGGCTGGA AGGCCGCGCC ATCTGGACCG TCTTCAACGA GAAGGTCCTG
GCGGGGCTTG GCCGTGACCT CGGCGAGGGC GGCAAGCTGA ACCCCGAAGG CGTCGCCCAG
ACTCTGGCGG CGTTGAAGCG CTTCCGCGCC GTGCTCGAGG CCGTCAAGCC GGCCGAGACC
TTCATCGCCG CCACCGCCGC CGTGCGCGAC GCCGCCGACG GGGCCGCCTT CATCCGCCAG
ATCCGGGCCG AGACCGGCTT CATCGCCCGC GTGCTGTCGG GCGAGGAAGA GGCCCGCTAC
GCCGCCGTCG GCGTGCTGGC CGGGACCCCG GACGCCACGG GCGTGGTCGG CGACCTGGGC
GGCGCCAGCC TGGAGCTGAT CCGCCTGGAA TCCACCGGCG CCGGCCTGGG CGTCACCCTG
CCGCTGGGGC CGTTCAGCCT GGGCCTGTCC GACGGCTTCG ACCTGGAGCG GGTGCGCCGG
CTGTGCGCCC AGCGCCTGGC CCCGGCGGCC GAGGCCTTCA AGACCGACGT CTTCCACGCC
GTGGGCGGGG CCTGGCGCAA CCTGGCCCTG CTGCACATGC GATTGAGCGA CTATCCGCTG
CACGTCGTCC ACCAGTACAG CATCAGCCGG TCCGAGGCGC TCGAAGCCGC GCGCCTGGTG
GCCCACCAGT CGCGCAGCTC GCTCGACCGC ATCGAGGGGA TGAGCAAGAA GCGCTCGGAG
ACCCTGCCCT ACGCCGCCGT CGTGCTGGAG CAACTGATCG AGAGCCTGGA CCTCAAGCGT
ATCGAGATCT CGGCCTATGG CGTGCGCGAG GGCCTGCTGT TCGAGGCCAT GGCCCCGGGC
GTGCGCCGGC TGGACCCGCT GATCGAAGGC TGCGCCTCGC TGGGCGCGAG GCAAGGCGTC
GCCGACGAAC TGGGCGCGGC GCTGGACGCC TGGCTGGCCC CGGCCCTGGC GCAGCTGTCG
CCGTGTTTCG GCGACCGCGA TCCGGTCCTG GCCTCGGCCG CCTGCCGCCT GGCCGACCTG
GGCGCGCGCC TGCACCCCGA CCATCGCGCC GACCTGGTGT TCGAGCAGGT GCTGCGGGCC
CCGATCGCCG GCCAGACCCA CCCCGAACGC GCCTTCCTGG CGGCCGCCGC CTTCGCCCGT
CACGCCACCG CCTTCACGCC GCCGGAGATG GGCGTGCTGG AGCGCCTGCT ATCCCCGGCC
CGCCTGAAGC GCGCCCGCGC CCTGGGCGCC GCCATCCGCC TGGGCTGCGA CCTGTCGGGC
CGCAGCGCGC CCCTGCTGGC CCGTTCGCGA CTCATCATCG ACAAGGGCGA CCTGCTGCTG
ACCGCCGAGC CCGGCTACGC CGACCTGCTG CTGGGGGAGC AAACCGCCAA GCGCGCAGGA
ACCCTGGCGA ACCTGCTAGG ACTGAAGCTG AAGATCCTGG CGCTCTAG
 
Protein sequence
MPFAAPRDAA VIDIGSNSVR LVVYRLEGRA IWTVFNEKVL AGLGRDLGEG GKLNPEGVAQ 
TLAALKRFRA VLEAVKPAET FIAATAAVRD AADGAAFIRQ IRAETGFIAR VLSGEEEARY
AAVGVLAGTP DATGVVGDLG GASLELIRLE STGAGLGVTL PLGPFSLGLS DGFDLERVRR
LCAQRLAPAA EAFKTDVFHA VGGAWRNLAL LHMRLSDYPL HVVHQYSISR SEALEAARLV
AHQSRSSLDR IEGMSKKRSE TLPYAAVVLE QLIESLDLKR IEISAYGVRE GLLFEAMAPG
VRRLDPLIEG CASLGARQGV ADELGAALDA WLAPALAQLS PCFGDRDPVL ASAACRLADL
GARLHPDHRA DLVFEQVLRA PIAGQTHPER AFLAAAAFAR HATAFTPPEM GVLERLLSPA
RLKRARALGA AIRLGCDLSG RSAPLLARSR LIIDKGDLLL TAEPGYADLL LGEQTAKRAG
TLANLLGLKL KILAL