Gene Caul_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1784 
Symbol 
ID5899239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1886339 
End bp1887379 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content66% 
IMG OID641562274 
Productphosphate ABC transporter, periplasmic phosphate-binding protein PstS, putative 
Protein accessionYP_001683411 
Protein GI167645748 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR02136] phosphate binding protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.304982 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0252217 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCC TGCTCGGAGC CACCGCCGCG CTTGGCCTGC TGGCCGGCGC GTCCAGCGCC 
CACGCCGCCC GCGACTACGT CTGGGCCGCG GGCTCGTCGA CCGTGTTCCC CTTCTCCACC
CGCACCGCCG AAAACTTCGC CAAGAAGACC GGCAAGAAGG CTCCGAAGAT CGAAAGCCTG
GGCACCGGCG GCGGCGTCAA GATGTTCTGT GGCGGCATGG GCGAGAAGTT CCCCGACATC
GCCAACGCTT CGCGGCCGAT GAAGAAGTCG GAATTCGACG CCTGCAGGGC CAAGGGCGTC
AACAACATCG TCGAATTGAA AATCGGCTTC GACGGCATCG TCGTGGCCAT GGACAAGTCC
TCGCCAGACT ACAACTTCAA GGTCGAGCAC CTGTATCTGG GCCTGGGCCA GACGGTGCTG
CGCGGCGGCC AGTTCGTGGC CAATCCCTAC AAGACCTGGA AAGACGTCGG CGCCGGCCTG
CCGGCCAATC GCATCCTGGT CTACGGCCCT CCCCCCACCT CGGGCACCCG CGACGCCTTC
GTCGAACTGG CCATCGAGGG CGGGGCCAAG AAGTTCCCGA CCGCCAAGGC CCTGCACGAC
ACGGACGAGA AGGCCTTCAA GGCCAAGGTG GATCCGCTGC GCACGGACGG CGCCTGGGTC
GACGCCGGCG AGAACGACAA CGCCATCATC GGCACCATCG AGAAGACCCC GGGCGCGCTG
GGCGTGTTCG GCTACAGCTT CCTCGAAGAG AACGCCAACA AGATCAAGGG CGCCAGCGTC
AATGGCGTGA AGCCCACGGC CCAGGCGATC GCCAGCGGTC AGTACCCGCT GTCACGCTCG
CTCTACATCT ACGTCAAGAA GGACCAGGTC GGCGTGACCC CGGGCCTGAA AGAGTTCATC
GCCGAGTTCG TCTCCGACTC CGCCACCGGC CGGGGCGGCT ATCTGCAAGA CCGCGGCCTG
ATCCCCCTGC CCCCGGCCCA GCACGAGGCC ATGAAGGCCG CCGCCGGCAA GCTGACGCCG
ATGGCCGCGC CGAAGTCATA G
 
Protein sequence
MKTLLGATAA LGLLAGASSA HAARDYVWAA GSSTVFPFST RTAENFAKKT GKKAPKIESL 
GTGGGVKMFC GGMGEKFPDI ANASRPMKKS EFDACRAKGV NNIVELKIGF DGIVVAMDKS
SPDYNFKVEH LYLGLGQTVL RGGQFVANPY KTWKDVGAGL PANRILVYGP PPTSGTRDAF
VELAIEGGAK KFPTAKALHD TDEKAFKAKV DPLRTDGAWV DAGENDNAII GTIEKTPGAL
GVFGYSFLEE NANKIKGASV NGVKPTAQAI ASGQYPLSRS LYIYVKKDQV GVTPGLKEFI
AEFVSDSATG RGGYLQDRGL IPLPPAQHEA MKAAAGKLTP MAAPKS