Gene Information |
Locus tag | Caul_1784 |
Symbol | |
ID | 5899239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1886339 |
End bp | 1887379 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562274 |
Product | phosphate ABC transporter, periplasmic phosphate-binding protein PstS, putative |
Protein accession | YP_001683411 |
Protein GI | 167645748 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | [TIGR02136] phosphate binding protein |
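The coordinate and length fields above can be checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon excluded from the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1886339
end_bp = 1887379
gene_length_bp = 1041
protein_length_aa = 346

# Assumption: Start/End are 1-based, inclusive coordinates on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
coding_codons = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1887379 - 1886339 + 1 = 1041 bp, and 1041 / 3 - 1 = 346 aa.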
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.304982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0252217 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACCC TGCTCGGAGC CACCGCCGCG CTTGGCCTGC TGGCCGGCGC GTCCAGCGCC CACGCCGCCC GCGACTACGT CTGGGCCGCG GGCTCGTCGA CCGTGTTCCC CTTCTCCACC CGCACCGCCG AAAACTTCGC CAAGAAGACC GGCAAGAAGG CTCCGAAGAT CGAAAGCCTG GGCACCGGCG GCGGCGTCAA GATGTTCTGT GGCGGCATGG GCGAGAAGTT CCCCGACATC GCCAACGCTT CGCGGCCGAT GAAGAAGTCG GAATTCGACG CCTGCAGGGC CAAGGGCGTC AACAACATCG TCGAATTGAA AATCGGCTTC GACGGCATCG TCGTGGCCAT GGACAAGTCC TCGCCAGACT ACAACTTCAA GGTCGAGCAC CTGTATCTGG GCCTGGGCCA GACGGTGCTG CGCGGCGGCC AGTTCGTGGC CAATCCCTAC AAGACCTGGA AAGACGTCGG CGCCGGCCTG CCGGCCAATC GCATCCTGGT CTACGGCCCT CCCCCCACCT CGGGCACCCG CGACGCCTTC GTCGAACTGG CCATCGAGGG CGGGGCCAAG AAGTTCCCGA CCGCCAAGGC CCTGCACGAC ACGGACGAGA AGGCCTTCAA GGCCAAGGTG GATCCGCTGC GCACGGACGG CGCCTGGGTC GACGCCGGCG AGAACGACAA CGCCATCATC GGCACCATCG AGAAGACCCC GGGCGCGCTG GGCGTGTTCG GCTACAGCTT CCTCGAAGAG AACGCCAACA AGATCAAGGG CGCCAGCGTC AATGGCGTGA AGCCCACGGC CCAGGCGATC GCCAGCGGTC AGTACCCGCT GTCACGCTCG CTCTACATCT ACGTCAAGAA GGACCAGGTC GGCGTGACCC CGGGCCTGAA AGAGTTCATC GCCGAGTTCG TCTCCGACTC CGCCACCGGC CGGGGCGGCT ATCTGCAAGA CCGCGGCCTG ATCCCCCTGC CCCCGGCCCA GCACGAGGCC ATGAAGGCCG CCGCCGGCAA GCTGACGCCG ATGGCCGCGC CGAAGTCATA G
|
Protein sequence | MKTLLGATAA LGLLAGASSA HAARDYVWAA GSSTVFPFST RTAENFAKKT GKKAPKIESL GTGGGVKMFC GGMGEKFPDI ANASRPMKKS EFDACRAKGV NNIVELKIGF DGIVVAMDKS SPDYNFKVEH LYLGLGQTVL RGGQFVANPY KTWKDVGAGL PANRILVYGP PPTSGTRDAF VELAIEGGAK KFPTAKALHD TDEKAFKAKV DPLRTDGAWV DAGENDNAII GTIEKTPGAL GVFGYSFLEE NANKIKGASV NGVKPTAQAI ASGQYPLSRS LYIYVKKDQV GVTPGLKEFI AEFVSDSATG RGGYLQDRGL IPLPPAQHEA MKAAAGKLTP MAAPKS
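The relationship between the gene and protein sequences above can be spot-checked by translating the first and last few codons. This is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as starts), so a standard codon table is used. The names `translate`, `gene_head`, and `gene_tail` are hypothetical, and only the first 30 and last 21 nucleotides of the gene sequence are included.

```python
# Standard codon table in TCAG order; amino-acid assignments are the same
# in translation table 11 (bacterial, archaeal and plant plastid code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 21 nucleotides, copied from the Gene sequence field.
# Both fragments are in frame because 1020 and 1041 are multiples of 3.
gene_head = "ATGAAGACCC TGCTCGGAGC CACCGCCGCG"
gene_tail = "ATGGCCGCGC CGAAGTCATA G"

print(translate(gene_head))  # compare with the start of the protein sequence
print(translate(gene_tail))  # '*' marks the stop codon
```

The first fragment should reproduce the opening residues of the protein sequence (MKTLLGATAA), and the second its closing residues (MAAPKS) followed by the stop codon TAG.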