Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4554 |
Symbol | |
ID | 5902015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4933434 |
End bp | 4934486 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641565073 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_001686172 |
Protein GI | 167648509 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCACG ATCTCAAGAC CCCCACGCGA CGCGGCCTGC TCGGCTCGGC CACGGCCGGG GCCGCGGCCC TGGCGGTTCC CGCCGCCCTG GCCGGCGCGG CCCACGCTCA AGGCGTCAAG CCGCTGACCC TGCTCAATGT CAGCTACGAC CCGACCCGCG AGCTCTACAA GGACGTCAAC GCCGCCTACG CCAGGTACTG GAAGGACAAG GTCGGCCAGG TGCTGACCAT CAACCAGTCG CACGGCGGCT CGGGCAAGCA GGCCCGCTCG GTGATCGACG GCCTGCAGGC CGACGTCGTC ACCCTGGCGC TGGCCTATGA CATCGACGAG ATCGCCGCGA GAGCCAAGCT GCTCCCGGCC AACTGGCAAT CGCGCCTGCC CAACAATTCC ACGCCCTACA CCTCGACGAT CGTGTTCCTG GTCCGCAAGG GCAACCCCTG GAAGATCAAG GACTGGGGCG ACCTGATCAA GCCGGGCATC GACGTGATCA CCCCCAACCC GAAGACCTCG GGCGGGGCGC GCTGGAACTA CCTGGCCGCC TGGGCCTGGG CCTTGAAGCA GCCGGGCGGC AATCCGGCCA AGGCCGAGGC CTTCGTCGGC GAGATCTTCA AGCACGTTCC TGTGCTCGAC ACCGGCGCGC GCGGCGCGAC CACCAGCTTC ACCCAGCGCG GCCTGGGCGA CGTGCTGCTG TCGTGGGAGA ACGAGGCCTA CCTGGCGCAG GAGGAACTGC CGGGCAAATT CGACATCGTC TATCCGTCGC TGTCGATCCT GGCCGAGCCG CCGGTCGCCC TGGTCGACAA GAACGTCGAC CGGCACAAGA CCCGCAAGGC GGCCGAGGGC TATCTGAACT TCCTCTACAG CCCCATCGCC CAGGACCTGA TCGGCAAGAA CTACTATCGC CCCCGCAACC CGGCGGCGGC GGCCAAGTAC GCCGCGCGGT TCAAGTCGAT CCCGCTGGTC ACCATCGACG ACACCTTCGG CGGCTGGAAG AAGGCCCAGG CCACCCACTT CGCGGACGGC GGCGTCTTCG ACCGGATCTA TCGTCCGAAA TAG
|
Protein sequence | MTHDLKTPTR RGLLGSATAG AAALAVPAAL AGAAHAQGVK PLTLLNVSYD PTRELYKDVN AAYARYWKDK VGQVLTINQS HGGSGKQARS VIDGLQADVV TLALAYDIDE IAARAKLLPA NWQSRLPNNS TPYTSTIVFL VRKGNPWKIK DWGDLIKPGI DVITPNPKTS GGARWNYLAA WAWALKQPGG NPAKAEAFVG EIFKHVPVLD TGARGATTSF TQRGLGDVLL SWENEAYLAQ EELPGKFDIV YPSLSILAEP PVALVDKNVD RHKTRKAAEG YLNFLYSPIA QDLIGKNYYR PRNPAAAAKY AARFKSIPLV TIDDTFGGWK KAQATHFADG GVFDRIYRPK
|
| |