Gene Caul_4554 details

Gene Information

Locus tag: Caul_4554
Symbol:
ID: 5902015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4933434
End bp: 4934486
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 68%
IMG OID: 641565073
Product: sulfate ABC transporter, periplasmic sulfate-binding protein
Protein accession: YP_001686172
Protein GI: 167648509
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
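
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names gene_length and protein_length are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(4933434, 4934486)
print(length, protein_length(length))  # prints: 1053 350
```

Under these assumptions the result agrees with the Gene Length (1053 bp) and Protein Length (350 aa) fields.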


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCACG ATCTCAAGAC CCCCACGCGA CGCGGCCTGC TCGGCTCGGC CACGGCCGGG 
GCCGCGGCCC TGGCGGTTCC CGCCGCCCTG GCCGGCGCGG CCCACGCTCA AGGCGTCAAG
CCGCTGACCC TGCTCAATGT CAGCTACGAC CCGACCCGCG AGCTCTACAA GGACGTCAAC
GCCGCCTACG CCAGGTACTG GAAGGACAAG GTCGGCCAGG TGCTGACCAT CAACCAGTCG
CACGGCGGCT CGGGCAAGCA GGCCCGCTCG GTGATCGACG GCCTGCAGGC CGACGTCGTC
ACCCTGGCGC TGGCCTATGA CATCGACGAG ATCGCCGCGA GAGCCAAGCT GCTCCCGGCC
AACTGGCAAT CGCGCCTGCC CAACAATTCC ACGCCCTACA CCTCGACGAT CGTGTTCCTG
GTCCGCAAGG GCAACCCCTG GAAGATCAAG GACTGGGGCG ACCTGATCAA GCCGGGCATC
GACGTGATCA CCCCCAACCC GAAGACCTCG GGCGGGGCGC GCTGGAACTA CCTGGCCGCC
TGGGCCTGGG CCTTGAAGCA GCCGGGCGGC AATCCGGCCA AGGCCGAGGC CTTCGTCGGC
GAGATCTTCA AGCACGTTCC TGTGCTCGAC ACCGGCGCGC GCGGCGCGAC CACCAGCTTC
ACCCAGCGCG GCCTGGGCGA CGTGCTGCTG TCGTGGGAGA ACGAGGCCTA CCTGGCGCAG
GAGGAACTGC CGGGCAAATT CGACATCGTC TATCCGTCGC TGTCGATCCT GGCCGAGCCG
CCGGTCGCCC TGGTCGACAA GAACGTCGAC CGGCACAAGA CCCGCAAGGC GGCCGAGGGC
TATCTGAACT TCCTCTACAG CCCCATCGCC CAGGACCTGA TCGGCAAGAA CTACTATCGC
CCCCGCAACC CGGCGGCGGC GGCCAAGTAC GCCGCGCGGT TCAAGTCGAT CCCGCTGGTC
ACCATCGACG ACACCTTCGG CGGCTGGAAG AAGGCCCAGG CCACCCACTT CGCGGACGGC
GGCGTCTTCG ACCGGATCTA TCGTCCGAAA TAG
 
Protein sequence
MTHDLKTPTR RGLLGSATAG AAALAVPAAL AGAAHAQGVK PLTLLNVSYD PTRELYKDVN 
AAYARYWKDK VGQVLTINQS HGGSGKQARS VIDGLQADVV TLALAYDIDE IAARAKLLPA
NWQSRLPNNS TPYTSTIVFL VRKGNPWKIK DWGDLIKPGI DVITPNPKTS GGARWNYLAA
WAWALKQPGG NPAKAEAFVG EIFKHVPVLD TGARGATTSF TQRGLGDVLL SWENEAYLAQ
EELPGKFDIV YPSLSILAEP PVALVDKNVD RHKTRKAAEG YLNFLYSPIA QDLIGKNYYR
PRNPAAAAKY AARFKSIPLV TIDDTFGGWK KAQATHFADG GVFDRIYRPK