Gene Caul_0958 details

Gene Information

Locus tag: Caul_0958
Symbol:
ID: 5898413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1007068
End bp: 1008780
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 65%
IMG OID: 641561440
Product: SSS family solute/sodium (Na+) symporter
Protein accession: YP_001682586
Protein GI: 167644923
COG category: [R] General function prediction only
COG ID: [COG4147] Predicted symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
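
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record; the 1-based inclusive convention is an assumption.
start_bp = 1007068
end_bp = 1008780

# Inclusive coordinates: both endpoints belong to the gene.
gene_length = end_bp - start_bp + 1

# Number of codons in the CDS, including the terminal stop codon.
codons = gene_length // 3

# The stop codon is not translated, so the protein is one residue shorter.
protein_length = codons - 1

print(gene_length, protein_length)  # 1713 570
```

Under these assumptions the result agrees with the listed gene length of 1713 bp and protein length of 570 aa.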


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.483778
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCTT GGCGAACGCT GCCGTGGCTG ATCGGCCTGG CGAGCCTGAC CTTCGCCGGC 
GTTGCGGCAG CCACGGGCTT CGACGGCCAA GCCCAACGCC AGCCGCTCAA CTGGACGGCC
ATCGCCATCT TCGCGGGGTT CGTGGCGCTC ACCCTTTGGA TCACCCGCTG GGCCGCGAGC
CGGACAAAGA CCGTGTCCGA TTTCTATACC GCCGGCGGCA AGGTCACGGG CTTTCAGAAC
GGATTGGCCA TCGCGGGCGA CTTCATGTCG GCCGCGTCGT TCCTGGGCAT TTCGGCCCAG
ATCTTCACCG ACGGCTATGA CGGGCTGATC TATTCGATCG GGTTCCTGGT CGGCTGGCCC
ATCCTGATGT TCCTGATGGC CGAAAAGCTG CGCAATCTTG GCCGCTTCAC GTTCGGCGAT
GTGGCGTCCT ATCGCTTCGC CCAGGCCCCG GTGCGCAGCT TCGCCGCGTC CAGCACCCTG
ATCGTGGTGA TCTTCTACCT GATCGCTCAG ATGGTCGGGG CCGGGCAGCT GATCCAGCTG
CTGTTTGGCC TCCCCTATCG CTACGCCGTG GGCCTGGTGG GTCTGCTGAT GATCCTCTAT
GTGCTGTTTG GCGGCATGAC CGCAACCACC TGGGTTCAGA TCACCAAGGC GGTGCTGCTG
CTGGGCGGCG CCAGCTTCAT GGCCTTCATG GTCATGGCTT CCGTGCATTT CTCCCCCAAC
GCCCTCTTCG CCCAGGCGAT CGACGTGAAG ACCCTACTGG CCGCCAAGGC GGGCGCCGAC
CCGCGAGAGG CGGCGCGGTT GGGCGGCGCC ATCATGGGGC CGGGGGGCTT TCTGAAGGAC
CCCATCTCGG CCATTTCCTT CGGCATGGCG CTGATGTTCG GCACCGCGGG CCTGCCGCAT
ATCCTGATGC GCTTCTTCAC GGTCGCGGAC GCGCGCGAAG CCCGCAAGTC CATCCTGTGG
GCGACGACCT GGATCGGCTA TTTCTATGTC CTGACGTTCA TCATCGGCTT TGGCGCCATT
GTCCTCGTCG CCAGCAACCC GGCGTTCCTC ACGCCCGAGG GCGGCTTGAG AGGGGGCGGC
AACATGGCCG CGATCCACCT GGCTCAGGCG GTCGGCGGCA ACCTCTTCCT CGGCTTCATC
TCCGCGGTGG CCTTCGCCAC CATTCTGGCC GTGGTGGCGG GCCTGGCGCT GTCGGGCGCC
TCGGCCATCT CGCATGATCT CTACGCGACC GTCTTCAAGC ACGGCGCGGC CGCGTCTCGC
GACGAACTGC GCGTGTCGCG GATCACGACG GTGATCCTTG GCCTGATCGC GGTCGTGATG
GGCGTGATGT TCGAGAAGCA GAATGTGGCC TTCATGGTCT CGCTCGCCTT CGCCCTCGCG
GCGTCGGGCA ACTTTCCCGT TCTTCTGCTC GGCCTGTTGT GGAAAGGCTG CACAACGCGG
GGCGCGGTCA TCGGCGGGTT CATCGGCCTG ACCACCGCCC TGGTCCTGAT GATCCTGTCC
CCGTCGATCT GGGTAAAGGC CCTGGGCCAT GAGACGGCGA TCTTCCCGTT CACCTCACCG
GCGCTGTTCT CCATGACCGC CGGGTTCGTC GGCATATGGC TCTTCTCGGT CCTTGATCGC
AGTCCCAGGT CGGTCGTGGA TCGCGACGGG TTCGAGGAGC AGCTGGTGCG GTCCGAAACG
GGGATCGGCG TCAGCGCGGC CCTGGATCAC TAG
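
A GC content figure such as the one in the gene information can be recomputed from a sequence printed in this 10-base block layout. The sketch below is one plain way to do it, not the database's own method. The helper name `gc_fraction` is hypothetical, and it assumes the sequence contains only unambiguous bases (A, C, G, T) separated by whitespace.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a whitespace-separated sequence."""
    # Drop the spaces and line breaks used for the 10-base block layout.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example on the first line of the gene sequence only (not the whole gene).
first_line = "ATGACCCCTT GGCGAACGCT GCCGTGGCTG ATCGGCCTGG CGAGCCTGAC CTTCGCCGGC"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence, line breaks included, to `gc_fraction` gives the value to compare with the reported GC content.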
 
Protein sequence
MTPWRTLPWL IGLASLTFAG VAAATGFDGQ AQRQPLNWTA IAIFAGFVAL TLWITRWAAS 
RTKTVSDFYT AGGKVTGFQN GLAIAGDFMS AASFLGISAQ IFTDGYDGLI YSIGFLVGWP
ILMFLMAEKL RNLGRFTFGD VASYRFAQAP VRSFAASSTL IVVIFYLIAQ MVGAGQLIQL
LFGLPYRYAV GLVGLLMILY VLFGGMTATT WVQITKAVLL LGGASFMAFM VMASVHFSPN
ALFAQAIDVK TLLAAKAGAD PREAARLGGA IMGPGGFLKD PISAISFGMA LMFGTAGLPH
ILMRFFTVAD AREARKSILW ATTWIGYFYV LTFIIGFGAI VLVASNPAFL TPEGGLRGGG
NMAAIHLAQA VGGNLFLGFI SAVAFATILA VVAGLALSGA SAISHDLYAT VFKHGAAASR
DELRVSRITT VILGLIAVVM GVMFEKQNVA FMVSLAFALA ASGNFPVLLL GLLWKGCTTR
GAVIGGFIGL TTALVLMILS PSIWVKALGH ETAIFPFTSP ALFSMTAGFV GIWLFSVLDR
SPRSVVDRDG FEEQLVRSET GIGVSAALDH
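
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below uses only the standard library and is an illustration, not the database's pipeline. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, so a standard codon table suffices here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a whitespace-separated DNA sequence; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 33 bases of the gene sequence above.
print(translate("ATGACCCCTT GGCGAACGCT GCCGTGGCTG"))      # MTPWRTLPWL
print(translate("GGGATCGGCG TCAGCGCGGC CCTGGATCAC TAG"))  # GIGVSAALDH*
```

The two fragments reproduce the first ten and last ten residues of the protein sequence, with the trailing `*` corresponding to the terminal TAG stop codon.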