Gene Csal_0258 details

Gene Information

Locus tag: Csal_0258
Symbol:
ID: 4026347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 294696
End bp: 295955
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 60%
IMG OID: 637965409
Product: extracellular solute-binding protein
Protein accession: YP_572321
Protein GI: 92112393
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
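
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either point, but the listed values are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 294696
end_bp = 295955
gene_length_bp = 1260
protein_length_aa = 419


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1260
print(expected_protein_length(gene_length_bp))  # 419
```

Both results match the Gene Length and Protein Length fields.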


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCGTC GATTGACTCT TCTTGCCGCC CTGGCCGGCG CCACCGTCGG TCAGGCCCAT 
GCCGCGGACT TGACCATTTC CTGTGGCGCG GTGGGAGCTG AGCTGACTCT CTGCAAGGAA
GGCGTCTCCG CCTGGGAAGA GCAGACCGGT CACAGCGTCG CTGTTGTCTC GACGCCCAAT
TCATCCACCG AACGTCTCTC GCTCTATCAG CAAATCCTGT CTGCACAATC GGGGGATATC
GATATTATGC AGATCGATGT GGTCTGGCCT GGCCTGCTCG CCAACCACCT GCTCGATTTG
CGTGAGATAG GCGGCAAGGA TATTGCCGAT GGCAATTTCC AGACGATCGT CGACAACAAT
ACCGTCGATG GGCGGTTGGT GGCGATGCCC TGGTTTACCG ATGCCGGGGT TCTCTATTAC
CGCAAGGATC TGCTCGATAA GTACGATCGA GAGGTGCCCG AGACCTGGCA GCAGATGACC
GAGACCGCCG AGGCCATCCA GCAGGCTGAA CGCCAGGCCG ACAATGAGGA GATGTGGGGC
TATGTCTTCC AAGGGCGTGC CTATGAGGGG CTGACCTGCA ATGCCCTGGA GTGGGTCGCC
AGCCATGGCG GGGGCAGTAT CGTCGACGAA CAGGGCGAGA TCACCATCGA CAATCCGCGT
GCCGCGGCTG CTCTGGATCA GGCTGCCGAG TGGGTCGGCC ATATCTCGCC CCAAGGCGTG
CTCAATTACA CCGAAGAACA GGCCCGCGGC GTCTTCCAGT CCGGCAACGC AGTCTTCATG
CGTAACTGGC CTTACGCTTG GTCGCTGGTG CAGAGCGAGA ATAGCGACGT GCGCGGCAAG
GTCGGTGTCA CCACGTTGCC GCATGGCCCC AAAGGAAGCA GTGCCGCGAC CCTGGGGGGC
TGGAACCTGG CAGTATCCAA ATACAGCAAG CATCCCGAGC TGGCTGTCGA TCTGGTCAAG
TTCTTGACGT CCGAAGCCGA GCAGAAACGT CGTGCCCTCG AGGGTTCCTA CAACCCGACC
ATCAAGTCGC TTTACCAGGA TAAGGAAGTC CTCGCCGCCG TGCCTTTCTT CGGCAAGCTC
TACGATACCT TCACCAATGC CGTGGCACGG CCTTCGGCAC CGACCGGCAA CAAATACGGC
CGCGTCAGCA ATGCCTTCTT CAACGCTGCC CACGACGTGC TTGCCGGCAA CATGAACGGT
GCCCGGGCGG TATCGCAACT GCAGGACAAG CTCGAGCGCA TGAAGCGCCG CCGCTGGTAA
 
Protein sequence
MIRRLTLLAA LAGATVGQAH AADLTISCGA VGAELTLCKE GVSAWEEQTG HSVAVVSTPN 
SSTERLSLYQ QILSAQSGDI DIMQIDVVWP GLLANHLLDL REIGGKDIAD GNFQTIVDNN
TVDGRLVAMP WFTDAGVLYY RKDLLDKYDR EVPETWQQMT ETAEAIQQAE RQADNEEMWG
YVFQGRAYEG LTCNALEWVA SHGGGSIVDE QGEITIDNPR AAAALDQAAE WVGHISPQGV
LNYTEEQARG VFQSGNAVFM RNWPYAWSLV QSENSDVRGK VGVTTLPHGP KGSSAATLGG
WNLAVSKYSK HPELAVDLVK FLTSEAEQKR RALEGSYNPT IKSLYQDKEV LAAVPFFGKL
YDTFTNAVAR PSAPTGNKYG RVSNAFFNAA HDVLAGNMNG ARAVSQLQDK LERMKRRRW
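
As a rough consistency check between the gene and protein sequences, the sketch below translates a DNA string that uses the same 10-base grouping as this page. It is a minimal pure-Python illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons); the helper names are hypothetical. Only the first line of the gene sequence is used as input here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as shown on this page.
first_line = "ATGATTCGTC GATTGACTCT TCTTGCCGCC CTGGCCGGCG CCACCGTCGG TCAGGCCCAT"
print(translate(first_line))  # MIRRLTLLAALAGATVGQAH
```

The output corresponds to the first 20 residues of the protein sequence listed above (MIRRLTLLAA LAGATVGQAH).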