Gene Csal_2089 details

Gene Information

Locus tag: Csal_2089
Symbol:
ID: 4026551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2358500
End bp: 2360281
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 61%
IMG OID: 637967288
Product: extracellular solute-binding protein
Protein accession: YP_574139
Protein GI: 92114211
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
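
The start, end, gene length and protein length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(2358500, 2360281)
print(length, protein_length_aa(length))  # 1782 593
```

Both results agree with the Gene Length and Protein Length fields of this record.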


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.115344
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAGGA CGGGACGACA ACAAAACCAC GAGGTCAATA TGAAACATAC AACAATGCGA 
CTCTCTCTAC TCGCGGCTGG CGTGATGCTG GCAAGCGGGG GACTGCACGC CGATGATCAA
GATACTCGCG CCATCGCCGA GCGCCTGGTC GATGAGCACT TCCAGGAGTC GACGCTGACA
CGCGAAGAGC AGATCGAGGA ATTGATGTGG TTCGCCAAGG CGGCCGAGCC TTTTAGAGGC
ATGGATATCA ACACGGTGGC GGAAGGCTTG ACCACGCACA TCTACGAGCG GGATGTGCTG
GCCGACGCGT TCACCGAGCT GACGGGTATC GAGGTGACGC ACAACATCAT CGGCGAGGGG
GATGTCGTCA ACAACATGCA GACCCAGATG CAGTCGGGTC GCAATATCTA TGACGGCTAC
GTCAACGATA CCGACTCCAT CGGTACGCAC ATTCGCTACG GCACCACCAT CAATCTTTCC
GACGCCATGG AAAACGAGTG GGCCGACTAT ACGTTGCCGA CCCTGGATCT CGATGACTTC
ATCGGCCTGC AGTATGGCAC CGGTCCGGAT GGCAGCGTCT ATCAATTGCC GACCCAGCAG
TTCGCCAATC TCTACTGGTT CCGCTATGAC TGGTTCCAGC GCGAGGATCT GCAGAAGCAG
TTCCGTGAGC TCTATGGCTA CGACCTGGGC GTGCCGACCA ACTGGACTGC CTACGAAGAC
ATCGCCGAGT TCTTCACCGA GCATGTCGGC GAGATCGATG GCGAGAAGGT CTATGGCCAC
ATGGACTATG GTCGTCGCGA TCCCTCGCTG GGCTGGCGTT TCCACGACTC CTGGCTCTCC
ATGGCGGGCA TGGGCAGCCC CGGCGTACCG TTCGGCAATC CGGTGGACGA CTGGGGGATT
CGCGTCAACG AGCAGAGCCA GCCGGTCGGG GCCAGCGTGT CGCGTGGCGG GGCCACCAAT
TCCCCGGCCT CGGTGTTCGC CGTCACCAAG ATGGTCGATT GGCTCGACAA GTATGCCCCA
CCCGAAGCCA GCGGCATGAC CTTTGGCGAA GCCGGCCCCG TGCCGGCCCA GGGCAATGTC
GCGCAGCAGA TCTTCTGGTA CACCGCTTTC ACGGCCGACA TGACCGACCC TGAACTGCCG
GTCACCGATG AAGAGGGCAA CCCCAAGTGG CGCATGGCGC CGTCCCCGAC AGGGCCTTAC
TGGGAAGAGG GTATGAAGGT CGGTTATCAA GACGTGGGGG CCTGGACCTT CTTCGACTCG
ACGCCTGAGG ATCGTCGCAC GGCTGCCTGG CTGTTCGCCC AGTTCACCGT CTCCAAGACA
GTGTCGCTGG AAAAGCTGAT GGCGGGGCTG ACGCCGATCC GCGAATCGGA CATCTTCTCC
GAACAGATGA CCGAGATGGC TCCCAAGCTG GGCGGTCTGG TGGAATTCTA TCGTAGCCCG
AACGAATCCA ACTGGACGCC GTCCTCCACC AACGTGCCGG ACTACCCGCG CATGGCGCCG
CTGTGGTGGC AGAACCTGTC GCCGGCGGTC AGTGGCGATA TCTCGCCCAA GGAGGCGCTC
GACAACCTGG CCAGGGATCT CGACAACATC ATGGCGCGCC TGGCGCGAGC CAAGGTCTTC
GATACCTATG CGCCCAACCT GAACGAGGAG CGTGATCCGC AGTACTGGCT CGATCAGCCG
GGTTCGCCGA AACCGAAGCT CGACGACGAG ATGCCGCAGG GCAAGACGGT TCCCTATGAC
GAAATGATGG AGGCGTGGAT GGCCGCCGGT TCTCGCGAAT GA
 
Protein sequence
MSRTGRQQNH EVNMKHTTMR LSLLAAGVML ASGGLHADDQ DTRAIAERLV DEHFQESTLT 
REEQIEELMW FAKAAEPFRG MDINTVAEGL TTHIYERDVL ADAFTELTGI EVTHNIIGEG
DVVNNMQTQM QSGRNIYDGY VNDTDSIGTH IRYGTTINLS DAMENEWADY TLPTLDLDDF
IGLQYGTGPD GSVYQLPTQQ FANLYWFRYD WFQREDLQKQ FRELYGYDLG VPTNWTAYED
IAEFFTEHVG EIDGEKVYGH MDYGRRDPSL GWRFHDSWLS MAGMGSPGVP FGNPVDDWGI
RVNEQSQPVG ASVSRGGATN SPASVFAVTK MVDWLDKYAP PEASGMTFGE AGPVPAQGNV
AQQIFWYTAF TADMTDPELP VTDEEGNPKW RMAPSPTGPY WEEGMKVGYQ DVGAWTFFDS
TPEDRRTAAW LFAQFTVSKT VSLEKLMAGL TPIRESDIFS EQMTEMAPKL GGLVEFYRSP
NESNWTPSST NVPDYPRMAP LWWQNLSPAV SGDISPKEAL DNLARDLDNI MARLARAKVF
DTYAPNLNEE RDPQYWLDQP GSPKPKLDDE MPQGKTVPYD EMMEAWMAAG SRE
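
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The sketch below is a hedged illustration rather than the tool that produced this record: it builds the codon table from the usual compact TCAG-ordered amino acid string and translates only the first and last lines of the gene sequence shown above. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for every codon in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    clean = "".join(dna.split()).upper()
    usable = len(clean) - len(clean) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[clean[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence in this record. The last line
# starts in frame because the 1740 bases before it are a multiple of 3.
first_line = "ATGTCCAGGA CGGGACGACA ACAAAACCAC GAGGTCAATA TGAAACATAC AACAATGCGA"
last_line = "GAAATGATGG AGGCGTGGAT GGCCGCCGGT TCTCGCGAAT GA"

print(translate(first_line))  # MSRTGRQQNHEVNMKHTTMR
print(translate(last_line))   # EMMEAWMAAGSRE*
```

The two outputs match the start and end of the protein sequence above, with the trailing `*` corresponding to the TGA stop codon.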