Gene Csal_0633 details


Gene Information

Locus tag: Csal_0633
Symbol:
ID: 4025980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 710603
End bp: 711913
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 63%
IMG OID: 637965804
Product: extracellular solute-binding protein
Protein accession: YP_572694
Protein GI: 92112766
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
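
The start, end, and length fields in this record agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. A minimal sketch of that arithmetic follows; both conventions are assumptions, and the function names are illustrative rather than part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(710603, 711913)
print(length)                  # 1311
print(protein_length(length))  # 436
```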


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.131479
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCTTA CCTACGCTCT ACCCCTGGCC GGAGCCGCGA CACTGGCTTC GTTCGCCGCT 
CACGCCGAGA CCATCACCGT GGCCACGGTG AACAACAACG ACATGATCAT CATGCAGGGC
CTGACCGACG AATTCGAAAA GGCTCACCCG GACATCGACC TGGAATGGGT GGTGCTCGAG
GAAAACGTGT TGCGTCAGCG TCTGACCACC GACATCGCCA CCGACGGCGG CCAGTTCGAT
GTCATGACCA TCGGGACGTA CGAGGTGCCG ATCTGGGCCA AGCAGGATTG GCTGGTCGAG
CTCGACGACC TGCCCGAGAG CTACAACGAG CAGGATCTGC TCAAGCCGAT CCGCGACGGC
CTGAGCCAGG ACGGTTCGCT CTATGCGCTG CCGTTCTACG GTGAAAGCTC GATGATGTAC
TACCGCACCG ACCTGTTCGA GCAGGCCGGC ATCGAGATGC CCGAACAGCC GACCTGGGAG
CAGGTCGAGG ACTGGGCGAG CCAGATCAAC GATCCCGACA ACGGCGTGTA TGGCATCTGC
CTGCGTGGCA AGCCGGGCTG GGGCGAGAAC ATGGCGTTCG TCAGCACCCT GGTCAATACC
TTCGGCGGTC GCTGGTTCGA CGAGGAATGG CATCCGGAAA TCAACTCGCC GGAGTGGAAG
GAAGCGGTCG GTTTCTATGT CGACCTGATG AACAACTATG GCCCGCCGGG TGCGACCTCC
AACGGCTTCA ACGAGAATCA GGCGCTGTTC TCCAGCGGCA AGTGCGGCAT GTGGGTCGAT
GCCACGTCCG CTGCCGGACG TCTCTACAAT CCCGACGAGT CGCAGGTCGC CGACAAGCTC
GGCTTCGCCC CGGCGCCGAT CGCCGAGACC CCGAAGGGCG CCAACTGGCT GTGGTCGTGG
ACGCTGGCGA TTCCCGCCTC GTCGGATGCC AAGGACGCCG CCAGGACCTT CATTACCTGG
GCGACCTCGC AGGACTACAT CGAGCTGGTA GGGGAAACCG AAGGCTGGAC CAGTGTGCCG
CCGGGCACCC GTGAGTCCAC CTACGAGAAT CCCAAGTACC AGGAAGCTGC GCCGTTCGCC
GACTTCGTGC TCAACGCCAT CCAGACCGCC GATCCCACCG ATTCGACGCT CAAGCCGAGT
CCCTACATCG GCGTGCAGAC CGTCAACATC CCCGAGTTCC AGGCGGTAGG CACCCAGGTG
GGACAGATGA TCGGGGCTGC ACTTGCCGGT CAACAGTCCG TCGATGCCGC GCTCGACCAG
GCCCAGCGTT CGGTCGATCG CACCATGCGC CAGGCGGGAT ACTACGACTA A
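
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before its length or GC content can be checked against the fields in the record. A minimal sketch of that clean-up follows. The helper names are hypothetical, and how the database rounds its GC percentage is not stated here, so the function returns the unrounded value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short illustration.
first_line = "ATGCGCCTTA CCTACGCTCT ACCCCTGGCC GGAGCCGCGA CACTGGCTTC GTTCGCCGCT"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Applying the same two functions to the full sequence gives the values to compare with the Gene Length and GC content fields.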
 
Protein sequence
MRLTYALPLA GAATLASFAA HAETITVATV NNNDMIIMQG LTDEFEKAHP DIDLEWVVLE 
ENVLRQRLTT DIATDGGQFD VMTIGTYEVP IWAKQDWLVE LDDLPESYNE QDLLKPIRDG
LSQDGSLYAL PFYGESSMMY YRTDLFEQAG IEMPEQPTWE QVEDWASQIN DPDNGVYGIC
LRGKPGWGEN MAFVSTLVNT FGGRWFDEEW HPEINSPEWK EAVGFYVDLM NNYGPPGATS
NGFNENQALF SSGKCGMWVD ATSAAGRLYN PDESQVADKL GFAPAPIAET PKGANWLWSW
TLAIPASSDA KDAARTFITW ATSQDYIELV GETEGWTSVP PGTRESTYEN PKYQEAAPFA
DFVLNAIQTA DPTDSTLKPS PYIGVQTVNI PEFQAVGTQV GQMIGAALAG QQSVDAALDQ
AQRSVDRTMR QAGYYD
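
The protein sequence corresponds to translating the gene sequence with translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs from it only in the set of permitted start codons. A minimal pure-Python sketch of that translation follows. The `translate` helper is hypothetical, and it does not handle alternative start codons, which is not needed here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any incomplete codon at the end of the sequence.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCGCCTTA CCTACGCTCT ACCCCTGGCC"))  # MRLTYALPLA
```

Passing the full gene sequence should reproduce the 436-residue protein listed above, with the terminal TAA codon ending the translation.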