Gene Csal_1873 details


Gene Information

Locus tag: Csal_1873
Symbol:
ID: 4028236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2130202
End bp: 2131782
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 62%
IMG OID: 637967067
Product: extracellular solute-binding protein
Protein accession: YP_573924
Protein GI: 92113996
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
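
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence on this page ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon does not encode a residue


length_bp = gene_length(2130202, 2131782)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```

Both values agree with the Gene Length and Protein Length fields of this record.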


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.021803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGATA AGAAGACTCT CCTGGCGTCT TTGCTGGGCG CCTCCGTCGT GCTTCCCCTG 
CCCGCGCTGG CGCAAGAGGA TGCCGGCGAC CCCGTGACGC TTCGCATGGC CTACGATGCC
GACCCGGTAT CGCTGGATAT CCACGAACAA CTCTCCGGCG GCATTCTGCA GCTGTCTCAC
CTGACGCACG ACCCGCTCGT CCGCTGGACG AAAGACGTCA AGTTCGAGCC GCGCCTGGCC
ACCGATTGGG AACGCATCGA TGACACCACC ATGCGGTTCA CGCTGCGTGA CGGCGTCACG
TTCCACACCG GCAACGACTT CACCGCCAAG GATGTCGTGT GGACGATCGA GCGTCTCAAG
CGCAGTGCCG ATTTCAAGGC CATCTTCGAC CCCATCGCCT CGGCGAAAGC CATCGACGAG
CATACCGTCG AGATCAAGAC CCACAAGCCC TACCCGCTCG TTCTCAATCT CGCGACCTAT
ATTTTCCCCA TGGATAGCGA GTACTACACC GGCGAGACGG AAGATGGCGA TCCCAAGGAC
GAAATCGTCA AGAACGGCGA CTCCTTCGCC TCACGGCACT CGTCGGGCAC GGGCCCTTAC
GAGGTTGTGT CCCGCCAGCA AGGCGTCAAG GTCGAGTTCG AGCGCTTCGA CGACTACTGG
GATCAAGACT CTCCGGGCAA CGTCGACCGC ATCGTCCTGA CCCCGATCGG CGAGAATGCC
ACGCGTGTGT CGGCCCTGCT GTCCGGCGAT GTCGATTTCA TCGCGCCCGT GCCGCCCAAT
GACCTGGAGC GCGTCGAGGC CGACCAGAAC GTCGAACTGA CCACGATGTC GGGTACCCGC
ATCATCCTCA TGCAGCTCAA CCAGAAGCGT GTGGAAGCCT TCCAGGACCC GCGCGTCCGC
CAAGCCTTCA ACTATGCGGT CAACCAGGAG GCCATCGCCG ACCGCCTGAT GAAAGGCTTC
GCCACGCCCG CCGCGCAGCT GTCGCCCAAG GGCTACGACG GGTACAACGA CAGCCTGACA
CCGCGCTACG ACGTCGAGAA AGCCAAGGAA CTGATGAAAG AAGCCGGCTA CGAGGACGGT
TTCTCGGTTT CCATGATGGC GCCCAACAAC CGCTATGTGA ACGATGCCAA GATCGCACAG
GCGGTCGCCA CCATGCTGTC GCGCATCAAT GTCGACGTGG ACCTCAAGAC GCTGCCCAAG
GCCCAGTACT GGGGAGAGTT CGACGATCGC GCCGCGGATA TCATGATGAT CGGCTGGCAC
GCCGACACCG AGGATTCCGC CAACCTGTTC CAGTACCTCA CCGAGTGCCC GGACCCCGAG
ACCGGAGCCG GCCAGTACAA CGCGGCCAAC TACTGCAATC CGGAGCTCGA CGAGAAAGTG
GCGCAGGCCA ATGTCGAGAC GGACCGCGCC AAGCGCGCCG AGATGCTGCA GGCGGTCGAG
AAGGCGCTGT ACGAGGATGC GGCCTTCATG CCGTTGCATT GGCAGGATCT TGCCTGGGCG
TCGAAGAACA ACGTCAAGCT CGAGCCGGTG GTGAACGTCA TGAACTTCCC TTACCTCGGG
GATCTCGTGG TCGAGCAATA A
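
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. Below is a rough sketch of how the sequence could be cleaned and its GC fraction computed for comparison with the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed (six blocks of 10 bases).
first_row = "ATGATCGATA AGAAGACTCT CCTGGCGTCT TTGCTGGGCG CCTCCGTCGT GCTTCCCCTG"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this row only
```

Passing the full 1581 bp sequence through the same two functions gives a value that can be compared with the reported GC content.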
 
Protein sequence
MIDKKTLLAS LLGASVVLPL PALAQEDAGD PVTLRMAYDA DPVSLDIHEQ LSGGILQLSH 
LTHDPLVRWT KDVKFEPRLA TDWERIDDTT MRFTLRDGVT FHTGNDFTAK DVVWTIERLK
RSADFKAIFD PIASAKAIDE HTVEIKTHKP YPLVLNLATY IFPMDSEYYT GETEDGDPKD
EIVKNGDSFA SRHSSGTGPY EVVSRQQGVK VEFERFDDYW DQDSPGNVDR IVLTPIGENA
TRVSALLSGD VDFIAPVPPN DLERVEADQN VELTTMSGTR IILMQLNQKR VEAFQDPRVR
QAFNYAVNQE AIADRLMKGF ATPAAQLSPK GYDGYNDSLT PRYDVEKAKE LMKEAGYEDG
FSVSMMAPNN RYVNDAKIAQ AVATMLSRIN VDVDLKTLPK AQYWGEFDDR AADIMMIGWH
ADTEDSANLF QYLTECPDPE TGAGQYNAAN YCNPELDEKV AQANVETDRA KRAEMLQAVE
KALYEDAAFM PLHWQDLAWA SKNNVKLEPV VNVMNFPYLG DLVVEQ
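
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. Table 11 additionally permits alternative start codons, which are not handled here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard amino acid assignments in NCBI "TCAG" codon order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence shown on this page.
print(translate("ATGATCGATA AGAAGACTCT CCTGGCGTCT"))  # MIDKKTLLAS
print(translate("GTCGAGCAAT AA"))                     # VEQ
```

The two outputs match the start (MIDKKTLLAS) and the end (VEQ) of the protein sequence listed above.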