Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1873 |
Symbol | |
ID | 4028236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2130202 |
End bp | 2131782 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637967067 |
Product | extracellular solute-binding protein |
Protein accession | YP_573924 |
Protein GI | 92113996 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.021803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATA AGAAGACTCT CCTGGCGTCT TTGCTGGGCG CCTCCGTCGT GCTTCCCCTG CCCGCGCTGG CGCAAGAGGA TGCCGGCGAC CCCGTGACGC TTCGCATGGC CTACGATGCC GACCCGGTAT CGCTGGATAT CCACGAACAA CTCTCCGGCG GCATTCTGCA GCTGTCTCAC CTGACGCACG ACCCGCTCGT CCGCTGGACG AAAGACGTCA AGTTCGAGCC GCGCCTGGCC ACCGATTGGG AACGCATCGA TGACACCACC ATGCGGTTCA CGCTGCGTGA CGGCGTCACG TTCCACACCG GCAACGACTT CACCGCCAAG GATGTCGTGT GGACGATCGA GCGTCTCAAG CGCAGTGCCG ATTTCAAGGC CATCTTCGAC CCCATCGCCT CGGCGAAAGC CATCGACGAG CATACCGTCG AGATCAAGAC CCACAAGCCC TACCCGCTCG TTCTCAATCT CGCGACCTAT ATTTTCCCCA TGGATAGCGA GTACTACACC GGCGAGACGG AAGATGGCGA TCCCAAGGAC GAAATCGTCA AGAACGGCGA CTCCTTCGCC TCACGGCACT CGTCGGGCAC GGGCCCTTAC GAGGTTGTGT CCCGCCAGCA AGGCGTCAAG GTCGAGTTCG AGCGCTTCGA CGACTACTGG GATCAAGACT CTCCGGGCAA CGTCGACCGC ATCGTCCTGA CCCCGATCGG CGAGAATGCC ACGCGTGTGT CGGCCCTGCT GTCCGGCGAT GTCGATTTCA TCGCGCCCGT GCCGCCCAAT GACCTGGAGC GCGTCGAGGC CGACCAGAAC GTCGAACTGA CCACGATGTC GGGTACCCGC ATCATCCTCA TGCAGCTCAA CCAGAAGCGT GTGGAAGCCT TCCAGGACCC GCGCGTCCGC CAAGCCTTCA ACTATGCGGT CAACCAGGAG GCCATCGCCG ACCGCCTGAT GAAAGGCTTC GCCACGCCCG CCGCGCAGCT GTCGCCCAAG GGCTACGACG GGTACAACGA CAGCCTGACA CCGCGCTACG ACGTCGAGAA AGCCAAGGAA CTGATGAAAG AAGCCGGCTA CGAGGACGGT TTCTCGGTTT CCATGATGGC GCCCAACAAC CGCTATGTGA ACGATGCCAA GATCGCACAG GCGGTCGCCA CCATGCTGTC GCGCATCAAT GTCGACGTGG ACCTCAAGAC GCTGCCCAAG GCCCAGTACT GGGGAGAGTT CGACGATCGC GCCGCGGATA TCATGATGAT CGGCTGGCAC GCCGACACCG AGGATTCCGC CAACCTGTTC CAGTACCTCA CCGAGTGCCC GGACCCCGAG ACCGGAGCCG GCCAGTACAA CGCGGCCAAC TACTGCAATC CGGAGCTCGA CGAGAAAGTG GCGCAGGCCA ATGTCGAGAC GGACCGCGCC AAGCGCGCCG AGATGCTGCA GGCGGTCGAG AAGGCGCTGT ACGAGGATGC GGCCTTCATG CCGTTGCATT GGCAGGATCT TGCCTGGGCG TCGAAGAACA ACGTCAAGCT CGAGCCGGTG GTGAACGTCA TGAACTTCCC TTACCTCGGG GATCTCGTGG TCGAGCAATA A
|
Protein sequence | MIDKKTLLAS LLGASVVLPL PALAQEDAGD PVTLRMAYDA DPVSLDIHEQ LSGGILQLSH LTHDPLVRWT KDVKFEPRLA TDWERIDDTT MRFTLRDGVT FHTGNDFTAK DVVWTIERLK RSADFKAIFD PIASAKAIDE HTVEIKTHKP YPLVLNLATY IFPMDSEYYT GETEDGDPKD EIVKNGDSFA SRHSSGTGPY EVVSRQQGVK VEFERFDDYW DQDSPGNVDR IVLTPIGENA TRVSALLSGD VDFIAPVPPN DLERVEADQN VELTTMSGTR IILMQLNQKR VEAFQDPRVR QAFNYAVNQE AIADRLMKGF ATPAAQLSPK GYDGYNDSLT PRYDVEKAKE LMKEAGYEDG FSVSMMAPNN RYVNDAKIAQ AVATMLSRIN VDVDLKTLPK AQYWGEFDDR AADIMMIGWH ADTEDSANLF QYLTECPDPE TGAGQYNAAN YCNPELDEKV AQANVETDRA KRAEMLQAVE KALYEDAAFM PLHWQDLAWA SKNNVKLEPV VNVMNFPYLG DLVVEQ
|
| |