Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2999 |
Symbol | |
ID | 4028965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3336863 |
End bp | 3338512 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637968205 |
Product | extracellular solute-binding protein |
Protein accession | YP_575042 |
Protein GI | 92115114 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.305947 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCCAC TCGCGGCATC CGACACAACA ACAAGCGGAA CAGACGACAT GCTTCGACAC TCGCACACTC GGCTTGCCAC GACACTGCTA TCCTCCCTGT CGCTGGCGTC ACTGGCCCTG GCGGCGCCGG CCCAGGCCGA CACGCTGAAC ATCGGCGTCA TGGGGGAGCT GGCCTCGTTC GATACCTCGC AGGTTTCCGG CGGTGTCTGG GAATCGCAGA TCCTCATGGA CGTCTACGAA GGCCTGCTCA AGGAAAACCC CGAGGGCGAG GTGATGCCCG GCATGGCCAC CGACTGGGAC ATCTCCGAGG ACGGCAGGAC CTATACCTTT CATCTGCGCG AAGGCGCCAA ATGGTCCGAC GGCGCGCCGG TCACCGCCGA AGACTTCGTG TTCGGCTGGC AGCATCTGCT CGACCCGGCC AGCGCTTCGA AATACGCCTA CCTGCTCTAT CCGATAAAGA ATGCCGAAGC CGTCAACACG GGCGACAGGC CGCTCGACGC ACTCGGCGTC GAATCGCTGG ACGATGGCAA GACGCTCAAG GTGACACTCG ATGCCCCCAC GCCCTACTTC CTGCAGCTGC TGACCCACTA CACGGCCTAC CCGGTCCCCA AGCACGCCGT CGAGAAATAC GGCAAGCAGT GGGTCAAGAT GGACAACATC GTCACCAACG GCGCCTTCAC GCCCGTCGAG TGGGTCTCGC AATCGCGCAT CAGCGTCGAG AAGAATCCCG ACTACTACGA GGCCGACGAG GTCGAACTCG ACGGCGTCAA CTACTTCAAC ACCGAGGATC GCAATGCCGC CATCTCGCGC TTCCGCGCCG GCGAGCTGGA CATCGTCCGC GATTACCCCT CGAGCCGTTA CCAGTGGCTC GAGGACAACC TCCCCGAGGC CACCCACCTG AGCCCGATGC TGGGGTCCTA CTACTACGTG CTCAATACCC GCGAGGGGCG CCCGACCGCC GACAAGCGGG TCCGCGAGGC CCTGAACCTG GTCGCGCGCC GCAAGGTACT TTCCGAGCAG ATCATGGCCG GCAGTTTCAA GGATGCCTAC TCGCTGGTCC CGCCGGGCAC CAGCCATTAC GACGTCCAGC GCATGGACGG TGTCGATGGC GACTACCAGA AGCGCCTGGC CAGGGCCAAG CAACTGATCG AGGAGGCCGG CTACGGCCCC GACAACCCGC TGCACCTGCA ACTGCGCTAC AACACGTCCG ATGAGCACAA GAAGATCGCC ATCGCCCTGG CCGCGATGTG GAAGCCGCTG GGTGTCGACG TCGAGATGAC CAATGCCGAG GCCACCGTGC ACTACCAGAC CATCCAGCAA GGCGACTTCG ATATCGCCCG TGCCGGCTGG ATCGCCGACT ACAACGATGC CGAGAACTTC CTGACCTTGC TGCGCAGCGG CGTCGGCAAC AACTACGGCG GCTACGCCAA TCCCGAGTAC GACGCGCTGC TCGCTCAAGC CGCCACCGTT CGGGACCTCG ACGAGCGCGA GGCACTGCTC GAAAAAGCCG AGAACGTCGC CCTCGACGAC TACGCCCTCG TGCCGCTGCT CTATTACGTC ACTCGCAATC TGGTCAATCC CGATATCAGC GGCTGGCAGG ACAACGCCGA GGACGACCAT CCATCGCGCT GGGTGACGTT CACCGAGTAA
|
Protein sequence | MSPLAASDTT TSGTDDMLRH SHTRLATTLL SSLSLASLAL AAPAQADTLN IGVMGELASF DTSQVSGGVW ESQILMDVYE GLLKENPEGE VMPGMATDWD ISEDGRTYTF HLREGAKWSD GAPVTAEDFV FGWQHLLDPA SASKYAYLLY PIKNAEAVNT GDRPLDALGV ESLDDGKTLK VTLDAPTPYF LQLLTHYTAY PVPKHAVEKY GKQWVKMDNI VTNGAFTPVE WVSQSRISVE KNPDYYEADE VELDGVNYFN TEDRNAAISR FRAGELDIVR DYPSSRYQWL EDNLPEATHL SPMLGSYYYV LNTREGRPTA DKRVREALNL VARRKVLSEQ IMAGSFKDAY SLVPPGTSHY DVQRMDGVDG DYQKRLARAK QLIEEAGYGP DNPLHLQLRY NTSDEHKKIA IALAAMWKPL GVDVEMTNAE ATVHYQTIQQ GDFDIARAGW IADYNDAENF LTLLRSGVGN NYGGYANPEY DALLAQAATV RDLDEREALL EKAENVALDD YALVPLLYYV TRNLVNPDIS GWQDNAEDDH PSRWVTFTE
|
| |