Gene Information |
Locus tag | Csal_1193 |
Symbol | |
ID | 4027004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1368047 |
End bp | 1369114 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637966370 |
Product | extracellular solute-binding protein |
Protein accession | YP_573248 |
Protein GI | 92113320 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
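The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Number of amino acids for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length = gene_length(1368047, 1369114)
print(length)                  # 1068
print(protein_length(length))  # 355
```

Under these assumptions the values agree with the Gene Length (1068 bp) and Protein Length (355 aa) fields of the record.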
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00611018 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAA TGACCACAAG AATCGCCGCT GCACTCGGAG GCCTATTGCT GGCCGGCGGC ACCCAGGCTC AGGAAGACAA CAAGCTGTAC CTGTTCAACT GGACCCAATA CATGGACCCG GCCATCATCG AGGCCTTCGA GGAGAAATAC GACGCCGAGG TGGTGCAGAG CTACTACAAC TCGCTGCCCG AAATGTACGC CAAGCTCAAT GCTGGCGGGG TGTCGCAGTA CGACATCATC GTGCCGTCCA ACTATTACGT GCCACGGCTC ATCGAGACCG GCATGGTACA GAAGCTGGAC AAGTCCAAGA TCCCCAACCT CGACAACGTC ATGGAGCAGT TCGAGAATCC CAGCTACGAT CCGCAAAGCA CCTATTCGGC ACCTTACCAG TGGGGCGTGA CCGGTCTGGT ATACAATGCC GAGACGTTCC CCGACGCCCC CAAGAGCTGG TCGCTGATGT TCGATTCCGA GGTCAACTCG GGACATCCCT TCGCGCTGAT GGGTGATGGC CAGGTCACCA TGGGCGGCGC CTGTGCGTAC CTGGGCCACG GCTATGACTG CACCGACACC GAGGCCTGGA AGGAGGCCGC AAGACTGCTG ATCGACACCA AGAACCGCGA CAATTTCAGC GGCTTCGTCG ACGGGACGCC AAGCCTGCAG CAACTGGCAC GCGGCGTGAC CCACGCCGCC TTGAGCTATA ACGGCGACTA CCTCTTCTAT CGCCAGGAAA ATCCGGAGTC ATTCAAGAAC ATCAAGTTCA TGATTCCCGA CGAGGGGACC GAAATGTGGG TGGATACCAT GCTGATCCCT TCCAAGGCCC CCCACCCCGA TCTGGCCCAC AAGTTCATCA ACTTCATCCT GGATGCCAAG ATCGGCGCTC AGCTCTCCAA CTACAACTAC TACGCCAGCC CCAATGCGGA AGCTCAGCCC TATCTTGACG ACATCCTGAC CCAGCCTCCG ATCCAGCCGT CCGAGGAAGA CATGCAGCGC CTGCACTTCA CGCCCAGTCT CGAAGGCGAG CAACTGCAGG TCTTCCAGCA GCTTTGGTCA GAAGTCCAGT CACGCTGA
|
Protein sequence | MKTMTTRIAA ALGGLLLAGG TQAQEDNKLY LFNWTQYMDP AIIEAFEEKY DAEVVQSYYN SLPEMYAKLN AGGVSQYDII VPSNYYVPRL IETGMVQKLD KSKIPNLDNV MEQFENPSYD PQSTYSAPYQ WGVTGLVYNA ETFPDAPKSW SLMFDSEVNS GHPFALMGDG QVTMGGACAY LGHGYDCTDT EAWKEAARLL IDTKNRDNFS GFVDGTPSLQ QLARGVTHAA LSYNGDYLFY RQENPESFKN IKFMIPDEGT EMWVDTMLIP SKAPHPDLAH KFINFILDAK IGAQLSNYNY YASPNAEAQP YLDDILTQPP IQPSEEDMQR LHFTPSLEGE QLQVFQQLWS EVQSR
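The two sequences can be checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and it assumes the gene sequence is listed in coding orientation (it begins with ATG). It uses only the first 30 bases of the gene sequence to stay short.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 assigns
# the same amino acids as the standard code; it differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used to group residues in the record."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
gene_prefix = "ATGAAAACAA TGACCACAAG AATCGCCGCT"
print(translate(gene_prefix))  # MKTMTTRIAA
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.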