Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0220 |
Symbol | |
ID | 4027303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 246764 |
End bp | 247765 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637965371 |
Product | periplasmic solute binding protein |
Protein accession | YP_572283 |
Protein GI | 92112355 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.732372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAAGT CTTTCACCTT ACTGCTCGGT GCCGCCGCTC TGACGCTGTC CGGTGCGGCA CGCGCCGAGG GTCCGATGAG CGTCGTCGCC AGCTTCAGCA TTCTCGGCGA CATGGTCGAG GAAGTCGGTG GCGAGCATGT CGACGTCACC ACGCTGGTCG GTCCGGACGG CGACGCCCAT GTCTTCTCCC CCAGCCCCAC CGACGCACGC GCTGTCGGCG AGGCGGACCT GTTCGTCGTC AACGGGCTGC ATTTCGAAGG CTGGCTGGAC CGCCTGGTGG AAGCCAGCGG CTACGAGGGG CCGGTGGTCG TGGCAAGCCG GGGCATCGAT GCCCTGAGCT TCGACGAGGA GCGCGAAGAG CACTCTTCCG ATCATGAAGG TCACGACCAC GCCACAGGCC ACGATCATGA CCACGACCAC GACCACGACC ACGACCACGA CCACGACCAC AGTGAGCATG CGGGCCACGA CCACGGTCCG GAAGACCCGC ACGCCTGGCA GGACCTGCAA AACGGCAAGC AGTACGTGGC CAACATCCGC GACGCGCTCG TCGCGGCAGA CCCCGAGCAT GCCGCTGACT ATCGCCGCAA TGCCGAGCAA TACGTCGAGG CCATGGATAC GCTGGATGCC GAGGTCCATC GTCGGATCGG CGCGATTCCC GAGGCCAATC GCGTGCTGAT CACCAACCAC GATGCCTTCG GCTATTTCGC CAACGCCTAT GGGCTGGACG TGCTCTCGCC GGTCGGCCTC TCCACGGCCG CCGAGCCCAG CGCCGCCGGC ATGGCCAAGC TGATCGAACA GATCCAGGCA CGCAACGTCA AGGCACTGTT CCTGGAAAAC ATGACCAGCC CCGCCCTGCT CGAGCAGCTG GCCGACGAAA CCGGGGTGAC CATCGGAGGC ACGCTCTACG CCGGCGCCCT GGCGGCCGAG GGCGAAGCCA GCACCTACCT CGGCATGTTC CGTCACAATG TCGATACGCT GACCGAGGCC TTGAAGGACT GA
|
Protein sequence | MSKSFTLLLG AAALTLSGAA RAEGPMSVVA SFSILGDMVE EVGGEHVDVT TLVGPDGDAH VFSPSPTDAR AVGEADLFVV NGLHFEGWLD RLVEASGYEG PVVVASRGID ALSFDEEREE HSSDHEGHDH ATGHDHDHDH DHDHDHDHDH SEHAGHDHGP EDPHAWQDLQ NGKQYVANIR DALVAADPEH AADYRRNAEQ YVEAMDTLDA EVHRRIGAIP EANRVLITNH DAFGYFANAY GLDVLSPVGL STAAEPSAAG MAKLIEQIQA RNVKALFLEN MTSPALLEQL ADETGVTIGG TLYAGALAAE GEASTYLGMF RHNVDTLTEA LKD
|
| |