Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0258 |
Symbol | |
ID | 4026347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 294696 |
End bp | 295955 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637965409 |
Product | extracellular solute-binding protein |
Protein accession | YP_572321 |
Protein GI | 92112393 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCGTC GATTGACTCT TCTTGCCGCC CTGGCCGGCG CCACCGTCGG TCAGGCCCAT GCCGCGGACT TGACCATTTC CTGTGGCGCG GTGGGAGCTG AGCTGACTCT CTGCAAGGAA GGCGTCTCCG CCTGGGAAGA GCAGACCGGT CACAGCGTCG CTGTTGTCTC GACGCCCAAT TCATCCACCG AACGTCTCTC GCTCTATCAG CAAATCCTGT CTGCACAATC GGGGGATATC GATATTATGC AGATCGATGT GGTCTGGCCT GGCCTGCTCG CCAACCACCT GCTCGATTTG CGTGAGATAG GCGGCAAGGA TATTGCCGAT GGCAATTTCC AGACGATCGT CGACAACAAT ACCGTCGATG GGCGGTTGGT GGCGATGCCC TGGTTTACCG ATGCCGGGGT TCTCTATTAC CGCAAGGATC TGCTCGATAA GTACGATCGA GAGGTGCCCG AGACCTGGCA GCAGATGACC GAGACCGCCG AGGCCATCCA GCAGGCTGAA CGCCAGGCCG ACAATGAGGA GATGTGGGGC TATGTCTTCC AAGGGCGTGC CTATGAGGGG CTGACCTGCA ATGCCCTGGA GTGGGTCGCC AGCCATGGCG GGGGCAGTAT CGTCGACGAA CAGGGCGAGA TCACCATCGA CAATCCGCGT GCCGCGGCTG CTCTGGATCA GGCTGCCGAG TGGGTCGGCC ATATCTCGCC CCAAGGCGTG CTCAATTACA CCGAAGAACA GGCCCGCGGC GTCTTCCAGT CCGGCAACGC AGTCTTCATG CGTAACTGGC CTTACGCTTG GTCGCTGGTG CAGAGCGAGA ATAGCGACGT GCGCGGCAAG GTCGGTGTCA CCACGTTGCC GCATGGCCCC AAAGGAAGCA GTGCCGCGAC CCTGGGGGGC TGGAACCTGG CAGTATCCAA ATACAGCAAG CATCCCGAGC TGGCTGTCGA TCTGGTCAAG TTCTTGACGT CCGAAGCCGA GCAGAAACGT CGTGCCCTCG AGGGTTCCTA CAACCCGACC ATCAAGTCGC TTTACCAGGA TAAGGAAGTC CTCGCCGCCG TGCCTTTCTT CGGCAAGCTC TACGATACCT TCACCAATGC CGTGGCACGG CCTTCGGCAC CGACCGGCAA CAAATACGGC CGCGTCAGCA ATGCCTTCTT CAACGCTGCC CACGACGTGC TTGCCGGCAA CATGAACGGT GCCCGGGCGG TATCGCAACT GCAGGACAAG CTCGAGCGCA TGAAGCGCCG CCGCTGGTAA
|
Protein sequence | MIRRLTLLAA LAGATVGQAH AADLTISCGA VGAELTLCKE GVSAWEEQTG HSVAVVSTPN SSTERLSLYQ QILSAQSGDI DIMQIDVVWP GLLANHLLDL REIGGKDIAD GNFQTIVDNN TVDGRLVAMP WFTDAGVLYY RKDLLDKYDR EVPETWQQMT ETAEAIQQAE RQADNEEMWG YVFQGRAYEG LTCNALEWVA SHGGGSIVDE QGEITIDNPR AAAALDQAAE WVGHISPQGV LNYTEEQARG VFQSGNAVFM RNWPYAWSLV QSENSDVRGK VGVTTLPHGP KGSSAATLGG WNLAVSKYSK HPELAVDLVK FLTSEAEQKR RALEGSYNPT IKSLYQDKEV LAAVPFFGKL YDTFTNAVAR PSAPTGNKYG RVSNAFFNAA HDVLAGNMNG ARAVSQLQDK LERMKRRRW
|
| |