Gene Information |
Locus tag | Csal_2738 |
Symbol | |
ID | 4028790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3070439 |
End bp | 3071755 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637967946 |
Product | extracellular solute-binding protein |
Protein accession | YP_574784 |
Protein GI | 92114856 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.345118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGTG CTTTGCTGCT TTCATCACTT GGCACCGTCC TCGCGCTGGG AACCGGCATG GGAAGTGCCC AGGCCAATAC CGAGATCACC TGGTGGCATG CCATGGGCGG TGCGCTGGGG GACAAGGTCG AGCAGATCGC GGCCGACTTC AATGCCAGTC AGGATGCCTA TACCGTCACG CCCGTCTTCA AGGGCAATTA CAGCGAGACC ATGACTTCCG CGATCGCCGC ATACCGTGCC GACAAGGGGC CGGATATCGT GCAGATCTAC GAAGTCGGTA CGGCGACGAT GATGGCCGCC AAGGGGGCCA TCGTACCGGT GCATCGTTTG ATGGCATCGG CCGATGTCGA TTTCGATCCC CAGGCCTACC TGCCTGCGGT CACGGGTTAT TACACCGACC CCGAGGGCAA CATGCTGTCG TTGCCCTTCA ATTCTTCGAC GCCGGTGACG TACTACAACC GCGAGCGGCT GGCGCAGGCG GGGGTCGAGG AGATCCCCCG TACCTGGCAG GAACTGGGCG ACGCGCTGGA AAAGATCGTC GACAGCGGGG CGGCGAGCTG CGGACTGACC ACGACCTGGC CTTCCTGGGT CATGCTCGAG AACTATTCCG CCATCAACGA CGTGCCGTTT GCCTCGCGGG CCAATGGCTT CGAGGGCACG GATGCGCGGC TGCGCTTCAA TCGCACCGCC GTCGTCGACC ACATCGAGCG CCTCACGCGC TGGCAGGAGG ACGGGCGCTT CGCCTATGGC GGACGCTTCG ATGACGCCGC GCCCAAGTTC TACGCGGGCG AGTGCGCGTT GATGATGGGA TCCTCGGCCT CCTACGCCAA TATCAAGGAA AATGCCGATT TCGATTTCGG CGTGGCGCCG CTGCCCTACG ATGCCGAGGT GGTCGAGCAG GCCAACAATT CGATCATCGG CGGGGCCTCG CTGTGGGTGC TCAACGGGCT GGACGAGACG CACCGTCAGG GAGTGGCCGA ATTCTTCGAA TACCTGTCCA CGCCCGAGGT CCAGGCGGAC TGGCATCAGT ACTCCGGCTA CCTGCCGATC ACGCAGGCGG CGGCCGACCT GACCCGCGAA CAAGGCTTCT ACGCGGAGCA TCCCGGCACC GACGTGGCGA TCGAGCAGAT CACGGCCGGC CAGCCCACCG ACAATTCCAA GGGCTTGCGG CTGGGCAACA TGGTGCAGAT TCGCGACATC ATCAACGGTG CGCTGGAAAA CGTCTTCGCA GGCGATGTAT CGCCGCAGGA CGGCCTCGAT CAGGCCGCCG CGCGTGGCAA CGAGCTGCTG GAAAAGTTCG AACGTGCCAA TCGCTGA
|
Protein sequence | MQRALLLSSL GTVLALGTGM GSAQANTEIT WWHAMGGALG DKVEQIAADF NASQDAYTVT PVFKGNYSET MTSAIAAYRA DKGPDIVQIY EVGTATMMAA KGAIVPVHRL MASADVDFDP QAYLPAVTGY YTDPEGNMLS LPFNSSTPVT YYNRERLAQA GVEEIPRTWQ ELGDALEKIV DSGAASCGLT TTWPSWVMLE NYSAINDVPF ASRANGFEGT DARLRFNRTA VVDHIERLTR WQEDGRFAYG GRFDDAAPKF YAGECALMMG SSASYANIKE NADFDFGVAP LPYDAEVVEQ ANNSIIGGAS LWVLNGLDET HRQGVAEFFE YLSTPEVQAD WHQYSGYLPI TQAAADLTRE QGFYAEHPGT DVAIEQITAG QPTDNSKGLR LGNMVQIRDI INGALENVFA GDVSPQDGLD QAAARGNELL EKFERANR
|
| |
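Note: the coordinate, length, and GC content fields above can be cross-checked with a few lines of arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names (gene_length, protein_length, gc_percent) are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq):
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces and other separators (such as the 10-base grouping used in
    the record) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record for Csal_2738.
length = gene_length(3070439, 3071755)
print(length)                  # 1317, matching the Gene Length field
print(protein_length(length))  # 438, matching the Protein Length field
```

Passing the Gene sequence field to gc_percent gives a value to compare against the reported GC content; the record appears to round that figure to a whole percent.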