Gene Information |
Locus tag | Csal_2089 |
Symbol | |
ID | 4026551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2358500 |
End bp | 2360281 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637967288 |
Product | extracellular solute-binding protein |
Protein accession | YP_574139 |
Protein GI | 92114211 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
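The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2358500, 2360281)
print(length, protein_length_aa(length))  # prints: 1782 593
```

Both values match the Gene Length and Protein Length rows of the record.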
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.115344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGGA CGGGACGACA ACAAAACCAC GAGGTCAATA TGAAACATAC AACAATGCGA CTCTCTCTAC TCGCGGCTGG CGTGATGCTG GCAAGCGGGG GACTGCACGC CGATGATCAA GATACTCGCG CCATCGCCGA GCGCCTGGTC GATGAGCACT TCCAGGAGTC GACGCTGACA CGCGAAGAGC AGATCGAGGA ATTGATGTGG TTCGCCAAGG CGGCCGAGCC TTTTAGAGGC ATGGATATCA ACACGGTGGC GGAAGGCTTG ACCACGCACA TCTACGAGCG GGATGTGCTG GCCGACGCGT TCACCGAGCT GACGGGTATC GAGGTGACGC ACAACATCAT CGGCGAGGGG GATGTCGTCA ACAACATGCA GACCCAGATG CAGTCGGGTC GCAATATCTA TGACGGCTAC GTCAACGATA CCGACTCCAT CGGTACGCAC ATTCGCTACG GCACCACCAT CAATCTTTCC GACGCCATGG AAAACGAGTG GGCCGACTAT ACGTTGCCGA CCCTGGATCT CGATGACTTC ATCGGCCTGC AGTATGGCAC CGGTCCGGAT GGCAGCGTCT ATCAATTGCC GACCCAGCAG TTCGCCAATC TCTACTGGTT CCGCTATGAC TGGTTCCAGC GCGAGGATCT GCAGAAGCAG TTCCGTGAGC TCTATGGCTA CGACCTGGGC GTGCCGACCA ACTGGACTGC CTACGAAGAC ATCGCCGAGT TCTTCACCGA GCATGTCGGC GAGATCGATG GCGAGAAGGT CTATGGCCAC ATGGACTATG GTCGTCGCGA TCCCTCGCTG GGCTGGCGTT TCCACGACTC CTGGCTCTCC ATGGCGGGCA TGGGCAGCCC CGGCGTACCG TTCGGCAATC CGGTGGACGA CTGGGGGATT CGCGTCAACG AGCAGAGCCA GCCGGTCGGG GCCAGCGTGT CGCGTGGCGG GGCCACCAAT TCCCCGGCCT CGGTGTTCGC CGTCACCAAG ATGGTCGATT GGCTCGACAA GTATGCCCCA CCCGAAGCCA GCGGCATGAC CTTTGGCGAA GCCGGCCCCG TGCCGGCCCA GGGCAATGTC GCGCAGCAGA TCTTCTGGTA CACCGCTTTC ACGGCCGACA TGACCGACCC TGAACTGCCG GTCACCGATG AAGAGGGCAA CCCCAAGTGG CGCATGGCGC CGTCCCCGAC AGGGCCTTAC TGGGAAGAGG GTATGAAGGT CGGTTATCAA GACGTGGGGG CCTGGACCTT CTTCGACTCG ACGCCTGAGG ATCGTCGCAC GGCTGCCTGG CTGTTCGCCC AGTTCACCGT CTCCAAGACA GTGTCGCTGG AAAAGCTGAT GGCGGGGCTG ACGCCGATCC GCGAATCGGA CATCTTCTCC GAACAGATGA CCGAGATGGC TCCCAAGCTG GGCGGTCTGG TGGAATTCTA TCGTAGCCCG AACGAATCCA ACTGGACGCC GTCCTCCACC AACGTGCCGG ACTACCCGCG CATGGCGCCG CTGTGGTGGC AGAACCTGTC GCCGGCGGTC AGTGGCGATA TCTCGCCCAA GGAGGCGCTC GACAACCTGG CCAGGGATCT CGACAACATC ATGGCGCGCC TGGCGCGAGC CAAGGTCTTC GATACCTATG CGCCCAACCT GAACGAGGAG CGTGATCCGC AGTACTGGCT CGATCAGCCG GGTTCGCCGA AACCGAAGCT CGACGACGAG ATGCCGCAGG GCAAGACGGT TCCCTATGAC GAAATGATGG AGGCGTGGAT GGCCGCCGGT TCTCGCGAAT GA
|
Protein sequence | MSRTGRQQNH EVNMKHTTMR LSLLAAGVML ASGGLHADDQ DTRAIAERLV DEHFQESTLT REEQIEELMW FAKAAEPFRG MDINTVAEGL TTHIYERDVL ADAFTELTGI EVTHNIIGEG DVVNNMQTQM QSGRNIYDGY VNDTDSIGTH IRYGTTINLS DAMENEWADY TLPTLDLDDF IGLQYGTGPD GSVYQLPTQQ FANLYWFRYD WFQREDLQKQ FRELYGYDLG VPTNWTAYED IAEFFTEHVG EIDGEKVYGH MDYGRRDPSL GWRFHDSWLS MAGMGSPGVP FGNPVDDWGI RVNEQSQPVG ASVSRGGATN SPASVFAVTK MVDWLDKYAP PEASGMTFGE AGPVPAQGNV AQQIFWYTAF TADMTDPELP VTDEEGNPKW RMAPSPTGPY WEEGMKVGYQ DVGAWTFFDS TPEDRRTAAW LFAQFTVSKT VSLEKLMAGL TPIRESDIFS EQMTEMAPKL GGLVEFYRSP NESNWTPSST NVPDYPRMAP LWWQNLSPAV SGDISPKEAL DNLARDLDNI MARLARAKVF DTYAPNLNEE RDPQYWLDQP GSPKPKLDDE MPQGKTVPYD EMMEAWMAAG SRE
|
| |
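The gene and protein sequences above can be cross-checked with a minimal translation sketch. It assumes the gene sequence is already given in coding orientation (the record lists the minus strand, but the sequence starts with ATG), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the space-separated 10-base blocks are simply stripped before processing.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGTCCAGGA CGGGACGACA ACAAAACCAC"))  # prints: MSRTGRQQNH
```

The printed residues match the first block of the protein sequence. Running `translate` and `gc_fraction` on the full gene sequence would allow the same comparison against the complete protein sequence and the listed GC content.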