Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0633 |
Symbol | |
ID | 4025980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 710603 |
End bp | 711913 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637965804 |
Product | extracellular solute-binding protein |
Protein accession | YP_572694 |
Protein GI | 92112766 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.131479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCTTA CCTACGCTCT ACCCCTGGCC GGAGCCGCGA CACTGGCTTC GTTCGCCGCT CACGCCGAGA CCATCACCGT GGCCACGGTG AACAACAACG ACATGATCAT CATGCAGGGC CTGACCGACG AATTCGAAAA GGCTCACCCG GACATCGACC TGGAATGGGT GGTGCTCGAG GAAAACGTGT TGCGTCAGCG TCTGACCACC GACATCGCCA CCGACGGCGG CCAGTTCGAT GTCATGACCA TCGGGACGTA CGAGGTGCCG ATCTGGGCCA AGCAGGATTG GCTGGTCGAG CTCGACGACC TGCCCGAGAG CTACAACGAG CAGGATCTGC TCAAGCCGAT CCGCGACGGC CTGAGCCAGG ACGGTTCGCT CTATGCGCTG CCGTTCTACG GTGAAAGCTC GATGATGTAC TACCGCACCG ACCTGTTCGA GCAGGCCGGC ATCGAGATGC CCGAACAGCC GACCTGGGAG CAGGTCGAGG ACTGGGCGAG CCAGATCAAC GATCCCGACA ACGGCGTGTA TGGCATCTGC CTGCGTGGCA AGCCGGGCTG GGGCGAGAAC ATGGCGTTCG TCAGCACCCT GGTCAATACC TTCGGCGGTC GCTGGTTCGA CGAGGAATGG CATCCGGAAA TCAACTCGCC GGAGTGGAAG GAAGCGGTCG GTTTCTATGT CGACCTGATG AACAACTATG GCCCGCCGGG TGCGACCTCC AACGGCTTCA ACGAGAATCA GGCGCTGTTC TCCAGCGGCA AGTGCGGCAT GTGGGTCGAT GCCACGTCCG CTGCCGGACG TCTCTACAAT CCCGACGAGT CGCAGGTCGC CGACAAGCTC GGCTTCGCCC CGGCGCCGAT CGCCGAGACC CCGAAGGGCG CCAACTGGCT GTGGTCGTGG ACGCTGGCGA TTCCCGCCTC GTCGGATGCC AAGGACGCCG CCAGGACCTT CATTACCTGG GCGACCTCGC AGGACTACAT CGAGCTGGTA GGGGAAACCG AAGGCTGGAC CAGTGTGCCG CCGGGCACCC GTGAGTCCAC CTACGAGAAT CCCAAGTACC AGGAAGCTGC GCCGTTCGCC GACTTCGTGC TCAACGCCAT CCAGACCGCC GATCCCACCG ATTCGACGCT CAAGCCGAGT CCCTACATCG GCGTGCAGAC CGTCAACATC CCCGAGTTCC AGGCGGTAGG CACCCAGGTG GGACAGATGA TCGGGGCTGC ACTTGCCGGT CAACAGTCCG TCGATGCCGC GCTCGACCAG GCCCAGCGTT CGGTCGATCG CACCATGCGC CAGGCGGGAT ACTACGACTA A
|
Protein sequence | MRLTYALPLA GAATLASFAA HAETITVATV NNNDMIIMQG LTDEFEKAHP DIDLEWVVLE ENVLRQRLTT DIATDGGQFD VMTIGTYEVP IWAKQDWLVE LDDLPESYNE QDLLKPIRDG LSQDGSLYAL PFYGESSMMY YRTDLFEQAG IEMPEQPTWE QVEDWASQIN DPDNGVYGIC LRGKPGWGEN MAFVSTLVNT FGGRWFDEEW HPEINSPEWK EAVGFYVDLM NNYGPPGATS NGFNENQALF SSGKCGMWVD ATSAAGRLYN PDESQVADKL GFAPAPIAET PKGANWLWSW TLAIPASSDA KDAARTFITW ATSQDYIELV GETEGWTSVP PGTRESTYEN PKYQEAAPFA DFVLNAIQTA DPTDSTLKPS PYIGVQTVNI PEFQAVGTQV GQMIGAALAG QQSVDAALDQ AQRSVDRTMR QAGYYD
|
| |