Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2416 |
Symbol | |
ID | 4026853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2711581 |
End bp | 2712834 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637967618 |
Product | extracellular solute-binding protein |
Protein accession | YP_574462 |
Protein GI | 92114534 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.230559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCA TCCATGCACT CGCCGCGGGG GCATTGCTGG TCACTGCCGT GGGGCAGGCC CAGGCCGCTC AAATCGAGGT ATTGCACTGG TGGACATCGG GGGGCGAAGC GAAAGCCGCC AACTTGCTCA AGGAAAAGCT CGAGGCCAAG GGGCATACCT GGAAGGACTT CGCGGTTGCG GGCGGCGCGG GCGACAGTGC CATGACGGTG CTCAAGTCGC GCGCGATTTC CGGCAACCCG CCGGCGGTGG CCCAGATCAA GGGACCCTTG ATCCAGGAAT GGGGCGAGAT GGGCTTTCTC GGCAATATCG ACAAGGCCGC CGAAGCCGAT GGCTGGGATG ATTTCCTGCC CCAGGAAATC GCGGCCTATG ACAAGGTCGA CGGCCATTAC GCCGCGGTGC CGGTCAATAT TCACCGCATC AACTGGATCT GGGCCAACCC GGAGGTGCTG AAGGCGTCGG GCGTCGAGGA AGTGCCGCAG ACCTGGGACG CCTTCTTCGA AGCCGCGGAC AAGATTCGCG AGGCGGGCTA CATTCCGCTC GCTCATGGTG GGCAGCCGTG GCAGGACGCG ACCGTGTTCG AAGTGGTCAT GATGGGCATC GGCGGCGGCG ACTTCTATCG CAAGGCGTTC GTCGAGCTGG ACCCCGAGGC GCTGACCAGC GACACCATGA TCGAGTCGCT CGAGACCTTC AAGAAGCTGC GCGGCACGAT GGACGACAAC ATCGCCGGAC GGGACTGGAA CATCGCCACC TCCATGGTCA TCAACGGCAA GGCGGCCATG CAGATCATGG GCGACTGGGC CAAGGGCGAG TTCACCGCCG CGGGCATGAC GCCGGGCGAG GACTACGAAT GTGTCGCACC GCCCATGACG GAGCACATGT TTTCGTACAA CACCGACAGC CTGGCGATGT TCGACGTCGA CGACGCAGGC CAGCAGCAGG CCCAGCTGGA TCTTGCGAGC CTGGTGCTGT CGCCCGACTT CCAGGCCAGC TTCAACCAGG CCAAGGGCTC GATTCCGGTG CGCCTGGACG TGCCGCTCGA CGACTTCGAC GCATGCGCCA AGGCATCTCG CGAGGCCTTC GATGTCGCCA TGGACGAAGG CGGACTGGTG CCCAGCCTGG CACACGGCAT GGCGGTATCG GACAGCCAGC AGGGCGCGGT GTTCGATGTC ATCACCAACT TCTTCAATGA CCCCGACATG ACGGCCGAAA CGGCCGCCGA ACGTCTGGTC AGCGCGGTGC GCGCGGCCGA GTGA
|
Protein sequence | MKTIHALAAG ALLVTAVGQA QAAQIEVLHW WTSGGEAKAA NLLKEKLEAK GHTWKDFAVA GGAGDSAMTV LKSRAISGNP PAVAQIKGPL IQEWGEMGFL GNIDKAAEAD GWDDFLPQEI AAYDKVDGHY AAVPVNIHRI NWIWANPEVL KASGVEEVPQ TWDAFFEAAD KIREAGYIPL AHGGQPWQDA TVFEVVMMGI GGGDFYRKAF VELDPEALTS DTMIESLETF KKLRGTMDDN IAGRDWNIAT SMVINGKAAM QIMGDWAKGE FTAAGMTPGE DYECVAPPMT EHMFSYNTDS LAMFDVDDAG QQQAQLDLAS LVLSPDFQAS FNQAKGSIPV RLDVPLDDFD ACAKASREAF DVAMDEGGLV PSLAHGMAVS DSQQGAVFDV ITNFFNDPDM TAETAAERLV SAVRAAE
|
| |