Gene Information |
Locus tag | Csal_0537 |
Symbol | |
ID | 4027676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 595266 |
End bp | 596204 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637965705 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_572598 |
Protein GI | 92112670 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | [TIGR03414] choline ABC transporter, periplasmic binding protein |
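The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(595266, 596204)
print(length)                  # 939
print(protein_length(length))  # 312
```

Both values match the Gene Length (939 bp) and Protein Length (312 aa) fields of this record.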
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGCA CCACCACGCG GCTGATGGCC GCCCTTGCCC TGCTGCCCCT GGCAGCCCAC GCCCATGCCA CCGACGAGGC CACCCAGGTC TCCTTCGTCG CGCCGCCGTG GCCCGGCGTC ACGGTCAAGA CCGAAATCAT GGCGCAGATC CTGGCCCCGC TGGGCTATAC CAACGAGCGC CAGGAGCTCA GCAGCACGGT GGGCTACAAG ACCTTGCAAA CCGGCGACAG CGACGTCTTC CTCGCCGGCT GGCTCCCCGC CCAGCAGGAC AGCTACGACG CCGCGATGGA CGACGGCACC ATCGTCGATC TGGGCAACAA CGTCACCGGC GCGCGCATGG GCTTCGCGGT CCCCGGCTAT GTCTTCGATG CCGGCGTGAC CAGCGCCGAG CAGCTCGACG ATCCCGAGAA CCGCGAACGT TTCGAGGGCC GCTACTATTC CATCGAGTCG GGCTCGACGG TCAGCGACTT CATCAACGAC GCCAAACGCA ACGACACCTA CGGCCTGGGC GACTGGGAAC TACTGGAGTC CTCGACCCCC GGCATGCTCA GCGCCGTGCG CAGCGCCTAC GACGAGAAGC GCTGGGTCGC CTTCTACGGC TGGACGCCGC ACTGGATGGT GCCGGAATTC GACATGCATA TCCTCGACGA CCCTGAAGGC ATCTATGGCG AGAACCAGGG CCGCAGCGAC GTGCGCACCA TCGTCGCCAA GTCCTTCAGC GAGGCCAACC CCAACCTCAT GCGGCTGCTC GACCAGTTCG TGCTGACCGC CGAGCAGCAA AGCGACTTCA TTCGCGAATA CGGCCTGGAA GAACGCGAGC TCGAGACGGT GGCCCGCGAG TGGCTGCAAG CGCACCCCGA TGAACTGGCC GACTTCCTGG AAGGCGTCAC CACCCGTGAC GGCGAGCCCG GCCTGGCCGC CGTCAAGGCC AGCCTGTAA
|
Protein sequence | MRRTTTRLMA ALALLPLAAH AHATDEATQV SFVAPPWPGV TVKTEIMAQI LAPLGYTNER QELSSTVGYK TLQTGDSDVF LAGWLPAQQD SYDAAMDDGT IVDLGNNVTG ARMGFAVPGY VFDAGVTSAE QLDDPENRER FEGRYYSIES GSTVSDFIND AKRNDTYGLG DWELLESSTP GMLSAVRSAY DEKRWVAFYG WTPHWMVPEF DMHILDDPEG IYGENQGRSD VRTIVAKSFS EANPNLMRLL DQFVLTAEQQ SDFIREYGLE ERELETVARE WLQAHPDELA DFLEGVTTRD GEPGLAAVKA SL
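The sequences above are displayed in space-separated groups of 10 characters, so they need to be joined before any analysis. The sketch below strips that display spacing and computes a GC percentage; the helper names are hypothetical, and the GC definition used here (G plus C over total length) is an assumption about how the GC content field is derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display groups of the gene sequence, used as a small example.
example = clean_sequence("ATGCGACGCA CCACCACGCG")
print(len(example))         # 20
print(gc_percent(example))  # 70.0
```

Applied to the full gene sequence, the same function should give a value close to the reported 66%, provided the database uses the same definition.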