Gene Information |
Locus tag | Csal_2356 |
Symbol | |
ID | 4027465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2646965 |
End bp | 2647849 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637967560 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_574404 |
Protein GI | 92114476 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
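The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2646965, 2647849)
print(length, protein_length(length))  # prints "885 294"
```

Under these assumptions the result agrees with the Gene Length (885 bp) and Protein Length (294 aa) fields.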
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.427621 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCT TGATGCCCCT GCTCACCGGC GCAGTGCTGG CCGGCAGCCT GGCCACCGCC CAGGCGAACG AAATCGTGGT CGGCGGCAAG AACTTCACCG AGCAGCAGAT CCTTTCCAGC ATGACCACCC AGTACCTGGA AGGCCTCGGC TACGACGTCG AGAAGCGCGC CGGGCTGGGC TCCGCCGTCC TCCGTCAGGC CCAGGAAAAC GGCCAGATCG ACCTCTACTG GGAATACACC GGCACCTCGC TGATCAACTA CAACGACGAA TCCGCCGAAG GGTTGACCGT CGACGAGACC TACCAGAAGG TCAAGGAACT CGACGCCGAG AAGGGCCTGG TATGGCTCGA GCCCTCCGAC GCCAACAACA CCTACGCGCT GGCCATGCGC AAGGAGAGCG TCGAGGAAAC CGGCATCACC ACGCTCAGCG ACCTGGCCCA GGCGGTCAAC GACGAGCAGG GGCTGACCTT CGCCATGAAC GCCGAATTCT ACGCCCGTGA AGACGGCTGG CGCCCGCTGC AGCAGGCCTA TGAATTCCGC GCCGGGCGTG GCGACGTCAA GCGTATGGAC TCCGGCCTCG TCTACCAGGC GCTGCGCGAC GAGCAGGTCG ACGTGGGGCT GGTGTTCGCC ACCGACGGCC GTATCCCCGC CTTCGACTTC CAGGTGCTCG AGGATGACCA GAACTTCTTC CCGGCCTACG CGCTGACACC AGTGGTGCGC CAGGCGACCC TGGACGCCAA CCCCGAGCTC GCCGAGCAGA TGAATACCCT CTCCGGCCTG CTCGACAACG ACACCATGTC CACCCTCAAC GCCCGGGTCG ATGTCGACAA GACATCCATC GAACGGGTCG CCGAGAACTT CCTCGAGGAA AACGACCTGC TGTAA
|
Protein sequence | MKTLMPLLTG AVLAGSLATA QANEIVVGGK NFTEQQILSS MTTQYLEGLG YDVEKRAGLG SAVLRQAQEN GQIDLYWEYT GTSLINYNDE SAEGLTVDET YQKVKELDAE KGLVWLEPSD ANNTYALAMR KESVEETGIT TLSDLAQAVN DEQGLTFAMN AEFYAREDGW RPLQQAYEFR AGRGDVKRMD SGLVYQALRD EQVDVGLVFA TDGRIPAFDF QVLEDDQNFF PAYALTPVVR QATLDANPEL AEQMNTLSGL LDNDTMSTLN ARVDVDKTSI ERVAENFLEE NDLL
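The relationship between the gene sequence, the GC content field and the protein sequence can be sketched in a few lines. This is an illustrative example, not the method used to generate the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons, which this sketch does not handle; the gene here starts with ATG, so that does not matter for this record.

```python
from itertools import product

# Standard codon assignments in NCBI base order T, C, A, G
# (translation table 11 uses the same amino acids for internal codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X for codons with unknown bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAAACCT TGATGCCCCT GCTCACCGGC"))  # prints "MKTLMPLLTG"
```

The ten residues printed match the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` allows the same comparison against the GC content and complete protein sequence fields.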