Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2771 |
Symbol | |
ID | 4028910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3102384 |
End bp | 3103322 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637967979 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_574817 |
Protein GI | 92114889 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | [TIGR03414] choline ABC transporter, periplasmic binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCATT CGCTTCGTCC CTTGCTCGTC GGACTGCCGC TCGCCGCGGG TATCGGCATG GCCATGCCGG CCCAGGCCGC AGAGCAGATA CGCTTCGGCG TGCCCTCGTG GCCAGGCATC ACCGTGAAGA CCGAGATCGC CAGCCAGCTT CTGGAACCGC TCGGCTACCA GGCGCGGACC GACGACGTCG GCCTTCAGGT CATCTATCAG GGCATGGAAA GCGGTGATCT GGATGTCTTC CTGGGCGGCT GGCTGCCCGC GCAGGAGCCG ATGCTCACGC CGCTGGAAGA CAAGGGAAGC GTGACGCGCA TCGCCAACAA CGTCGATGGC GCGCAAATGA CGCTGGCAGT GCCCGAGTAC GTCTACGAAC AGGGCGTGAC CTCGTTCGAG GATCTCGACG CACACCGCGA CATGTTCGAG GGGCAAATCC ACGGCTTCGG CGCCGGTTCG GCCGCCAGCG AGATCCTCAA CAAGGCGATC GACAACGACA CCTGGGGCCT CGGCGACTGG CAACTGGTGG ACACCAGCAC CGTGGGCATG CTCAGCGCCG CGCGTGACGC CATTTCCCGC GAGGAGCCCA TCGTCTGGGT CGGCTGGACG CCGCACTGGA TGAACCTCGA ACTGCCGATG CGCTACCTGG ATGACAGCAA GAACCTGTTC GGCGAAGGGA ACGGCGCCAG CCAGGTGCTC ACGCTGATGA GCGCCGATTA TGCCGAGGCG CATCCCAACC TGACGACGTT TTTCGAGAAC TTCACCTTCA CGGCGGAACA GCAGAGCTGG ATGATCAAGG CCTTCGGTCT CGACGAGAAG GAACTCGACA GCGTGGCCAC CACCTGGATT CAGGAACATC CCGAGCGCGT CGAAGCCATG CTCGACGGTG TCACCACCAC CGATGGCGAC GCTGCATGGC CGGTGGTGAA AGAGGCGCTG TCGCTGTAA
|
Protein sequence | MSHSLRPLLV GLPLAAGIGM AMPAQAAEQI RFGVPSWPGI TVKTEIASQL LEPLGYQART DDVGLQVIYQ GMESGDLDVF LGGWLPAQEP MLTPLEDKGS VTRIANNVDG AQMTLAVPEY VYEQGVTSFE DLDAHRDMFE GQIHGFGAGS AASEILNKAI DNDTWGLGDW QLVDTSTVGM LSAARDAISR EEPIVWVGWT PHWMNLELPM RYLDDSKNLF GEGNGASQVL TLMSADYAEA HPNLTTFFEN FTFTAEQQSW MIKAFGLDEK ELDSVATTWI QEHPERVEAM LDGVTTTDGD AAWPVVKEAL SL
|
| |