Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0853 |
Symbol | |
ID | 4897933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 870816 |
End bp | 871742 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640111438 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001042736 |
Protein GI | 126461622 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | [TIGR03414] choline ABC transporter, periplasmic binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.564926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCCA TCCGACAGGC CCTGCTGGCC GCCGCCTCGA CCCTGGCCAT GGCCGGGACT GCCCTCGCCC AGGACCAGTG CCGCGAGGTG ACCTTCTCCG ACGTGGGCTG GACCGACATC ACCGTCACCA CCTCCGCCAC CCGTCAGGTG CTCGAGGCGC TCGGCTACGA GGTCGAGGTC GACATCCTCG GCGTGCCCGT GACCTACGCC TCGATGGACA AGGGCGACGT GGACGTGTTC CTCGGCACCT GGCTTCCCGC TCAGGAAAGC GCCATCGGCC CCTATCTCGA GAAGGGCAGC ATCGAGGAGA TCACGACGAA CCTCGAAGGC ACGAAATACA CGCTCGCGGT GCCGACCTAT CTCTACGACA AGGGCCTGAA GAGCTACGGC GACATCGCCA AGTTCAAGGA CGAGCTGGAA GGCAAGGTCT ACGGCATCGA GCCCGGCAAC GAAGGCAACG AATATCTCAT CAGCCTGACC GAGGCCGGAA AGCCGCTCGA AGGGTTCGAG GTCGTCCAGA GCTCCGAGCA GGGGATGCTG GCGCAGGTCG CCCGCTTCTA CCCGCAGGAG AAGGGCGTGG TCTTCCTCGG CTGGGAGCCC CACCCGATGA ACGCCACCTT CTCGCTGAAA TACCTGCCGG GCGGCGAGGA TTTCTTCGGC GAGGACGGCG TGGTGAAGAC CGTGGCGCGC AAGGGCTTCA AGGAAGACTG TCCGAACGTG ACCAAGATGC TGTCGCAGCA GAAATTCACC CTGCCCATGG AGAACGAGAT CATGGGCAAG ATCCTCGACG ACGGGATGGA GCCCGATGCG GCGGTGATGG AATGGCTGAA GGCCAACCCG GAGACCGTCG ATCCCTGGCT CGCCGGCGTG ACGACGGTGG ACGGCAAGGA TGCGCTGCCG GTGGTCAAGG AAGCCCTCGG CCTCTGA
|
Protein sequence | MTPIRQALLA AASTLAMAGT ALAQDQCREV TFSDVGWTDI TVTTSATRQV LEALGYEVEV DILGVPVTYA SMDKGDVDVF LGTWLPAQES AIGPYLEKGS IEEITTNLEG TKYTLAVPTY LYDKGLKSYG DIAKFKDELE GKVYGIEPGN EGNEYLISLT EAGKPLEGFE VVQSSEQGML AQVARFYPQE KGVVFLGWEP HPMNATFSLK YLPGGEDFFG EDGVVKTVAR KGFKEDCPNV TKMLSQQKFT LPMENEIMGK ILDDGMEPDA AVMEWLKANP ETVDPWLAGV TTVDGKDALP VVKEALGL
|
| |