Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2104 |
Symbol | |
ID | 4022586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 2354768 |
End bp | 2356327 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637962297 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_569240 |
Protein GI | 91976581 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1174] ABC-type proline/glycine betaine transport systems, permease component [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.882961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.496916 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTCT TCAGCGACCC ACGCTGGACC GAGGCGCTGG CCAATCTGCC GGACTATCTC GGCAGCCATG TCCGCGTCAG CGTCGCGGCG CTGGGGCTCG GCCTCGCCGT CAGCCTGCCG CTCGCGATCC TCGCGCGTCA CCGGCCGGTG CTGCGCGGCG CCCTGCTCGG CTTCGCCGGC ATCGTGCAGA CCATTCCGGG CCTCGCATTG CTGGCGTTGT TCTATCCTCT GCTGCTGGCA CTGGCGGCAC TGAGCGCGAG CACGCTCGGC GTCAGCTTCT CCGCGTTCGG CTTCCTGCCC GCGGTGCTGG CGCTGGCGCT GTATTCGATG CTGCCGGTGC TGCGCAACAC CATCACCGGG CTCGCCGGCG TCGACCCGGC GCTGATCGAG GCCGCACAGG GCGTCGGTAT GACTCCACGG CAGACGCTGA CGATGGTCGA GCTGCCACTG GCGCTGCCGG TGATCATGGC CGGCATCCGC ACCAGTGCGG TGTGGGTGAT CGGCACCGCG ACGCTGTCGA CGCCGATCGG TCAGACCAGC CTCGGCAACT ACATCTTTGC GGGACTGCAA ACCCAGAACT GGGTGCTGGT GCTGTTCGGC TGTGCCGCAG CCGCTGTGCT GGCGCTGGCG GTCGATCAAT TGTTGGCGCT GATCGAGACT GGCCTGCGGC TCCGCAGCCG GCTCCGCGTC GCGCTCGGCG GCCTCGGGCT CGCCGCGCTG GTGATCGCGA CGCTGGCGCC GTCGCTCGCG TCGTCGAAAT CGACCTATGT GGTCGGCGCC AAGACCTTCA CCGAGCAATA CGTCTTGGCG GCGCTGATTG CGCAGCAACT CGACGACCGA GGTCTTCCCG CCACGATCCG CGCCGGCCTC GGCTCCAGCG TAATCTACGA CGCGCTCAAA AGCGGCGATA TCGATGCTTA CGTCGATTAT TCCGGCACGC TGTGGGCCAA CCAGTTTCAC CACCCCGGCA TCGCCGCGCG GGACCAAGTC TTGAGCGATC TGAAAGGCGA CCTCGGCAGA GACGGCGTCA CGCTGCTCGG CGAACTCGGC TTCGCGAACG CCTATGCGCT GGTGATGCCG CGCGCCAGAG CGGAGGCGCT CGGCGTCCGG TCGATCGCTG ATCTCGCTGC GCGCGCGCCG CAGATGACGA TCGCTGGCGA CTATGAATTC TTCTCGCGCC CGGAATGGGC CGAGCTGCGC AGGACTTACG GGCTTTCGTT CAAGACCGAG CGGCAGATGC AGCCGGACTT CATGTATGCC GCCGTCGCCT CCGGCGAGGT CGATCTGATC GCGGGCTACA CCTCGGACGG GCTGATCGCG AAATACGATC TGGTCGTCCT CGACGATCCG CGACAGGCGA TTCCGCCTTA CGACGCCGTG CTGCTGGTGT CGCCGAAGCG CGCCGGTGAC ACGAAGCTGC GCGACGCGCT CAGCCCCCTC GTCGGCCGTA TCGACATCGC GGCGATGCGA GCGGCCAATC TCCGGGCCGC TTCCGGCGGC GGCACAGGCA CGCCGGATGT GGTGGCGCGT GCGCTGTGGG ACAGCGTGAA GCCGCGGTGA
|
Protein sequence | MSVFSDPRWT EALANLPDYL GSHVRVSVAA LGLGLAVSLP LAILARHRPV LRGALLGFAG IVQTIPGLAL LALFYPLLLA LAALSASTLG VSFSAFGFLP AVLALALYSM LPVLRNTITG LAGVDPALIE AAQGVGMTPR QTLTMVELPL ALPVIMAGIR TSAVWVIGTA TLSTPIGQTS LGNYIFAGLQ TQNWVLVLFG CAAAAVLALA VDQLLALIET GLRLRSRLRV ALGGLGLAAL VIATLAPSLA SSKSTYVVGA KTFTEQYVLA ALIAQQLDDR GLPATIRAGL GSSVIYDALK SGDIDAYVDY SGTLWANQFH HPGIAARDQV LSDLKGDLGR DGVTLLGELG FANAYALVMP RARAEALGVR SIADLAARAP QMTIAGDYEF FSRPEWAELR RTYGLSFKTE RQMQPDFMYA AVASGEVDLI AGYTSDGLIA KYDLVVLDDP RQAIPPYDAV LLVSPKRAGD TKLRDALSPL VGRIDIAAMR AANLRAASGG GTGTPDVVAR ALWDSVKPR
|
| |