Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3337 |
Symbol | |
ID | 3911139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3817274 |
End bp | 3818833 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637885240 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_486944 |
Protein GI | 86750448 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1174] ABC-type proline/glycine betaine transport systems, permease component [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.917916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.224892 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGT TCGGCGATCC GCGCTGGAAC GAGGCGCTGG CCAATCTGCC GGACACGCTC GGCGCGCATG TCCGCGTCAG CCTCGCCGCG CTGGCGCTCG GCCTCGTCGT CAGCCTGCCG CTGGCGATTG TTGCGCGACG CCGCCCGATC CTGCGCGGCG CCCTGCTCGG CTTCGCCGGC ATCGTCCAGA CCATCCCCGG CCTCGCTCTA CTGGCGCTGT TCTATCCGCT GCTGCTGGCG CTGGCGGCGT TGAGTCTCCG CGCCTTCGGC GTCGGCTTCT CGGCGTTCGG CTTCCTGCCC GCGGTGCTGG CGCTGGCGCT GTATGCGATG CTGCCGGTGC TGCGCAACAC CATCACCGGG CTCGACGGCG TCGATCCGGC GCTCGTCGAG GCGGCGCAGG GCGTCGGCAT GACGCCGCGG CAGGTGTTGA CGATGGTCGA GGTACCGCTG GCGCTGCCGG TGATCATGGC CGGCATTCGC ACAGCCGCCG TATGGGTGAT CGGCACCGCG ACGCTGTCCA CGCCCATTGG CCAGACCAGC CTCGGCAACT ACATCTTCGC CGGGCTGCAG ACGCAGAACT GGGTGCTGGT GCTGTTCGGC TGCGCCGCCG CCGCAGTGCT GGCGCTCGCG GTCGATCAAC TGCTGGCGCT GATCGAGAGC GGCCTGCGGC TGCGCAGCCG CGTCCGCACC GCGCTCGGCG GCCTTGGCCT CGCCGCGCTG CTGATCGCGA CACTGGCGCC GTCGCTGGCC GGCACGCGAT CGACCTACTT CGTCGGCGCC AAGACCTTCA CCGAGCAATA TGTGCTCGCG GCCCTTCTCC AGCAGCGGCT CGACGCCGCC GGGCTGTCGG CCACGACGCG CGCCGGCCTC GGCTCCAGCG TGATCTACGA CGCGCTGAGG AGCGGCGACA TCGACGCCTA TGTCGACTAC TCCGGCACGC TGTGGGCCAA TCAGTTCAAG CACCCCGGCA TCGCCGCGCG CGACCAGGTG CTGGCCGATC TGAAGCAGGA TCTCGCCAAG GACGGCGTCA CCCTGCTCGG CGACCTCGGC TTCGCCAACG CCTATGCGCT GGCGATGCCG CGCGCCAGGG CCGAGGCGCT CGGTATCCGC TCGATCGCCG ATCTCGCCGC GCGCGCGCCG GCGATGACGA TCGCCGGCGA CTACGAGTTC TTCTCCCGCC CGGAATGGGC CAATCTGCGC CGCACCTATG GGCTCACTTT CAAGGCCGAA CGGCAGATGC AGCCCGACTT CATGTACGCG GCCGTCGCCA CCGGCGAGGT CGATCTGATC GCCGGCTATA CCTCGGACGG GCTGATCGCC AAATACGATC TCGTCGTGCT GGACGATCCC AAGCAGGCGA TCCCGCCTTA CGACGCGGTG CTGCTGATCT CACCAAAGCG CGCCGGCGAC GCGAAGCTAC GCGATGCGCT GACGCCGCTG GTCGGCCGCA TCGATATCGG CGCGATGCGC GCCGCCAATC TGCGCGCCGC CTCGGGCGGC GGCGACGGCA CGCCCGACGC GGTGGCGCGG GCGCTGTGGG ACGGCATCAA GGCGCGCTGA
|
Protein sequence | MSLFGDPRWN EALANLPDTL GAHVRVSLAA LALGLVVSLP LAIVARRRPI LRGALLGFAG IVQTIPGLAL LALFYPLLLA LAALSLRAFG VGFSAFGFLP AVLALALYAM LPVLRNTITG LDGVDPALVE AAQGVGMTPR QVLTMVEVPL ALPVIMAGIR TAAVWVIGTA TLSTPIGQTS LGNYIFAGLQ TQNWVLVLFG CAAAAVLALA VDQLLALIES GLRLRSRVRT ALGGLGLAAL LIATLAPSLA GTRSTYFVGA KTFTEQYVLA ALLQQRLDAA GLSATTRAGL GSSVIYDALR SGDIDAYVDY SGTLWANQFK HPGIAARDQV LADLKQDLAK DGVTLLGDLG FANAYALAMP RARAEALGIR SIADLAARAP AMTIAGDYEF FSRPEWANLR RTYGLTFKAE RQMQPDFMYA AVATGEVDLI AGYTSDGLIA KYDLVVLDDP KQAIPPYDAV LLISPKRAGD AKLRDALTPL VGRIDIGAMR AANLRAASGG GDGTPDAVAR ALWDGIKAR
|
| |