Gene Information |
Locus tag | RPB_3566 |
Symbol | |
ID | 3911368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4086879 |
End bp | 4088114 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637885468 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_487172 |
Protein GI | 86750676 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
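The coordinate and length fields above appear to be mutually consistent. The following minimal sketch shows the arithmetic, assuming 1-based inclusive coordinates and an unspliced CDS whose length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4086879, 4088114)
print(length)                     # 1236
print(protein_length_aa(length))  # 411
```

Both results agree with the Gene Length and Protein Length fields listed in the record.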
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.172998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.680161 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCCC TCTCCCGTTC GATCGCCACC CTGGCGGCCG CCGCCCTGCT GTCCGCAGCC GCCGGACAGG CGATGGCGCA GAAGAAATAT GGCCCCGGCG CCAGCGACAC CGAAGTCAAG ATCGGCAACA TCGTGCCTTA CTCGGGCCCG GCGTCGGCCT ATGGCAGCGT CGGCAAGGCG CAAGAAGCCT ATTTCAAGAT GATCAACGAC AAGGGCGGCA TCAACGGCCG CAAGATCGTC TACATCTCCA ACGACGACGC CTATTCGCCG CCGAAATCGG TCGAGCAGAC CCGTAAGCTG GTCGAGAGCG ACGAGGTGCT GTTCATGTTC AGCCCGCTCG GCACGCCGTC CAACACCGCG ATCCAGAAAT ATCTCAACGC CAAGAAAGTG CCGCATCTGT TCCTGGCGTC GGGCGCCACC AAGTGGAACG ATCCGAAGCA CTTTCCGTGG ACGATGGGCT GGCTGCCGAG CTACCAGAGC GAAGGCCGGA TCTACGCCAA GTATCTGATG AAGGAGAAGC CGGACGCCAA GATCGCCGTG CTGTATCAGG GCGACGATTT CGGCAAGGAC TATCTCAAGG GCCTCAAGGA CGGCCTCGGC GCCAAGGCTT CGCAGGTGGT GATCGAGGAC AGCTACGAGC TGACCGAGCC GACCGTCGAT TCCCACATCG TCAAAATCAA GGCCGCCAAT CCCGACGTGC TGGTGATCTT CGCCACGCCG AAATTCGCGG CGCAGACCAT CAAGAAGGTC GCCGAACTGG CGTGGAAGCC GATGATGATC GTGCCGAACG TCTCGGCCTC GACCGGCAGC GTGATGAAGC CCGCCGGCTT CGAGAACGCC CAGGGCATCG TCTCCGCCTC CTACGCCAAG GACGCCACCG ACAAGCAGTG GGAAAACGAC CCCGGCATGA AGGCGTATTA CGAGTTCATG GAGAAGTATG CGCCGCAGGC CAGCCGCGCT GACTCATCGT TCATGACCGG CTACAACATC GCCGAGACGG TCGCGGTGCT GATCAAGCAA TGCGGCGACG ATCTGTCCCG CGAGAACGTC ATGAAGCAGG CGGCCAACCT CAAGGACGTC CAGCTCGGCG GCCTGCTGCC GGGCATCAAG CTCAACACCA GCGCGACCGA CTTCGCGCCG ATCGAACAGC TGCAGCTGAT GAAGTTCCAG GGCGAGAACT GGAAGCTGTT CGGCGACGTG ATCGAGGGCG AAGTCGCCGC GCCGACCGGC GGCTAG
|
Protein sequence | MSALSRSIAT LAAAALLSAA AGQAMAQKKY GPGASDTEVK IGNIVPYSGP ASAYGSVGKA QEAYFKMIND KGGINGRKIV YISNDDAYSP PKSVEQTRKL VESDEVLFMF SPLGTPSNTA IQKYLNAKKV PHLFLASGAT KWNDPKHFPW TMGWLPSYQS EGRIYAKYLM KEKPDAKIAV LYQGDDFGKD YLKGLKDGLG AKASQVVIED SYELTEPTVD SHIVKIKAAN PDVLVIFATP KFAAQTIKKV AELAWKPMMI VPNVSASTGS VMKPAGFENA QGIVSASYAK DATDKQWEND PGMKAYYEFM EKYAPQASRA DSSFMTGYNI AETVAVLIKQ CGDDLSRENV MKQAANLKDV QLGGLLPGIK LNTSATDFAP IEQLQLMKFQ GENWKLFGDV IEGEVAAPTG G
|
| |
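The gene and protein sequences above can be cross-checked with a short script. The sketch below is an illustration rather than the database's own method: it strips the 10-base grouping spaces, computes the GC fraction, and translates codons with the standard amino acid assignments, which translation table 11 shares (table 11 differs in its set of permitted start codons). The helper names are hypothetical, and only the first 30 bases of the gene are used here for brevity.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments and differs in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the grouping whitespace used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the Gene sequence field.
PREFIX = "ATGTCGGCCC TCTCCCGTTC GATCGCCACC"
print(translate(PREFIX))  # MSALSRSIAT
```

Pasting the full Gene sequence field into `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields of the record.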