Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1739 |
Symbol | |
ID | 3909726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1986262 |
End bp | 1987494 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637883633 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_485358 |
Protein GI | 86748862 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.152869 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACTT CGCCTTCGCG CGCGCTCGTC TTTTCAGCAG CCGTCGCGGT CAGCCTCGCA GCGTTCAACG GCGCTGCCTT CGCTCAAAAA AAATACGACA CCGGCGCCTC CGATACCGAG ATCAAGATCG GCAACATCAT GCCCTATTCG GGCCCGGCCT CGGCCTACGG CACGATCGGC AAGACCGAAG AAGCCTACTT CCGGATGGTG AACGAGAACG GCGGCATCAA CGGCCGCAAG GTCAACTTCA TCAGCTATGA CGACGCCTAT TCGCCGCCGA AGGCGGTCGA ACAGGTCCGC AAGCTGGTCG AGAGCGACGA AGTGCTGGCG GTGTTCAACC CGCTCGGCAC GCCGTCCAAC ACCGCGGTCC AGAAATATCT CAACGCCAAG AAAGTGCCGC AACTGTTCGT CGCCACCGGC GCCACCAAGT GGAACGATTC GAAGAACTTC CCCTGGACGA TCGGCTGGCA GCCGTCCTAC CAGAGCGAAG CGCAGATCTA CGCCAAGTAC CTGCTGAAAG AGAAGCCGGA CGCCAAAATC GGCATCCTCT ATCAGAACGA CGATTTCGGC AAAGACTATC TGAAGGGCAC CAAGGACGGC CTCGGCGCCA AGGCCGCCTC GATGATCATC GCCGAAGAGA GCTACGAGAT CTCGGCGCCG ACCATCGACA GCAACGTCGT CAAGCTGAAG TCGGCCAATC CCGACGTCGT GCTGGTCTAC ACCACGCCGA AATTCGCCGC GCAGACCATC AAGAAGATCG CCGAACTGAG CTGGAAGCCG CTGGTGATCC TCACCAACGT CTCGGCCTCG GTCGGCAGCG TGATGCAGCC CGCCGGCTTC GAAAACGCCC AGGGCGTGCT GTCGGCGAAC TACGCGAAGG ACCCGACCGA CCAGCAGTGG GACAGCGATC CGAAGTTCAA AAAGTGGCAC GCCTTCGTCG AGAAATACAT GCCGGGCTCC AACAAGAACG ACAGCAACAT GGTCTACGGC TACGGCGCCG CCCAGACGCT GCACAAGGTG CTGGAGATGT GCGGCGACGA CCTGACCCGC GCCAATTTGA TGAAGCAGGC CGCCAGCCTC AAGGACTTCG AGCCGGACAC CCTGTTGCCC GGCGTCAAGA TCAATACCTC GGCGACCGAC TTCGCGCCGA TCAGCCAACT CCAGATGATG CGCTTCAAGG GCGACAGGTG GGAGCTGTTC GGCGAGATCA TCTCCGGCGA CATCACCCAG TAA
|
Protein sequence | MPTSPSRALV FSAAVAVSLA AFNGAAFAQK KYDTGASDTE IKIGNIMPYS GPASAYGTIG KTEEAYFRMV NENGGINGRK VNFISYDDAY SPPKAVEQVR KLVESDEVLA VFNPLGTPSN TAVQKYLNAK KVPQLFVATG ATKWNDSKNF PWTIGWQPSY QSEAQIYAKY LLKEKPDAKI GILYQNDDFG KDYLKGTKDG LGAKAASMII AEESYEISAP TIDSNVVKLK SANPDVVLVY TTPKFAAQTI KKIAELSWKP LVILTNVSAS VGSVMQPAGF ENAQGVLSAN YAKDPTDQQW DSDPKFKKWH AFVEKYMPGS NKNDSNMVYG YGAAQTLHKV LEMCGDDLTR ANLMKQAASL KDFEPDTLLP GVKINTSATD FAPISQLQMM RFKGDRWELF GEIISGDITQ
|
| |