Gene Information |
Locus tag | RPB_4614 |
Symbol | |
ID | 3912431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5213725 |
End bp | 5214945 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637886518 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_488208 |
Protein GI | 86751712 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
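The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the listed gene length), and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on replicon NC_007778.
START_BP = 5213725
END_BP = 5214945


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the terminating stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1221 406"
```

Both values match the Gene Length and Protein Length fields of the record.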
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTAC GCCCGAATTC CGTTTTCGCG TCGGCAGCCG CGGTGGCCGC GGTTATGCTT GCGGCCACAT CGGCTGCAGC GGCGGAGAAG AAATACGATC CCGGCGCCAG CGACACTGAA ATCAAGATCG GCCAGACCGT GCCGCATTCG GGTCCCGGTT CGCTCTATGG CGTGCTCGGC CGCGTCGGCG AAGCCTATTT CCAGATGCTG AACGACAAGG GCGGCATCAA CGGCCGCAAG GTCAAATTCC TGACCCTGGA CGATTCCTAC AGCGCGCCGA AGGCGGTCGA AGCCACCCGG CGGCTGGTCG AGCAGGAAGA GGTGCTGGCG CTGTACGGCT CGCTCGGCAC CGCGCCGCAG ACGGCTGTCC ACAAATATCT CAACAACAAG AAGGTGCCGC AGCTGCTGCT GAACACCGGC GCGTCGAAAT GGAACGACCC GAAGAACTTC AAATGGACCA TGGCGGGTCT GCCGCTCTAT CCGACCGAGG CGCGGATTCT CGCCAAATAC GTGCTCAGCG TGAAACCGGA CGCCAAGATC GCGATCCTCT ATCAGAACGA CGATTTCGGC CGTGACTTCC TCGGCCCGTT CAAGAAGGTT TTGGAAGACG CCGGCGGCAA GGCCAAGGTG ATCGCCGAGG CCAGCTACGA TCTGACCGAG CCGACCATCG ACTCGCAGAT GATCAATCTG TCGAAATCCG GCGCCGACGT GTTCTACAAC ATCACCACCG GCAAGGCGTC GTCGCAGTCG ATCCGCAAGG TCGTGGAACT CGGCTGGAAG CCGCTGCAAC TGCTGTCGGC CGGCTCGACC GGGCGCTCGA TTCTCGAGGC CGCAGGCCTC GACAACGCCA AGGGGATCGT GGCGATCGCC TATACCAAGG ACATCGGATC GCCGAAATAC GCCGGCGACC CCGACGTGAT GGCGTTCGAG GAATTGCGCA AGAAGTACCT GCCGAACGTC ACGCCGGACA ATTCGATCGC GTTCTCCGGC TATGCGCAGG CCGCCGCCAT GGCGGAAATT CTGCGCCGCT GCGGCGACGA TCTGACGCGT GAAAACGTCA TCAAGCAGGC GTCGATGCTG GGCGGATTCC GCGCTCCGCA CATGCTGCCC GGCGTGAGCT ACTCCTACAA GCCGGACGAC TACACTTCGA TCAAGACGCT CTACACGATG GAATTCAGCG GCAAGGACTG GATCGTGTCC GACAAGCCGG TCGCTGAATA A
|
Protein sequence | MNLRPNSVFA SAAAVAAVML AATSAAAAEK KYDPGASDTE IKIGQTVPHS GPGSLYGVLG RVGEAYFQML NDKGGINGRK VKFLTLDDSY SAPKAVEATR RLVEQEEVLA LYGSLGTAPQ TAVHKYLNNK KVPQLLLNTG ASKWNDPKNF KWTMAGLPLY PTEARILAKY VLSVKPDAKI AILYQNDDFG RDFLGPFKKV LEDAGGKAKV IAEASYDLTE PTIDSQMINL SKSGADVFYN ITTGKASSQS IRKVVELGWK PLQLLSAGST GRSILEAAGL DNAKGIVAIA YTKDIGSPKY AGDPDVMAFE ELRKKYLPNV TPDNSIAFSG YAQAAAMAEI LRRCGDDLTR ENVIKQASML GGFRAPHMLP GVSYSYKPDD YTSIKTLYTM EFSGKDWIVS DKPVAE
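The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and uses the standard amino-acid assignments, which translation table 11 shares with table 1. Alternative start codons are not handled, since this gene starts with ATG. All function names are hypothetical.

```python
# Codon table in the compact NCBI layout: bases ordered T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate from the first base up to, but not including, the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks an unrecognised codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGAATTTAC GCCCGAATTC CGTTTTCGCG"))  # prints "MNLRPNSVFA"
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field would confirm that the two fields agree. Multiplying the result of `gc_fraction` by 100 gives a value to compare with the GC content field.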
|
| |