Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4204 |
Symbol | |
ID | 3912012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4776167 |
End bp | 4777369 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637886107 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_487806 |
Protein GI | 86751310 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.669867 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTCATC TTTCCATTGT TGCAGCTGCA GCATTCGTCG CTGCCGCCGT CCCGGCGCTG ACCGCATCGG TCGCCGCTCG CGCCGACGAT CTCAAGATCG CGCTGATCTA CGGCAAGACC GGTCCGCTGG AGGCCTACGC CAAGCAGACC GAAACCGGCC TGATGATGGG GCTCGAATAC GCGACCAAGG GCACGATGAC GCTCGACGGC CGCAAGATCA AGGTGATCAC CAAGGACGAC CAGAGCAAGC CCGACCTCTC CAAGGCGGCA CTTGCGGAAG CGTACCAGGA CGACGGCGTC GACATCGCGA TCGGCACCTC GTCGTCGGCC GCAGCACTCG CCGACCTGCC GGTCGCCGAG GAAAACAAGA AGATCCTGAT CGTCGAGCCG GCGGTAGCCG ATCAGATTAC CGGCGAGAAG TGGAATCGCT ACATCTTCCG CACCGGCCGC AACTCGTCGC AAGATGCGAT CTCCAACGCG GTCGCGATCG GCAAGCCGGG CGTCACCATC GCCACGCTGG CACAGGACTA CGCGTTCGGC CGCGACGGCG TCGCCGCCTT CAAGGAGGCG CTGGCCAAGA CCGGCGCGAC GCTCGCCGCC GAGGAATACG TCCCGACCAC CACCACCGAC TTCACCGCGG TCGGCCAACG ACTGTTCGAC ACGCTGAAGG ACAAGCCCGG CAAGAAGATC ATCTGGGTGA TCTGGGCCGG CGGCGGCGAT CCGCTGACCA AGCTGCAGGA CATGGACCCG AAGCGCTACG GCATCGAGCT GTCGACCGGC GGCAACATCC TGCCGGCGCT CGCCGCCTAC AAGCGCCTGC CCGGCATGGA AGGCGCGACC TATTACTATT ACGACATCCC CAAGAACCCG ATCAACGACT GGCTGGTGAC CGAGCATCAG AAGCGCTTCA ACGCACCGCC GGACTTCTTC ACCGCGGGCG GCTTCTCGGC CGCGATGGCG GTGGTCACCG CCGTGCAGAA GGCGAAGTCG ACCGACACCG AGAAGCTGAT CGCGGCGATG GAAGGCATGG AGTTCGACAC GCCGAAGGGC AAGATGATGT TCCGCAAGGA AGATCACCAG GCGCTGCAGA GCATGTATCA CTTCAAGGTC AAGGTCGACC CGAACGTCGC CTGGGCCGTG CTCGAGCCGG TGCGCGAACT GAAGATCGAG GACATGAATA TCCCGATCAA GAACAAGCGG TGA
|
Protein sequence | MRHLSIVAAA AFVAAAVPAL TASVAARADD LKIALIYGKT GPLEAYAKQT ETGLMMGLEY ATKGTMTLDG RKIKVITKDD QSKPDLSKAA LAEAYQDDGV DIAIGTSSSA AALADLPVAE ENKKILIVEP AVADQITGEK WNRYIFRTGR NSSQDAISNA VAIGKPGVTI ATLAQDYAFG RDGVAAFKEA LAKTGATLAA EEYVPTTTTD FTAVGQRLFD TLKDKPGKKI IWVIWAGGGD PLTKLQDMDP KRYGIELSTG GNILPALAAY KRLPGMEGAT YYYYDIPKNP INDWLVTEHQ KRFNAPPDFF TAGGFSAAMA VVTAVQKAKS TDTEKLIAAM EGMEFDTPKG KMMFRKEDHQ ALQSMYHFKV KVDPNVAWAV LEPVRELKIE DMNIPIKNKR
|
| |