Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0422 |
Symbol | |
ID | 3909978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 464874 |
End bp | 466073 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637882308 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_484044 |
Protein GI | 86747548 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGAAC TGCGCGATCG AAATACCACG TCTCTGCGAA CCAGCCGCCG CACGGCGGTC GGATTGATCC TCGGCGCGCC GTTGCTCGGC GCCTGCTCGG GGATGCAGCA GACGCTCTCC AGTCAGTTCG GCCAGCAGCC GACCGCGCCG GAGGCCGCGC AGCAATCGCA ATCCGTCGGC AACGGTCGGG TCAAGGTCGG TCTCGTGCTG CCGCTGTCTG CGGCCGGAAA TGCCGGCGTC GCCGCGCAGT CGATGAAGAA TGCCGCCGAG ATGGCGCTCG CCGAGTTCAA CAATCCCGAC ATCCAGTTGC TGGTGAAAGA CGATGCCGGC AATCCGCAGG GCGCGCAGGC CGCGACCCAG CAGGCGCTCG ACGAAGGCGC CGAGATCATG CTCGGTCCGC TGTTCGCGCA ATCGGTGCCG GCTGCCGCCC AGCTGACGCG CGCCCGCGGC ATCTCGATGA TCGCGTTCTC GACGGATTCG AGCGTCGCCG GCCGCGGCGT CTATCTGTTG AGCTTCCTGC CGGAGTCCGA CGTCAACCGG ATCATCGGCT ACGCGTCGAG CGTCGGCAAA CGTTCCTATG CGGCACTGCT GCCGGACAAC GCCTATGGCG GCGTCGTCGA GGCCGCCTTC AAGCAGGTGG TCGGGACCAA GGGCGGCCGC ATCGCCGCGT TCGAGAAATA CGGCGCCGAC CGAGCCGGCC CGGCGCGGAC CATCGCGCAG GCGCTGTCCG GCGCCGATTC GCTGCTGCTC GCCGATGACG GTGATGCGCT GGCAAGCGTC AGCGAGGCGC TGACCGCGGC GGGCGCCGAT CTGCGCCGCG TGCAGTTGCT CGGCACCGGG CTGTGGGACA ATCCGCGCGT GTTCGCAACG CCGGCGTTGC AGGGTGGACT CTACGCCGCG CCCGATCCGT CCGGCTTTCG CAGTTTCTCT GGCCGCTACC GCGCCAAATT CGGCCAGGAA CCCGTCCGCA CCGCGACGCT CGCTTACGAT GCAGTGGCGC TGGTGGCGGC CTTGTCGAAG ACGCAGGGTG CCAAGCGGTT CTCGGCCGAG GTGTTGACCA ATCCGTCGGG CTTCGCCGGC ATCGACGGCC TGTTTCGCTT CCGCGCCGAC GGCAGCAACG AGCGGGGCCT CGCGGTGATG CGCGTTGCGA CCGGCGGCGC CCAGGCGGTG GCTGGATCGC CGAAGAGCTT CGGGGCGTAG
|
Protein sequence | MAELRDRNTT SLRTSRRTAV GLILGAPLLG ACSGMQQTLS SQFGQQPTAP EAAQQSQSVG NGRVKVGLVL PLSAAGNAGV AAQSMKNAAE MALAEFNNPD IQLLVKDDAG NPQGAQAATQ QALDEGAEIM LGPLFAQSVP AAAQLTRARG ISMIAFSTDS SVAGRGVYLL SFLPESDVNR IIGYASSVGK RSYAALLPDN AYGGVVEAAF KQVVGTKGGR IAAFEKYGAD RAGPARTIAQ ALSGADSLLL ADDGDALASV SEALTAAGAD LRRVQLLGTG LWDNPRVFAT PALQGGLYAA PDPSGFRSFS GRYRAKFGQE PVRTATLAYD AVALVAALSK TQGAKRFSAE VLTNPSGFAG IDGLFRFRAD GSNERGLAVM RVATGGAQAV AGSPKSFGA
|
| |