Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0038 |
Symbol | |
ID | 3909721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 37966 |
End bp | 39186 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637881919 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_483661 |
Protein GI | 86747165 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGCC TGATTCAGGC CGCCACGGCG GCTGTTGCTG CGATTGTGCT CACTGCCGCC CCGGCTGCGG CACAGAAGAA ATACGACACC GGCGCCACCG ATACCGAGAT CAAGATCGGC CAGACCGTGC CGTTCTCCGG TCCCGCCTCG GCCTATGCGG GCATCGGCAA GACCCAGGCG GCCTATATGC GGATGATCAA CGATGCCGGC GGCATCAACG GCCGCAAGAT CAACCTGATC CAGTACGACG ACGCCTATTC GCCGCCGAAA GCGGTCGAGC AGGTGCGCAA GCTGGTCGAA AGCGACGAGG TGCTGCTGAC CTTCCAGATC ATCGGCACCC CGTCCAACGC CGCGGTGCAG AAATATCTCA ACCAGAAGAA GGTGCCGCAA CTGCTCGCGG CGACCGGCGC GACACGGTTC ACCGATCCGA AGAATTTCCC CTGGACGATG GGCTACAACC CGAACTACCA GACCGAGGGT CGGATCTACG CGCGCTACAT CCTGAAGAAC CACCCCGACG CCAAGATCGG CGTGCTGTTC CAGAACGACG ATCTCGGCCG CGACTACGTC ACCGGCCTGC GGGCCGGCCT CGGCGACAAG GCCGACAAGA TGATCGTGGC GGAGACGTCC TATGAACTCA CCGACCCGAC CGTCGACTCG CAGATCGTCA AGCTGAAATC CGCCGGCGCC ACGCTGCTGT ACGACGCATC GACGCCGCGC TTCGCCGCGC AGGCGATCAA GAAAGTCGCC GATCTCGGCT GGAATCCGGT GCACATCCTC GACATCAACG CCAGCCCGGT GTCGGCGACG CTGAAGCCCG CCGGCCTCGA CATCTCCAAG GGCATCATCA GCGTCAATTA CGGCAAGGAC CCCGCCGACC CGCAATGGGC CGACGACCCG GGCGTGAAGA AGTACTTCGC CTTCATGGAC AAGTACTATC CCGAGGGCGA CAAGATGAAC ACCGTCAACA GCTACGGCTA TTCCACCGCG GAGCTGCTGA TCACCATCCT GAAGCAGTGC GGCGACAATC TCACCCGCGA CAACATCATG AAGCAGGCCG CCAATCTGAA GAACGTCACG CTCGACCTGT CGCTGCCGGG CATGTCGATC AATACCTCGC CGACCGACTT CCGCGTCAAC AAGCAGTTGC GGATGATGAA GTTCAACGGC GAGCGCTGGG AGCTGTTCGG CCCGATCATT GAGGACGACG CCGCGATGTG A
|
Protein sequence | MKSLIQAATA AVAAIVLTAA PAAAQKKYDT GATDTEIKIG QTVPFSGPAS AYAGIGKTQA AYMRMINDAG GINGRKINLI QYDDAYSPPK AVEQVRKLVE SDEVLLTFQI IGTPSNAAVQ KYLNQKKVPQ LLAATGATRF TDPKNFPWTM GYNPNYQTEG RIYARYILKN HPDAKIGVLF QNDDLGRDYV TGLRAGLGDK ADKMIVAETS YELTDPTVDS QIVKLKSAGA TLLYDASTPR FAAQAIKKVA DLGWNPVHIL DINASPVSAT LKPAGLDISK GIISVNYGKD PADPQWADDP GVKKYFAFMD KYYPEGDKMN TVNSYGYSTA ELLITILKQC GDNLTRDNIM KQAANLKNVT LDLSLPGMSI NTSPTDFRVN KQLRMMKFNG ERWELFGPII EDDAAM
|
| |