Gene Information |
Locus tag | RPB_3687 |
Symbol | |
ID | 3911489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4227083 |
End bp | 4228078 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637885589 |
Product | peptidase S58, DmpA |
Protein accession | YP_487293 |
Protein GI | 86750797 |
COG category | [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
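The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, unspliced, and ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 4227083
END_BP = 4228078


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete, unspliced CDS.

    Assumes the final codon is a stop codon, which is not counted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 996
print(protein_length(length_bp))  # 331
```

Both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields of the record.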
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGAACC TCATCACCGA CATTGCCGGC GTCCGCGTCG GCCACGCCCA CGACGAACGC TTGGCCTCCG GCGTCACCGC GATCCTGTTC GACACGCCGG CCGTCGCCTC GATCGACGTG CGCGGCGGCG GACCGGGGAT TCGCGACGGC GTGCTGTTGG AGCCGGTCAA TACCGTCGAG CAGGTCGACG GCTTCACGCT GTCGGGCGGC TCGGCGTTCG GGCTCGATTC CGGCGGCGGC GTGCAGGCCT GGCTCGCCGA ACAGGGCCGC GGCTTCTCGA TCGGCACCGC GACGATCCCG ATCGTTCCGG GCGGCGTGAT CTTCGACATG CTCAATGGCG GCGACAAGGC GTGGGGGCAA TTCTCGCCCT ATCGCGAGCT CGGCTATCAG GCCGCAGCGG CGGCAAGCGA CGCCTTCGCG CTCGGCAGCG TCGGCGCGGG GCTCGGCGCG ACCACGGCGA ACTACAAAGG CGGGCTCGGC TCGGCCTCCG CGACGACGCC GGGCGGAATC ATGGTCGGCG CGATCGCGGC GGTGAATGCG ATCGGCAGCG TCACCATCGG CGACGGCCCG TGGTTCTGGT CGGCGCCTTA CGAAGTCGGC GATGAGTTCG GCGGCTGCGG CATGCCGCAG CATTTCACCG ACGAGATGCT GGCGGTGAAG ATCAAGGGCG CCGCCGCCGC GACCGCGGAG AACACCACGC TGGTCGCGGT CGTCACCGAC GCGCTGCTGA CCAAGACGCA GGTGAAGCGG CTGGCGATGA TGGCGCAGAC CGGGTTTGCG CGGGCCATCT ATCCGGTGCA TGCGCCGCTC GACGGCGACG TGGTGTTCGC CGCCGCGACC GGCAACAAGC CGGTCGATCC GCTGGCGGGG CTCACCGAAC TCGGCGCGAT CGCCGCCAAC ACGGTGGCGC GGGCGATCGC GCGCGGCGTC TACGAGGCGA CCGCGCTGCC GTTCAAGGAC GCGCAGCCGG CGTGGCGCGA TCAGTTCAGG CGCTGA
|
Protein sequence | MKNLITDIAG VRVGHAHDER LASGVTAILF DTPAVASIDV RGGGPGIRDG VLLEPVNTVE QVDGFTLSGG SAFGLDSGGG VQAWLAEQGR GFSIGTATIP IVPGGVIFDM LNGGDKAWGQ FSPYRELGYQ AAAAASDAFA LGSVGAGLGA TTANYKGGLG SASATTPGGI MVGAIAAVNA IGSVTIGDGP WFWSAPYEVG DEFGGCGMPQ HFTDEMLAVK IKGAAAATAE NTTLVAVVTD ALLTKTQVKR LAMMAQTGFA RAIYPVHAPL DGDVVFAAAT GNKPVDPLAG LTELGAIAAN TVARAIARGV YEATALPFKD AQPAWRDQFR R
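The relationship between the gene sequence, the protein sequence, the translation table, and the GC content field can be illustrated with a short, dependency-free sketch. It rests on several assumptions that are not stated in the record: the amino-acid assignments are those of the standard code (which translation table 11 shares), an initiator codon of ATG, GTG, or TTG is read as methionine (a deliberately small subset of possible start codons), and whitespace between the 10-base groups is ignored. The names `clean`, `gc_fraction`, and `translate_cds` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these common bacterial initiators are treated as starts.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, reading an initiator codon as M."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [s[i:i + 3] for i in range(0, len(s), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop the terminal stop symbol if the CDS ends in a stop codon.
    return protein[:-1] if protein.endswith("*") else protein


# First 21 bases of the gene sequence above, plus its TGA stop codon.
fragment = "TTGAAGAACC TCATCACCGA C" + "TGA"
print(translate_cds(fragment))  # MKNLITD
```

The fragment reproduces the first seven residues of the protein sequence, including the TTG start codon read as M. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the Protein sequence and GC content fields to be compared in the same way.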