Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4338 |
Symbol | |
ID | 6412022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4664819 |
End bp | 4665823 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 642714220 |
Product | peptidase S58 DmpA |
Protein accession | YP_001993309 |
Protein GI | 192292704 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.105129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCACAATC TCATCACCGA TGTCGCCGGC GTCCGCGTCG GCCATGCGCA CGACCACAAG CTCGCTTCCG GCGTTACCGC GATCCTGTTC GACAAGCCCG CGGTCGCTTC GATCGACGTG CGCGGTGGCG GTCCTGGGAT TCGCGACGGC GCGCTGCTGG AACCGGTGAA CACCGTCGAG CAGATCGACG GCTTCACGCT GTCGGGCGGC TCGGCATTCG GCCTCGATTC CGGCGGCGGC GTGCAGGCCT GGCTCGCCGA GCGCGGTCGC GGTTTTGCGA TCGGCAATGC GACGATTCCG ATCGTGCCGG GCGCGGTGGT GTTCGACATG ATCAACGGCG GCGACAAGGC CTGGGGTCGG TTCTCGCCGT ATCGCGACCT CGGCTACGCG GCTGCGGACG CCGCCGGCGA CAGCTTCGCG CTCGGCAGTG TCGGCGCCGG CCTCGGCGCC ACCACGGCAA CGCTGAAGGG CGGGCTGGGC TCGGCGTCAG CAACCACGCC CGGCGGCGTC ACGGTGGGCG CCATCGCGGT GGTCAACGCG ATCGGCAGCG CCACGATCGG CGACGGCCCG TGGTTCTGGT CGGCACCGTT CGAACAGGAC GGCGAATTCG GCGGGCTCGG GATGCCGGAA AGCTTCACGC CGGACATGCT GAAGGTGCGA CTGAAGGGCG CGGCGGCAGC GAGCGCGATC GAGAACACCA CGCTGGTCGC GGTGGTGACC GACGCGAACC TCACCAAGCC GCAGGTGAAG CGGCTGGCGA TGCTGGCGCA GACCGGGTTC GCCCGCGCGA TCTATCCGGT GCACGCGCCG CTCGATGGCG ACGTGGTGTT TGCCGCGGCG ACCGGCGTCA AACCGGTCGA GCCGCTCGCA GGTCTCACCG AGCTCGGCAC CATCGCGGCC AACACGGTGG CGCGGGCAAT CGCTCGCGGC GTCTATGAGG CCACCGCGCT GCCGTTCAAG GACGCGCAGC CGGCGTGGCG CGATCGGTTC GGCTCGAAGC GATAA
|
Protein sequence | MHNLITDVAG VRVGHAHDHK LASGVTAILF DKPAVASIDV RGGGPGIRDG ALLEPVNTVE QIDGFTLSGG SAFGLDSGGG VQAWLAERGR GFAIGNATIP IVPGAVVFDM INGGDKAWGR FSPYRDLGYA AADAAGDSFA LGSVGAGLGA TTATLKGGLG SASATTPGGV TVGAIAVVNA IGSATIGDGP WFWSAPFEQD GEFGGLGMPE SFTPDMLKVR LKGAAAASAI ENTTLVAVVT DANLTKPQVK RLAMLAQTGF ARAIYPVHAP LDGDVVFAAA TGVKPVEPLA GLTELGTIAA NTVARAIARG VYEATALPFK DAQPAWRDRF GSKR
|
| |