Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0347 |
Symbol | |
ID | 6407993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 365195 |
End bp | 366760 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642710257 |
Product | peptidase S10 serine carboxypeptidase |
Protein accession | YP_001989383 |
Protein GI | 192288778 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.272983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCTGT CGCCCCGGTC CGCCGTGACC GCCTCGATGT TGATCGCAGC GCTGGCGCTG CCGCAGTTCG GCTCGAGCGC TGTCGCCCAA GAGGCGCGAC CCGCGGCACC ACACGCCGAG CGTGCCAAGC CGAATGCAGA GCAGGCAGAC GCAGTCGCGA GTAAAGCCGA GGCCTCTCCG GCCCGTGCAG AGCTCAACAG CCTGCCGCCG GACGTCACCA CCAAGCACAG TCTGGCGTTG CCGGGGCGAA CTCTCGCGTT CACCGCCACC GCCGGCTCGA TCCGGCTGTT CAACGGCAAG GGCGAACCGC AGGCCGATGT TGCCATCACC ACCTACAAGC TCGACGGCGC CGATGCACGA ACCCGGCCGG TGACTTTCCT GTTCAACGGC GGCCCCGGCG CATCCTCGGC CTGGCTGCAG CTCGGCGCCG CCGGACCGTG GCGGCTGCCG ATCGGCAACA GCGTGGTGGC GTCTTCGCCG CCGGTGCTGC AGGCCAATGC AGAGACCTGG CTCGATTTCA CCGACCTGGT GTTCATCGAT CCGGTCGGCA CCGGCTACAG CCGTTTCGTC GCCAGCGGCG ACGAGGTCCG CAAGCATTTC TATGCGGTCG AGGGTGACAT CTCGGCGATG GCGGTGGTGA TCCGGCGTTG GCTCGAGAAG AACGACCGGC TCGTTTCGCC GAAGTATCTA GCCGGTGAAA GTTACGGCGG CATCCGCGGA CCGAAGGTCG TGGACAATCT GCAGACCAAG CAGGGCGTCG GCGTCAATGG TCTGATCCTG GTGTCGCCGG TGCTCGACTT CCGCGATCTG TCCGGCTCCA GCCTGCTGCA ATATGCGGCG CGGCTGCCGT CGATGACCGC GGTGGCGCGG CAGCAGAAGG GCAAGGTGAA CCGCGCCGAC CTCGCCGACG TCGAAAGCTA CGCGCGCAGT GAATTCCTCA CCGATCTGGT CAAAGGCGAG GCCGACAAGG AAGCCACCAC GCGGCTTGCC GACCGCGTCT CCGCGCTCAC CGGGATCGAC AAGACCGTGA GCCGGCGGCT CGCCGGACGG TTCGACACCC GCGAATTCCA GCGTGAATTC GACCGCGATC GCGGCCGGGT CACCGGACGG TTCGACGGCG CCAAGCTAGG GCTCGATCCG TTCCCGGATT CCAGCGCTGC GCATTTCGGC GATCCGTCGG CGGATTCACT GATCGCGCCG CTGACAAGTG CTGCCGTGCA GCTGACCCGC TCCACGCTGA ACTGGAAACC GGACGGATCG TACGAACTGT TGAACAGCTC GGTCGCCGAG CAATGGGATT TCGGCCGTGG CCGGCAGCCG CTGGAATCGA CCACGCAGCT GCGCGAGATC CTCAGCGTCG ATCCGAGCCT GCAGGTGCTG GTCACCGGCG GGTTGTTCGA TCTCGCCGCG CCGTATTTCG GCACCCAGAT GGTGCTCGAT CAGCTGCCGC CGACGCTGGC GGAAAAACGC GTGAAGTTCG TCGTCTATCC CGGCGGCCAC ATGTTCTACG CCGAGGACGC TGCCCGGCAA TCGCTGCATG ACGAAGTGAA GGCGATGATG AAGTAG
|
Protein sequence | MPLSPRSAVT ASMLIAALAL PQFGSSAVAQ EARPAAPHAE RAKPNAEQAD AVASKAEASP ARAELNSLPP DVTTKHSLAL PGRTLAFTAT AGSIRLFNGK GEPQADVAIT TYKLDGADAR TRPVTFLFNG GPGASSAWLQ LGAAGPWRLP IGNSVVASSP PVLQANAETW LDFTDLVFID PVGTGYSRFV ASGDEVRKHF YAVEGDISAM AVVIRRWLEK NDRLVSPKYL AGESYGGIRG PKVVDNLQTK QGVGVNGLIL VSPVLDFRDL SGSSLLQYAA RLPSMTAVAR QQKGKVNRAD LADVESYARS EFLTDLVKGE ADKEATTRLA DRVSALTGID KTVSRRLAGR FDTREFQREF DRDRGRVTGR FDGAKLGLDP FPDSSAAHFG DPSADSLIAP LTSAAVQLTR STLNWKPDGS YELLNSSVAE QWDFGRGRQP LESTTQLREI LSVDPSLQVL VTGGLFDLAA PYFGTQMVLD QLPPTLAEKR VKFVVYPGGH MFYAEDAARQ SLHDEVKAMM K
|
| |