Gene Rpal_0347 details


Gene Information

Locus tag: Rpal_0347
Symbol:
ID: 6407993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 365195
End bp: 366760
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 66%
IMG OID: 642710257
Product: peptidase S10 serine carboxypeptidase
Protein accession: YP_001989383
Protein GI: 192288778
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2939] Carboxypeptidase C (cathepsin A)
TIGRFAM ID:
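
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are inclusive and that the final codon of the CDS is a stop codon. The variable names are hypothetical; the values are copied from the record.

```python
# Values copied from the Gene Information record.
start_bp = 365195
end_bp = 366760
gene_length_bp = 1566
protein_length_aa = 521

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codons = span // 3
expected_protein = codons - 1

print(span, span % 3, expected_protein)  # 1566 0 521
```

Under those assumptions the span matches the reported gene length, is a whole number of codons, and yields the reported protein length.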


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.272983
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCTGT CGCCCCGGTC CGCCGTGACC GCCTCGATGT TGATCGCAGC GCTGGCGCTG 
CCGCAGTTCG GCTCGAGCGC TGTCGCCCAA GAGGCGCGAC CCGCGGCACC ACACGCCGAG
CGTGCCAAGC CGAATGCAGA GCAGGCAGAC GCAGTCGCGA GTAAAGCCGA GGCCTCTCCG
GCCCGTGCAG AGCTCAACAG CCTGCCGCCG GACGTCACCA CCAAGCACAG TCTGGCGTTG
CCGGGGCGAA CTCTCGCGTT CACCGCCACC GCCGGCTCGA TCCGGCTGTT CAACGGCAAG
GGCGAACCGC AGGCCGATGT TGCCATCACC ACCTACAAGC TCGACGGCGC CGATGCACGA
ACCCGGCCGG TGACTTTCCT GTTCAACGGC GGCCCCGGCG CATCCTCGGC CTGGCTGCAG
CTCGGCGCCG CCGGACCGTG GCGGCTGCCG ATCGGCAACA GCGTGGTGGC GTCTTCGCCG
CCGGTGCTGC AGGCCAATGC AGAGACCTGG CTCGATTTCA CCGACCTGGT GTTCATCGAT
CCGGTCGGCA CCGGCTACAG CCGTTTCGTC GCCAGCGGCG ACGAGGTCCG CAAGCATTTC
TATGCGGTCG AGGGTGACAT CTCGGCGATG GCGGTGGTGA TCCGGCGTTG GCTCGAGAAG
AACGACCGGC TCGTTTCGCC GAAGTATCTA GCCGGTGAAA GTTACGGCGG CATCCGCGGA
CCGAAGGTCG TGGACAATCT GCAGACCAAG CAGGGCGTCG GCGTCAATGG TCTGATCCTG
GTGTCGCCGG TGCTCGACTT CCGCGATCTG TCCGGCTCCA GCCTGCTGCA ATATGCGGCG
CGGCTGCCGT CGATGACCGC GGTGGCGCGG CAGCAGAAGG GCAAGGTGAA CCGCGCCGAC
CTCGCCGACG TCGAAAGCTA CGCGCGCAGT GAATTCCTCA CCGATCTGGT CAAAGGCGAG
GCCGACAAGG AAGCCACCAC GCGGCTTGCC GACCGCGTCT CCGCGCTCAC CGGGATCGAC
AAGACCGTGA GCCGGCGGCT CGCCGGACGG TTCGACACCC GCGAATTCCA GCGTGAATTC
GACCGCGATC GCGGCCGGGT CACCGGACGG TTCGACGGCG CCAAGCTAGG GCTCGATCCG
TTCCCGGATT CCAGCGCTGC GCATTTCGGC GATCCGTCGG CGGATTCACT GATCGCGCCG
CTGACAAGTG CTGCCGTGCA GCTGACCCGC TCCACGCTGA ACTGGAAACC GGACGGATCG
TACGAACTGT TGAACAGCTC GGTCGCCGAG CAATGGGATT TCGGCCGTGG CCGGCAGCCG
CTGGAATCGA CCACGCAGCT GCGCGAGATC CTCAGCGTCG ATCCGAGCCT GCAGGTGCTG
GTCACCGGCG GGTTGTTCGA TCTCGCCGCG CCGTATTTCG GCACCCAGAT GGTGCTCGAT
CAGCTGCCGC CGACGCTGGC GGAAAAACGC GTGAAGTTCG TCGTCTATCC CGGCGGCCAC
ATGTTCTACG CCGAGGACGC TGCCCGGCAA TCGCTGCATG ACGAAGTGAA GGCGATGATG
AAGTAG
 
Protein sequence
MPLSPRSAVT ASMLIAALAL PQFGSSAVAQ EARPAAPHAE RAKPNAEQAD AVASKAEASP 
ARAELNSLPP DVTTKHSLAL PGRTLAFTAT AGSIRLFNGK GEPQADVAIT TYKLDGADAR
TRPVTFLFNG GPGASSAWLQ LGAAGPWRLP IGNSVVASSP PVLQANAETW LDFTDLVFID
PVGTGYSRFV ASGDEVRKHF YAVEGDISAM AVVIRRWLEK NDRLVSPKYL AGESYGGIRG
PKVVDNLQTK QGVGVNGLIL VSPVLDFRDL SGSSLLQYAA RLPSMTAVAR QQKGKVNRAD
LADVESYARS EFLTDLVKGE ADKEATTRLA DRVSALTGID KTVSRRLAGR FDTREFQREF
DRDRGRVTGR FDGAKLGLDP FPDSSAAHFG DPSADSLIAP LTSAAVQLTR STLNWKPDGS
YELLNSSVAE QWDFGRGRQP LESTTQLREI LSVDPSLQVL VTGGLFDLAA PYFGTQMVLD
QLPPTLAEKR VKFVVYPGGH MFYAEDAARQ SLHDEVKAMM K
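
The gene and protein listings can be related to each other, and to the GC content field, with a short script. This is a minimal sketch, not the database's own method: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the code assumes the sequence contains only A, C, G, and T. The codon table holds the standard amino-acid assignments, which translation table 11 shares; table 11 differs from table 1 only in which codons may act as start codons.

```python
# Standard amino-acid assignments, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the display spaces and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
first_bases = "ATGCCGCTGT CGCCCCGGTC CGCCGTGACC"
print(translate(first_bases))  # MPLSPRSAVT
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein listing, and `gc_percent` on the same input can be compared with the reported GC content.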