Gene Plav_0503 details


Gene Information

Locus tag: Plav_0503
Symbol:
ID: 5455588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 542722
End bp: 543702
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 64%
IMG OID: 640876069
Product: proline iminopeptidase
Protein accession: YP_001411783
Protein GI: 154250959
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
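
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates and assuming the reported gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Start and end coordinates taken from the record above.
length = gene_length(542722, 543702)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the reported 981 bp gene length and 326 aa protein length.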


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.624874
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGACA CCACGCCAGC CGCCACCCGC CGGACACTCT ATCCGGAGAT CGAGCCATAC 
CGGACCGGAT CGCTCAAAGT ATCCGATTTG CATACGCTCT ATTTCGAGGA ATGCGGCAAC
CCGAAGGGCA AACCCGTGGT GATCGTGCAT GGCGGGCCGG GGGGCGGCAC CAACCCGACG
ATGCGGCGGA CGCACAATCC GGACGCTTAC CGAATCATTC TGTTCGACCA ACGGGGCTGC
GGGAAGTCCA CACCCCATGC GGAGCTGCGC GAGAACACGA CATGGGACCT GGTGGCGGAC
ATGGAGCGGC TGAGGGAACA TCTCGGCATC GACCGCTGGC AGCTCTGCGG CGGCTCCTGG
GGATCGACGC TGGCGCTTGC CTATGGCGAA ACCCACCCTG CCCGCGTCAC CGAAATTATC
CTGCGCGGCA TCTTCACACT CCGGAAGCGG GAGCTCCACT GGTTCTATCA GGAGGGGACG
GATGCGCTTT TTCCCGATGC CTGGGAGGAA TTCATCGCGC CCATTCCCGA GGCCGAGCGC
GGCAACCTGA TGGCCGCCTA TTACAAGCGG CTCACCGGCG ACAATGAGGC GGAGAAGCTT
GCCTGCGCCC GGGCATGGAG CATATGGGAG GGGACGACGC TCTCGCTCTA TTCCGACCCC
GAGCGGGTGA AGCGCTTTGC CGACGGGCAT TTCGCCCTCG CCTTCGCGCG GATCGAATGC
CACTACTTCA TGAACAAGGG CTGGTTCGAG CCGCAGAACC AACTGATCCG CGAGGCCGGC
AAGCTGAAGG GCATTCCGGG CGTCATCGCG CAGGGGCGCT ATGATGTCGT GACGCCGATG
TTCACGGCAT GGGAACTCGC CAAGGCCTGG CCGGAAGCGG AGCTCACCAT CGTGCCGGAC
GCCGGACACA CGGCGACGGA GCCCGGCATT GTCGATGTGA TGGTGCGGGC GAGCGACCGC
TACGCTGCGG TGAAGAACTG A
 
Protein sequence
MPDTTPAATR RTLYPEIEPY RTGSLKVSDL HTLYFEECGN PKGKPVVIVH GGPGGGTNPT 
MRRTHNPDAY RIILFDQRGC GKSTPHAELR ENTTWDLVAD MERLREHLGI DRWQLCGGSW
GSTLALAYGE THPARVTEII LRGIFTLRKR ELHWFYQEGT DALFPDAWEE FIAPIPEAER
GNLMAAYYKR LTGDNEAEKL ACARAWSIWE GTTLSLYSDP ERVKRFADGH FALAFARIEC
HYFMNKGWFE PQNQLIREAG KLKGIPGVIA QGRYDVVTPM FTAWELAKAW PEAELTIVPD
AGHTATEPGI VDVMVRASDR YAAVKN
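
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch in Python of one way to do that and to compute GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo input is only the first line of the gene sequence, not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGCCCGACA CCACGCCAGC CGCCACCCGC CGGACACTCT ATCCGGAGAT CGAGCCATAC"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the full gene sequence through `clean_sequence` gives a string whose length can be compared with the reported 981 bp, and `gc_fraction` on that string can be compared with the reported GC content of 64%.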