Gene Pnuc_0887 details


Gene Information

Locus tag: Pnuc_0887
Symbol:
ID: 5053905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1
Kingdom: Bacteria
Replicon accession: NC_009379
Strand:
Start bp: 879642
End bp: 880946
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 44%
IMG OID: 640471045
Product: peptidase U32
Protein accession: YP_001155667
Protein GI: 145589070
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
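
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(879642, 880946)
print(length)                     # 1305
print(protein_length_aa(length))  # 434
```

Both values agree with the Gene Length and Protein Length fields of the record.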


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0117216
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TCCCTGAACT CCTAGCCCCT GCCGGCAGCC TCCAAATGTT ACGCACCGCT 
TTTGATTTTG GTGCCGATGC AATTTATGCA GGTCAACCGC GATATTCGCT ACGCGTACGC
AACAATGACT TTGGCAAGAT TGAGGTTCTT AAACAGGGAA TCGATACCGC CCATGACTTA
GGTAAAAAGT TTTATCTAGT ATCTAACTTA TTGCCGCATG GTGGTAAAAC GCGTACCTAT
ATTAAGGATA TGGACCCAGT CATTGCATTG AAACCTGATG CAATCATCAT GTCAGATCCT
GGCCTCATCA TGATGGCAAG AGAAGCGTGG CCTGATATGC CTATTCATTT ATCCGTTCAA
GCCAATACAG TCAATGGAGC ATCTGCAAAG TTCTGGCGTT CTGTTGGTAT AAGCCGCGTT
ATTTTGTCTC GTGAACTTTC TTTTGATGAA ATCGAAGAAG TACGTCAAGA CTGTCCAGAA
ATGGAACTTG AAGTTTTTGT ACACGGTGCG CTGTGTATTG CTTACTCTGG TCGCTGCTTA
CTTTCCGGCT ATATGTCTCA CCGCGACTCT AATCAAGGCG CATGTACTAA TGCTTGCCGC
TGGGACTACA AAGTAAAGCC AGGTCAACAA AATCAAAGTG GAGATGTAGT GTTACTACAA
GAAGCTAAAA GGCCTGATGA TTTGATGCCG ATGGAAGAGG ATGAGCATGG CACCTACATC
ATGAACTCTA AAGATTTACG CGCTATTGAA CACATCGAGA AACTGACTGC GATGGGAGTA
GATTCTTTCA AAATTGAAGG TCGTACAAAA TCACCCTATT ATGTTGCTCG CACCGCTCAA
GCCTACCGTG CTGCTATTGA TGATGCTGTT GCTGGGCGAC CATTTAATAC CGCATTACTA
GGTAATTTAG AGGGATTAGC CAATCGTGGT TATACCGACG GCTTCTATGA GCGTCATCAC
GATAAAGAAT ATCAACTCTA TATGCGTGGT TACTCTCTCT CGGGTAGAAG TCTATATGTT
GGAGAAACTT TAGAGGTTGA CGAAGTAAAT GGTCGCGTCA AAGTGGATGT CAAGAATCGT
TTTTCAGTAG GTGATAAGCT AGAAATTCTT GATCCTAAAG GCAATCAAGA TCTAGTACTC
GATGCCATGT GGAATATGAG TGGCGAACCC ATTACTGTTG CACCAGGATC AGGTCACTTT
GTTTGGATTC CGTTAAAAAT TAAAGGTCAG AAAGCCTACA TTGCACGCTA CACCAATGAG
CCTGCGCCTG TTGAAACTGA AAGCGCCTGC GCAACTTGCG GTTGA
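
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. As a minimal sketch, the helpers below (hypothetical names) strip the whitespace and compute a GC percentage of the kind reported in the GC content field. Only the first row of the listing is used here for brevity; the full sequence would be passed the same way.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted listing."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence listing, copied from the record.
first_row = "ATGAAAAAAA TCCCTGAACT CCTAGCCCCT GCCGGCAGCC TCCAAATGTT ACGCACCGCT"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```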
 
Protein sequence
MKKIPELLAP AGSLQMLRTA FDFGADAIYA GQPRYSLRVR NNDFGKIEVL KQGIDTAHDL 
GKKFYLVSNL LPHGGKTRTY IKDMDPVIAL KPDAIIMSDP GLIMMAREAW PDMPIHLSVQ
ANTVNGASAK FWRSVGISRV ILSRELSFDE IEEVRQDCPE MELEVFVHGA LCIAYSGRCL
LSGYMSHRDS NQGACTNACR WDYKVKPGQQ NQSGDVVLLQ EAKRPDDLMP MEEDEHGTYI
MNSKDLRAIE HIEKLTAMGV DSFKIEGRTK SPYYVARTAQ AYRAAIDDAV AGRPFNTALL
GNLEGLANRG YTDGFYERHH DKEYQLYMRG YSLSGRSLYV GETLEVDEVN GRVKVDVKNR
FSVGDKLEIL DPKGNQDLVL DAMWNMSGEP ITVAPGSGHF VWIPLKIKGQ KAYIARYTNE
PAPVETESAC ATCG
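
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table used by NCBI translation table 11, whose codon-to-amino-acid assignments are the same as those of the standard table; table 11 differs in its set of permitted start codons, which this sketch does not handle because the CDS here starts with ATG. The `translate` helper is a hypothetical name, and only the first and last rows of the gene listing are translated for illustration.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids ("*" = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate an in-frame DNA listing, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGAAAAAAA TCCCTGAACT CCTAGCCCCT GCCGGCAGCC TCCAAATGTT ACGCACCGCT"
last_row = "CCTGCGCCTG TTGAAACTGA AAGCGCCTGC GCAACTTGCG GTTGA"
print(translate(first_row))  # MKKIPELLAPAGSLQMLRTA
print(translate(last_row))   # PAPVETESACATCG
```

Both rows are in frame because each full row of the listing holds 60 bases, a multiple of three, and the outputs match the start and end of the protein sequence shown above.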