Gene Pnap_1065 details

Gene Information

Locus tag: Pnap_1065
Symbol:
ID: 4686650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 1125856
End bp: 1129221
Gene Length: 3366 bp
Protein Length: 1121 aa
Translation table: 11
GC content: 60%
IMG OID: 639834064
Product: transglutaminase domain-containing protein
Protein accession: YP_981303
Protein GI: 121603974
COG category: [E] Amino acid transport and metabolism; [S] Function unknown
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases; [COG4196] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
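
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. Neither convention is stated in the record, and the variable names are chosen here for the example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1125856
end_bp = 1129221
gene_length_bp = 3366
protein_length_aa = 1121

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon that is not
# counted in the protein length.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # coordinates agree with gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop match protein length
```

Under these assumptions all three checks agree with the listed values: 3366 bp corresponds to 1122 codons, that is 1121 residues plus one stop codon.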


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.417532
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCCTG TCCGCGTCGC GCCCACCACC CATCGAAAGG CACCTCCAAT GGCAATCCGT 
GTTGCGCTGC ACCACAAAAC CAGCTACCAC TATGATCGCC TGGTTTCCCT GTCACCGCAT
GAAGTTCGCC TGCGTCCGGC GCCTCATGCG CGTACCCCGA TTCTTTCGTA TTCGCTGACC
GTCACACCAG AAGACCACTT CATCAACTGG CAGCAGGATC CTTACGGCAA TTACATTGGC
CGCTACGTAT TTCCTGAAAA ATCGGACAAG CTGGAATTCA CGGTTGACCT GGTGGCGGAC
ATGACCGTCA TCAATCCATT TGATTTCTTC GTCGAAAAAT ATGCTGAATT TTTCCCATTC
AGTTACCCGG TCCAGCTCAC GGCCGAGTTA AGCCCCTATT TGAAGACGGA AGAGCCTGGC
GAATTGCTGG TCGAATGGGT CGCAGCGGCC CGTCGCGATA TTCTGAAGAA AAACCTGGCG
ACCAACGACT TTCTGGTGGC TGTCAATCAA CGGCTCCAGG GTGATATTGC CTATTTGCTG
CGGATGGAGC CTGGCGTGCA AACACCCGAG GACACCCTGC AAAAACGCTC GGGCTCGTGC
CGCGACACCG GCTGGTTGCT GGTGCAAATT TTTCGCGAGA TGGGCTTGGC GGCACGCTTT
GTGTCGGGTT ACCTGATTCA GTTGCGTGCT GATCAGGCAT CGCTCGACGG CCCCAGCGGA
ACCGAAGTTG ACTTTACCGA CCTGCATGCC TGGACCGAGG TGTATATCCC CGGCGCAGGC
TGGATTGGAC TGGACCCGAC CTCAGGCATG CTGGCCAGTG AAGGCCATAT TCCGCTGGCC
TGTACCGCCA TGCCGTCATC GGCGGCCCCG GTCACCGGCT TTACCGACAA GGCCGAGGTC
ACATTCTTTC ATGAAATGAC GATCACGCGC ATTCACGAAG ATCCGCGCGT GACCAAGCCT
TACAAGGAAG AAGACTGGCA AAAAATCGAC CTGCTCGGCC AGCAGGTTGA CGCGGATCTG
AAGCGCCAGG ATGTGCGCCT GACCCAAGGC GGCGAGCCTA CGTTTGTTTC CATCGACGAT
ATGGATGGAG CCGAATGGAA TACCCTGGCG CACGGCGATA AAAAACGCGA ACTGGCCGGC
CAGCTCATGC ACCGCCTGAA AAATCATTTT GCGCCCGGGG GCATGCTGCA TTACGGACAG
GGAAAATGGT ATCCGGGCGA GCCCTTGCCA CGCTGGGCCT TGAACATCTA TTGGCGCATC
GACGGTCAAC CGATGTGGCT CGACCCCACG CTGTTTTCGG ATGAGAACAA GAACGACGGC
TATGGTTATG AAGATGCCGA ACGCTTTGCC GCTGAACTGG TCAAGTCACT GGGCCTGCCG
GCTTTCAGCC AGATTCCGGC TTATGAGGAC GTCGTTCAGC AAGCTCATCT GGAGCAAGGG
CTGCCAGTCA ACCTGGACCC ACTCCAGGTT GACCTGAAGG CGTCGGAGCA GCGGCGGCGC
CTGGCGCGCC TGCTGGAAAC GGGCCTGGGC CAGGTCGTGG GCTATGTCTT GCCGTTAAAA
CCCAAGGACC TGGACCCCAA TATCGCATTG GGCACGGCCT GGCGCACTTC CCCATGGCCG
CTCAAACGCG ATCACCTGTA CCTGACCGAA GGCGACTCGC CCATGGGACT GCGCCTGCCT
TTGAATGCGC TTCCCTGGGT GCTGCCCGAA GAAAAGGAGC CCGAGTTTGA TGTGGATCCG
TTCGCTCCCC GGACCGCACT GGATGCCGGG CAAAAGGAAA AGCGGGCCGC CCTCAAGGCC
AGGAAACCTG CTCCGGTCGA TGGCAACCCC GAGCCACGCG ACGTGATTCA TACCGCGCTG
TGCGTGCAGG TGCGCAATGG ACGCCTGCAT GTATTCATGC CGCCAGTCCA ACGCATCGAG
GACTACCTGG CCCTGGTCAC TTCCGTTGAA AATACCGCCG CCAAGCTGAA ACTGAAACTG
TGGATTGAAG GCTATCCGCC TCCGCGCGAC CCGCGCATCA AGCTGCTCAG CGTGACGCCC
GATCCCGGCG TCATCGAGGT GAACATTCAC CCGGCGGCCA GCTGGAACGA GCTGGTCTAC
AACATGACTA CCTTGTATGA AGAGGCGCGA CTCACGCGCC TTGGCACTGA AAAGTTCATG
GTCGATGGCC GGCACACCGG CACAGGTGGT GGCAACCACG CCACGCTCGG CGGCGCAACG
GCCGAAGACA GTCCGATGCT GCGCCGCCCG GACCTGCTCA AGAGCCTGAT CACCTACTGG
CAAAACCATC CGGCGCTGTC CTATCTGTTT TCAGGCACGT TCATCGGCCC CACCAGCCAG
GCGCCCCGGG TTGACGAAGC ACGGGACGAC AACCTGTACG AACTGGCCAT TGCCTTCCAG
CAGATGGACA AGGTGCTGCC CACCATGCAG CCCGGCGACA AGCCCTGGAT GGTGGACCGC
TTGCTGCGCA ATTTGCTGGT GGACCTGACC GGAAATACGC ACCGCTCGGA ATTCTCCATC
GACAAGCTGT ATTCGCCCGA TGGCCCGACC GGCCGTTTGG GGCTGGTCGA ATTCCGCGCC
TTCGAGATGC CGCCGCACGA GCGCATGAGC TTGCTGCAAA TGCTGTTGCT GCGCGCGCTG
GTGGCCCGTT TCTGGCGCCA GCCCTACCAG GCCAAGCTGG TGCACTGGGG AACGGCATTG
CATGACCGCT GGATGCTGCC GCACTTTGTG GCGCAGGACA TCCGCGACGT GGTCAAGGAT
TTGCGCGCCG CAGGCTATGC GTTTGAAGAG CACTGGTTCG ATCCTTTCAT CGAATTCCGT
TTCCCGCGTT TTGGCACGGT GGTCTATGAG GGCGTGGAAA TGGAATTGCG CCAGGCCATC
GAGCCCTGGA ATGTGCTGGG TGAGGAAATG ACCGGCGGCG GCACCGCGCG CTATGTCGAT
TCGTCGGTGG AGCGCATGCA GCTGCTGGTG CGCGGACTGA CCGACGGGCG CCACGTGATT
GCCTGCAATG GTCGCATGCT GCCCTTGCAT CCGACCGGCA TTCCGGGCGA ATACGTGGCG
GGTGTCCGCT TTCGCGCCTG GAGTCCGTGG TCGGCCCTGC ATCCGACCAT CCGGGTGCAG
GCGCCCTTGA CGTTTGACCT GGTCGATACC TGGAGCGGCC GCGCCATTGG CGGCTGCACC
TACCATGTCA CGCATCCGGG CGGACGCAGC GAGGAAAGTT CGCCCATCAA TGCCAACGCT
GCCGAAGCAC GCCGATTTGC CCGCTTTTGG GCGCATGGGC ATACACCGGG ACCGATGAGC
GTGTATAAGG AAGAGTTGAA TCCAGGCTTT CCCATGACGC TCGACTTGCG TTGGCAGCCT
TACTGA
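
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following sketch shows one way to do that and to compute a GC percentage like the one in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as a demonstration, so the printed percentage describes that line rather than the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGCGGCCTG TCCGCGTCGC GCCCACCACC CATCGAAAGG CACCTCCAAT GGCAATCCGT"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on the first line
print(round(gc_percent(seq), 1))   # GC percentage of the first line only
```

Passing the full gene sequence through the same two functions would give a value that can be compared with the reported GC content.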
 
Protein sequence
MRPVRVAPTT HRKAPPMAIR VALHHKTSYH YDRLVSLSPH EVRLRPAPHA RTPILSYSLT 
VTPEDHFINW QQDPYGNYIG RYVFPEKSDK LEFTVDLVAD MTVINPFDFF VEKYAEFFPF
SYPVQLTAEL SPYLKTEEPG ELLVEWVAAA RRDILKKNLA TNDFLVAVNQ RLQGDIAYLL
RMEPGVQTPE DTLQKRSGSC RDTGWLLVQI FREMGLAARF VSGYLIQLRA DQASLDGPSG
TEVDFTDLHA WTEVYIPGAG WIGLDPTSGM LASEGHIPLA CTAMPSSAAP VTGFTDKAEV
TFFHEMTITR IHEDPRVTKP YKEEDWQKID LLGQQVDADL KRQDVRLTQG GEPTFVSIDD
MDGAEWNTLA HGDKKRELAG QLMHRLKNHF APGGMLHYGQ GKWYPGEPLP RWALNIYWRI
DGQPMWLDPT LFSDENKNDG YGYEDAERFA AELVKSLGLP AFSQIPAYED VVQQAHLEQG
LPVNLDPLQV DLKASEQRRR LARLLETGLG QVVGYVLPLK PKDLDPNIAL GTAWRTSPWP
LKRDHLYLTE GDSPMGLRLP LNALPWVLPE EKEPEFDVDP FAPRTALDAG QKEKRAALKA
RKPAPVDGNP EPRDVIHTAL CVQVRNGRLH VFMPPVQRIE DYLALVTSVE NTAAKLKLKL
WIEGYPPPRD PRIKLLSVTP DPGVIEVNIH PAASWNELVY NMTTLYEEAR LTRLGTEKFM
VDGRHTGTGG GNHATLGGAT AEDSPMLRRP DLLKSLITYW QNHPALSYLF SGTFIGPTSQ
APRVDEARDD NLYELAIAFQ QMDKVLPTMQ PGDKPWMVDR LLRNLLVDLT GNTHRSEFSI
DKLYSPDGPT GRLGLVEFRA FEMPPHERMS LLQMLLLRAL VARFWRQPYQ AKLVHWGTAL
HDRWMLPHFV AQDIRDVVKD LRAAGYAFEE HWFDPFIEFR FPRFGTVVYE GVEMELRQAI
EPWNVLGEEM TGGGTARYVD SSVERMQLLV RGLTDGRHVI ACNGRMLPLH PTGIPGEYVA
GVRFRAWSPW SALHPTIRVQ APLTFDLVDT WSGRAIGGCT YHVTHPGGRS EESSPINANA
AEARRFARFW AHGHTPGPMS VYKEELNPGF PMTLDLRWQP Y
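
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual TCAG ordering and translates until the first stop codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Because this gene starts with ATG, no special handling of alternative starts is included. The `translate` function is a hypothetical helper, not the database's own method.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so sequences printed in 10-base blocks can be
    passed in directly. Trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence and its final 6 bases.
print(translate("ATGCGGCCTG TCCGCGTCGC GCCCACCACC"))  # MRPVRVAPTT
print(translate("TACTGA"))                            # Y
```

The two outputs match the first ten residues and the last residue of the protein sequence listed above.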