Gene Pnap_4086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4086 
Symbol 
ID4686153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp4367746 
End bp4370757 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content63% 
IMG OID639837099 
Productpentapeptide repeat-containing protein 
Protein accessionYP_984298 
Protein GI121606969 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.791775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.807746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAC ACCGCCTTGC CCCGGTTCTA CTGGCCGCAG CTGTTCTGGC TGCGTCCTGC 
GGCGGCGGTG ACACGCCATC CCCGGCAGGC GGCACCGGCA TCAACACCGG GGTCAGCCCC
GGCACCGGCA GCGTGCCGCT GTACCAGTCC TCCACCGAGA CGCTGACACT GCCGCAGCTG
CGGGTAGCCG GCAACATCCG GGCCGATGTC AAGCTTGTGC TCAAGAAGGA CGGCAACTGG
GCGCTGCTTT CCTCCGGCCC GACTCGTCCC GCCACCGCAT CCGACACGCC CGGCGCCGCG
CTGGCCGCGC CCGGCGGCAA CACCGACCTG GGCGGCACGC AGACCGACAC GACGCTGACG
GTGGCGCGGC TGCACGTAGG CTCTCGCGTG TTTGGCAACG TGGCGGTCCG GCTCACCGGC
AAGGCCTGGG CCTTTGTGAG CAGCCCGCAA GAGGTGAAGA CGCTGCACCA GGAGGACTTC
AAATCCAACA CGGCGATTCG CGCCGACGAG TCGCACCACG TCATCCTTCA ATCCAGCCCG
GACAGCGGTG TGCAGAACGT GCCGATGCAG TTGTCCGGCA GAAACTACAA GTTCTGCATG
GACGCGCAGG CCGAGGGCGC CGACAGCACG ACGCTGCTGG ATGCGGCGGG CCATACCATT
TTTACGCTCA AGGCGGGTGA GCCTTGCGTC ACCCTCCAGG CCAGGGAGGG GGCGTACACC
CTGCAGCACC GCTATGGCGG CACCGGCAGC GCCCGCACCC TGTTCATGCG CAACCAGGCC
AACACCACGA CAGCCACTGC GCTGGCGGCG CAAGCAGCGT CCGCACCGCC CGGTTTGCTC
AACGCGCCCA AGCTGCTGGC CGCTTCGGCA ACCGCGACAA ACGCCATGGC GCCGGTCGCC
GAATACTGGT CTGTCCGCAA CCCAGCGGCA AATTCCGGCG CGCAACCCAG ATCGCTCGGA
AATGCGGGTA CTTTTTACGC GGAGCCGGTT TATGGGCTTG ATGCTCTCGG CACGGGCTGC
AACGGAAAGA TCGCCTTCAG CTTCATGAAT CCCTGGAGCG CCCAGGCACT ATTTCGCATC
GACAAGAATG CTTTCAACGT ACCCGTCTAC ATGGGCGTTC CTCTTGGATG TGAGTTTTAT
GCGATGGAGC TTTATTCTGG AACGGGTTCG ATTCCACCGT TCGTATACGG CAACGCCATT
TACCCTCAAT ACGACCCCAC GCTGGCATCC ACCCTCTGGA ACAGCGGGCT TGGCACCACC
TACGCCAGCC TGGTAAAGGA TGACTACCTG ACAGAAGTGG ATGACAACTC TCTGCGCCTG
TATCCAACGC TTGGCCTGGA TGCTTTTGTC GTGCCGTCGC ATCCGACCGT CACCATTGGC
GGGGTGGCGT ACCCCGCTCC AATTCCGATG CCAGATCCGA ACGCGCCTGA GGGCTTCTAC
AGCATGGGCG GTTTTTTTGC GAGTTCCACC CTGGTGGAAC GCAGGCAACA AACCATCACC
GAACAGTCAA GCACGCGCTT TACCCTGGCG TCCATGTATT CGACCGTGTA TTCGCCGGCG
GGTGTACAGG TGAGCAGCGC GACGGCACCT GTTCTGGCCA CCAGCGACTC CCTGCTCACG
GTGGGAAATG CAGCCGCAGG CGATGTGGCC TCCTCGCTCT CGATTGCCTA CCGCTATTAC
CCAGACGGAC CGCCATCGGC TGTTTTAGGT TTCGGCCAGA TTGCGCTCTA CACCGGGCCC
AACTGCACGG GCGCCGTGGT CATGCCACAG GCTGACGTGC CGACCTTCGA CGTCTCGGGC
GCTCCTGCGC TCAAGGGACT TGGCACCTCG TTCAAGCTGG GCCTGCAGAC CAGCGCCACG
GCTTTTTCAT TGCCGCTGTA CAACGGCGAG CAGCAGCACT TTGACCAGCT GACCTGCTAT
TCCGGCGGCT TTGGCTCAAC AGGCTGGACG CCCCAATCCA TGCAGATCGC AGTGGACACC
GTCACCATGG TCATCAGCAC CGACTCGTGT GAATACTGCA ACCTGGCCGG TGTCGATTTT
TCCGGCGTCA ACCTCACCAA CGTCAAGCTG AGCTACGCCA ACCTGAACGG CGCCATCCTG
TCCAACATCG ACCTGTCGGG CGCGGACCTG CGCAGCGTCA GCCTGCAGGG CGCCTACCTC
ATCAATGCCA ATCTGGACGG CGCCAATTTA TGCGCGGCGC AATTGAACGG CAGCCAGGGC
GTCACCCAGG CGGCCACGCT GACCGGGGCG CACCTGCGCA ACACCAACCT GGCGCTGTCC
AACCTCGATG GCGTGAAGCT GTCATCAGCC AGCTTTTACA GCAGCAATGG GCAGGGCACC
TGCCAGCAGA CCAGCTGCAG CAGCTATGTG GCCTCCACCT GTGCCAGCGC CTACAACGCC
TCCCTCAACA ACGCGAGTTT TGATTCGGCC TACCTGTCCA ATGTCGATAT GAGCAACGTC
ACCGGCGCGG GTGTCAGCTT CAACAACGCC GCCTTGTTTG GCGTGCTGTT TGGCCAGGCC
AATTTGGCGC ACAACAAGCT TTCATCCGTC AGTTCATCCT TCATCAATGC CTACCTGCAG
GGAACCGACC TGTCTCGCGC CAACCTGCAG TTCGCAGACT TCACGGGGGC GCAGTTTGAT
GCGGCCAGCA ATTGCATCCA GGCCAATTTA AACCCCGCCT ATTCAAACTT CCCCGGCGCC
AAGGTGCCCG CCAGCCCGGG CAGCTCGACC TGCGTGCCGG GCAAGCCTGC TGCGGCGTTT
TGCGTCCAGT CTTCTTTCGC CCCATCGGCG GGCTACCCCC AGACCGACTG CACCAACATC
TGCGCCGATG GGAGCACGGC AGGCGTCGGG CTGACCAATG GCACCTGCCC GAACGCTTTC
ACCTGCTCCT CGGCAAGCTG GACCACGCCG CTGAACGGCG GAGGCAACGG CGCCATGCCC
ACCAGCAACT GCCAGGGCGC AGCGGCGCTG TGCGGCAACC CGTTCACGGG CGGCGCCGAC
CCGTGCTGGT AA
 
Protein sequence
MMKHRLAPVL LAAAVLAASC GGGDTPSPAG GTGINTGVSP GTGSVPLYQS STETLTLPQL 
RVAGNIRADV KLVLKKDGNW ALLSSGPTRP ATASDTPGAA LAAPGGNTDL GGTQTDTTLT
VARLHVGSRV FGNVAVRLTG KAWAFVSSPQ EVKTLHQEDF KSNTAIRADE SHHVILQSSP
DSGVQNVPMQ LSGRNYKFCM DAQAEGADST TLLDAAGHTI FTLKAGEPCV TLQAREGAYT
LQHRYGGTGS ARTLFMRNQA NTTTATALAA QAASAPPGLL NAPKLLAASA TATNAMAPVA
EYWSVRNPAA NSGAQPRSLG NAGTFYAEPV YGLDALGTGC NGKIAFSFMN PWSAQALFRI
DKNAFNVPVY MGVPLGCEFY AMELYSGTGS IPPFVYGNAI YPQYDPTLAS TLWNSGLGTT
YASLVKDDYL TEVDDNSLRL YPTLGLDAFV VPSHPTVTIG GVAYPAPIPM PDPNAPEGFY
SMGGFFASST LVERRQQTIT EQSSTRFTLA SMYSTVYSPA GVQVSSATAP VLATSDSLLT
VGNAAAGDVA SSLSIAYRYY PDGPPSAVLG FGQIALYTGP NCTGAVVMPQ ADVPTFDVSG
APALKGLGTS FKLGLQTSAT AFSLPLYNGE QQHFDQLTCY SGGFGSTGWT PQSMQIAVDT
VTMVISTDSC EYCNLAGVDF SGVNLTNVKL SYANLNGAIL SNIDLSGADL RSVSLQGAYL
INANLDGANL CAAQLNGSQG VTQAATLTGA HLRNTNLALS NLDGVKLSSA SFYSSNGQGT
CQQTSCSSYV ASTCASAYNA SLNNASFDSA YLSNVDMSNV TGAGVSFNNA ALFGVLFGQA
NLAHNKLSSV SSSFINAYLQ GTDLSRANLQ FADFTGAQFD AASNCIQANL NPAYSNFPGA
KVPASPGSST CVPGKPAAAF CVQSSFAPSA GYPQTDCTNI CADGSTAGVG LTNGTCPNAF
TCSSASWTTP LNGGGNGAMP TSNCQGAAAL CGNPFTGGAD PCW