Gene Pnap_0452 details


Gene Information

Locus tag: Pnap_0452
Symbol:
ID: 4688439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 472986
End bp: 474545
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 59%
IMG OID: 639833449
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_980695
Protein GI: 121603366
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
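
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 472986
end_bp = 474545

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1560
print(protein_length)  # 519
```

Both results agree with the Gene Length (1560 bp) and Protein Length (519 aa) fields.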


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGAT TTCTGGACAA GTTCCGGTTG ACCCCCGCAG GTAACTTGCC CGACGTGAGC 
TTGCAGAGCG CCAATTCCAG CCTGCCGGCA GCGCAGCCAG AATGGGGACC CCTGGACCTG
ACCCTCGATG GCATCGTGCT TCTTGACGAC AAAGCCGTCA TCCTTGACGC CAATGCTCCG
GCGCTTGAGT TCCTGCACAC CTCGCTGTCC GCCATCAGCG GCTACAGCTT GTGGGACGTG
GTGCCCGAGG AAGTTGCCAA GCAGAATGAA GAGGCCACCG AACTGGCTCT GGATTCTGCG
GACCGGCATA GCTTTATTGC CCATGAGAGT TTTGAAAACA GTTGGGTCGA ATACACCTTC
AGGCGATGCC CGGCAGGTTA CGTCGTCAAC CTGCGCGAAG CGGGTCCGGC CCAAAAATAC
CAGCGCCTGC TGGAAGACAG CAAACGCTAC AACCAGCTGA TTTTTGAAGC CAATCCGAAT
GTCATGTGGG TCTTTGACCG GACCACGCTT CGAATCTTTG CCGTCAACCA GGCTGCCGTC
AAGTTTTACG GCATTGCCCG CAAGATTTTC ATGACGCTTG GCATGGAAGC GCTCTTTCCC
CAGGGAGAAG GCGCCGAACT GCTGCGCTCC CTGCACACCG GCAAGGAAGA GCAGTCTGAA
ATGCGGCTGT GCAAACAGAA AAAAATGGAC GGCAAGGAAG TGCTGGTTGA GCTGGCCTGG
AGCCAGGTCA AGTGGGATGG CCATCAGGCG GTGCTGGTGA GCCTGGCCGA CATCAGTGAC
CGGCACCTCG CTGACGGCGT CCTGAAAAAA ACCAATGAGG AATTGCAAAA GACACTCGTC
GCCCAGCAGG TCGAACTGAA AAATGTCCGG CATGACCTGC TCACCTTTAC CCAGGCCGTC
TCCAGTGACT TGCAGGACTC CCTGCATGCG GCCCATGGCT TTGCCGCCAG GCTGGCTGAA
AAGTATTCGC TGGCGCTGGA CGACCAGGGC CGCCACTATG TCAGGCGCAT TCAGGCCAGC
ATCAGCCAGT TGGCCAAGCT GGTCGATGAC CTGCGAACAC TCGCCCAGCT GCCGCTGCGC
TCCGGAGTTC CTGCAATGGT CAATCTGGCA CCGGTCTGCC TTTCATTGAT CGCTGATTTG
CGCAAGCGCG ACCCGGACCG GGATGTGACC ATCGAAATGG ACAGCAAGCT GATGCTGGTC
GGGAACAAGG GGCTTTTGAC GACAGCCATG ACCTGCCTGC TGGAAAATGC CTGGAAATTC
ACCTCCAAAA AGACCGAGGC ATGGATCAAG GTCGGGCTGC TGCCCGGCAA GGCGCCGGGT
GAGCTGGTGT TGCTGGTGGC CGATAATGGC GCCGGATTTG ATCCGGTTTA CAGCGGCAAT
CTTTTCACCG CGTTTCAGCG CCTGCACTCT TCAGCCGACT TTCCGGGCAA CGGGCTGGGG
CTGGCCATCA TCAAGCGGGT TGCGCAGGTG CATGGCGGCA CGGTATGGGC TGAAAGTCCG
GGCCAGGGGG GGGCCAGTTT TTTCATGTCG CTTCCGCAAG GGGAAGCCAG CGCTTCCTGA
 
Protein sequence
MSRFLDKFRL TPAGNLPDVS LQSANSSLPA AQPEWGPLDL TLDGIVLLDD KAVILDANAP 
ALEFLHTSLS AISGYSLWDV VPEEVAKQNE EATELALDSA DRHSFIAHES FENSWVEYTF
RRCPAGYVVN LREAGPAQKY QRLLEDSKRY NQLIFEANPN VMWVFDRTTL RIFAVNQAAV
KFYGIARKIF MTLGMEALFP QGEGAELLRS LHTGKEEQSE MRLCKQKKMD GKEVLVELAW
SQVKWDGHQA VLVSLADISD RHLADGVLKK TNEELQKTLV AQQVELKNVR HDLLTFTQAV
SSDLQDSLHA AHGFAARLAE KYSLALDDQG RHYVRRIQAS ISQLAKLVDD LRTLAQLPLR
SGVPAMVNLA PVCLSLIADL RKRDPDRDVT IEMDSKLMLV GNKGLLTTAM TCLLENAWKF
TSKKTEAWIK VGLLPGKAPG ELVLLVADNG AGFDPVYSGN LFTAFQRLHS SADFPGNGLG
LAIIKRVAQV HGGTVWAESP GQGGASFFMS LPQGEASAS
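
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a minimal sketch, not the database's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, whose codon assignments translation table 11 shares (table 11 differs only in which codons may act as starts, which this sketch does not handle). Only the first 30 bases of the gene are used here; the full sequence can be pasted in the same spaced format.

```python
# Standard genetic code in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied as displayed.
first_bases = "ATGAGCCGAT TTCTGGACAA GTTCCGGTTG"
print(translate(first_bases))  # MSRFLDKFRL
```

The output matches the first ten residues of the protein sequence. Running `gc_content` on the complete gene sequence gives the value to compare against the GC content field.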