Gene Pnap_4049 details


Gene Information

Locus tag: Pnap_4049
Symbol:
ID: 4687509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 4317799
End bp: 4319460
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 63%
IMG OID: 639837062
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_984261
Protein GI: 121606932
COG category: [T] Signal transduction mechanisms
COG ID: [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
        [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
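
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4317799
END_BP = 4319460
GENE_LENGTH_BP = 1662
PROTEIN_LENGTH_AA = 553


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon is not counted."""
    if gene_length_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the listed Gene Length and Protein Length.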


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.92854
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGC CGCAGAACCG GCGTATTTTG CTGATCGACG ACATGCCATC GATCCACGAG 
GACTTCCGCA AGATCCTTGC GCCCCGGCCG GCCGCGCGCG AACTCGACAA CTTCGAGGCG
GCGCTGTTTG GCCAGGCCGA GGCGCCTGCG TGCGACGGCT ACGAGCTGGA CTCGGCCTAC
CAGGGGCGCG ACGGCGTGGC GATGGTCGAG GCCGCTGTCC AGGCCGGCCG GCCCTACGCC
ATGGCCTTCG TGGACATGCG CATGCCCCCG GGCTGGGACG GCGTGGAGAC CATCGAGCGG
CTCTGGCGCA TCGACCCGCA GGTGCAGGTG GTGATCTGCA CGGCCTACTC CGACCATGCC
TGGGAGGACG TGCTGGCGCG CCTGGACGTG CAGGACCGGC TGCTGATCCT GAAAAAGCCG
TTCGACATGA TCGAGGTCAG CCAGCTGGCC AGGACGCTGA CGGCCAAGTG GACGCTGGCG
CGGCAGGCGG CGTCGCAGGT GAGCGGCCTG GAGGAGGCCG TGCAGGAGCG GACCCGGGCT
TTGCACGCCA GCGAATCGCA ACTGCACCAG ATCGCGGACA CCTTGCCAGC CTTGATTGCC
TACGTGGACG CAGAACAGCG CTTTCAGTTT CACAACCAGT CCTACGAAGA AGTTTTTGGA
CTGAAGCACG AACAGATCCA CGGCAAGACC CTGCGCGAGA TGATGGGCGG CGAGGTTTAC
GGAAAAGTTC AAGGCAAGGT CGAGGAAGCC TTGGCGGGCT ACTCGGTGCA GTATGACCGC
GTGCAGAAAA CCGCCAATGG CCAGCTTCGC GACTACGTCA TGAAGTACCT GCCGCGCTAT
GGCGAAGACG AGGATGAAGG CAAGGTGCTG GGCTTTTTCG TGCTGGGCGC CGACGTGACC
GAACTCAAAC GCCTTGAGCG CATGAAAGGT GAATTTATCT CAAGCGTCAG CCACGAACTG
CGCATGCCGC TGGTCTCCAT CCGCGGCACG CTGGGCCTGA TCGCGGGAGG TGTTGCCGGC
GAGTTGCCGG CCATGGTCAA AAATCTGGTC GGCATTGCCA CCAATAACTC CGAACGCCTG
ATCAGGCTCC TCAACGACAT TATTGACAGC GAGACCATCG AATCCGGAAC CATGCACTTT
GAACTGCGGC CGGTGGAACT GCTGCCTTTG CTGGCACAGG CCGTCGCGGC CAGCGAGGGC
TTTGCCAGCC AGCACAACAT CAAGCTGATT CTTGATGGCC CGGCAGAGGC GGTGAGCGTC
AATGTTGACC ATGACCGGCT CAGCCAGGTG ATCGCCAATT TGCTGTCCAA TGCCGTGAAA
TTCTCGCCTC CGGCGGCGTC AGTACGCATC CGGCTGCTGC GCCTTGATAA CGGGCGGGTC
CGGGTTGAAG TGGCTGACAG CGGCCCCGGC ATTCCCGAAG AGTTTGGCAA ACGCATCTTC
CAGAGCGGCC CGCAGGCCGA CGCGCCGGAT ACCCAACTCA AGGACGGCAC GGACCTGGGC
CTGAATATCT CGCACTCCAT CGTCGAGCGC ATGGGCGGCA GCATGGGATT CACGACCGAT
GCCGGCACAG GCAGCGTCTT CTTCTTTGAG CTGCCCCAGG CGGTGCCGCT GCCGGTTGAT
GACAGACTGT GGGGCGCGGC CCGTCCACCG CTGAGTGCTT AG
 
Protein sequence
MNKPQNRRIL LIDDMPSIHE DFRKILAPRP AARELDNFEA ALFGQAEAPA CDGYELDSAY 
QGRDGVAMVE AAVQAGRPYA MAFVDMRMPP GWDGVETIER LWRIDPQVQV VICTAYSDHA
WEDVLARLDV QDRLLILKKP FDMIEVSQLA RTLTAKWTLA RQAASQVSGL EEAVQERTRA
LHASESQLHQ IADTLPALIA YVDAEQRFQF HNQSYEEVFG LKHEQIHGKT LREMMGGEVY
GKVQGKVEEA LAGYSVQYDR VQKTANGQLR DYVMKYLPRY GEDEDEGKVL GFFVLGADVT
ELKRLERMKG EFISSVSHEL RMPLVSIRGT LGLIAGGVAG ELPAMVKNLV GIATNNSERL
IRLLNDIIDS ETIESGTMHF ELRPVELLPL LAQAVAASEG FASQHNIKLI LDGPAEAVSV
NVDHDRLSQV IANLLSNAVK FSPPAASVRI RLLRLDNGRV RVEVADSGPG IPEEFGKRIF
QSGPQADAPD TQLKDGTDLG LNISHSIVER MGGSMGFTTD AGTGSVFFFE LPQAVPLPVD
DRLWGAARPP LSA
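
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any computation. The following minimal sketch shows one way to do that, to compute a GC fraction, and to translate a coding sequence. It assumes that the codon assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons), and all function names are hypothetical. Only the first line of the gene sequence is embedded here as sample input.

```python
# Standard codon assignments, enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First line of the gene sequence, copied from the record with its spacing.
FIRST_LINE = "ATGAACAAGC CGCAGAACCG GCGTATTTTG CTGATCGACG ACATGCCATC GATCCACGAG"


def clean_sequence(text: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    dna = clean_sequence(FIRST_LINE)
    print(translate(dna))  # MNKPQNRRILLIDDMPSIHE
    print(round(gc_content(dna), 2))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Running the same functions on the full gene sequence would allow a comparison with the GC content and Protein Length fields of the record.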