Gene Pnap_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_0114 
Symbol 
ID4689464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp122049 
End bp123968 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content67% 
IMG OID639833107 
ProductATP-dependent DNA helicase RecQ 
Protein accessionYP_980360 
Protein GI121603031 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family
[TIGR01389] ATP-dependent DNA helicase RecQ 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.186389 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCTGGCC CGCCCTTGTC CTCATTGCCC ATTTCTTCAG CCCCTTCAGA CCAGCCCACG 
CCGCTGGACG TGCTTGGCCA GGTTTTTGGC TACTCCGACT TTCGCGGCCC GCAGCAAGCC
ATCGTCGAGC ATGTGATTGC TGGCGGCGAT GCGCTGGTCT TGATGCCCAC GGGCGGCGGC
AAGTCGCTGT GCTACCAGAT TCCGGCCATC GCCCGGCAAA ACGCCGGCCA CGGCGTGACC
ATCGTGATCT CGCCGCTGAT CGCGCTGATG CACGACCAGG TCGGCGCGCT TCTGGAAGCG
GGCGTGTCGG CGGCGTTCCT CAACTCCACC CAGACTTTCG AGGAAAGCAG CCAGCTGGAA
AAGCAGCTGC TGCGCAATGA GCTGACGCTG CTCTACGCCG CGCCCGAGCG CATCAACACG
CCGCGCATGA AGGGCCTGCT GGCGTCGCTG CACGAGCGCG GGCTGCTAAG CCTCTTTGCC
ATCGACGAGG CGCATTGCGT GAGCCAGTGG GGCCACGACT TCCGGCCCGA GTACCGCAGC
TTGAGCCTGT TGCACGAGAC CTTCCCCGAC GTGCCGCGCA TGGCGCTGAC CGCCACGGCC
GACGCGCTGA CGCGCCAGGA CATGATCGAG CGGCTCAAGC TCGAAGACGC GCGCTTGTTT
CTAAGCAGCT TTGACCGGCC CAACATCCGC TACACCATCG TCGAAAAGAC CGACGCGACG
CGCCAGCTGC TGCGCTTCAT CCAGGCCGAG CACCACGGCG AAGCGGGCAT CGTCTATTGC
CAGTCGCGCA AGCGCGTCGA GGAAATCGCC GGCATGCTCG AAGACGCGGG CATCAAGGCC
ATGGCCTACC ACGCCGGGCT CGATGCCAAG CTGCGCCAGC AGCGCCAGGA CCGTTTCCTG
CGCGAAGACG GCTGCGTGAT GGTGGCGACG ATTGCCTTCG GCATGGGCAT CGACAAGCCC
GACGTGCGCT TCGTCGCGCA CCTGGACATG CCAAAGAACA TCGAAGGCTA CTACCAGGAA
ACCGGCCGCG CCGGACGCGA CGGCTTGCCC GCTGACGCCT GGATGGTCTA TGGCCTGCAG
GACGTGGTGA ACCAGCGCCG CATGATCGAC ACCAGCGAAG TCGCCAGCGA GGAGTTCAAG
GCGGTGATGC GCGGCAAGCT GGACGCGCTC TTGACGCTGG CCGAGGGCAC GCGCTGCCGC
CGCGTCAGCC TGCTGGGCTA TTTTGGCGAG GCCAGCGAGC CGTGCGGCAA CTGCGACAAC
TGCCTGACCC CGCCGGCCGT GTGGGACGCG ACCGAGGCGG CGCGCAAGAT GCTCAGCTGC
ATCTACCGCG TGCAGCAGGC CAGCGGCATC AGCTTTGGCG CCGGGCACAT CATGGACATC
CTGCGCGGCA AGCCGACCGA AAAAGTCGTG CAGTACGGCC ATGACCAGCT CAGCACCTTC
GGCATCGGCG CCGACCTGGC CGAGCCGCAG TGGCGCGGCG TGCTGCGCCA GCTGATCGCC
AGCAACCTGG TGCGCGTCAA TGCCGAGGCC TTCAACACGC TGCAACTGAT GCCCGACGCG
CGCCAGGTGC TCAAGGGCGA AGTCAGCGTG CTGCTGCGCC AGCAGGCCGC CAGCGCCAAG
GCCGAGCGCA CGCGGCGCGG CAGCAAATCG ACGGTGAAAA CATCGGTCAA GGGCATGGCC
GAAGCGACGC TGAACGCCGG CGCGCTGGAA CGCCTTGGCC GCCTGAAAGC CTGGCGTACC
GACGTCGCCC GGGAGCACAA CCTGCCGCCA TTCGTGATCT TCCACGACGC CACGCTGCGC
GCGATTGCCG AGCAGGCGCC GCAAGACCTG CACGCGCTGA GCGGCATCAG CGGCATGGGC
GTGAAGAAGC TGGCAGCGTA TGGCGCCGAG GTGCTGCGGG TGTGCGCCGA GCCGGGGTAA
 
Protein sequence
MSGPPLSSLP ISSAPSDQPT PLDVLGQVFG YSDFRGPQQA IVEHVIAGGD ALVLMPTGGG 
KSLCYQIPAI ARQNAGHGVT IVISPLIALM HDQVGALLEA GVSAAFLNST QTFEESSQLE
KQLLRNELTL LYAAPERINT PRMKGLLASL HERGLLSLFA IDEAHCVSQW GHDFRPEYRS
LSLLHETFPD VPRMALTATA DALTRQDMIE RLKLEDARLF LSSFDRPNIR YTIVEKTDAT
RQLLRFIQAE HHGEAGIVYC QSRKRVEEIA GMLEDAGIKA MAYHAGLDAK LRQQRQDRFL
REDGCVMVAT IAFGMGIDKP DVRFVAHLDM PKNIEGYYQE TGRAGRDGLP ADAWMVYGLQ
DVVNQRRMID TSEVASEEFK AVMRGKLDAL LTLAEGTRCR RVSLLGYFGE ASEPCGNCDN
CLTPPAVWDA TEAARKMLSC IYRVQQASGI SFGAGHIMDI LRGKPTEKVV QYGHDQLSTF
GIGADLAEPQ WRGVLRQLIA SNLVRVNAEA FNTLQLMPDA RQVLKGEVSV LLRQQAASAK
AERTRRGSKS TVKTSVKGMA EATLNAGALE RLGRLKAWRT DVAREHNLPP FVIFHDATLR
AIAEQAPQDL HALSGISGMG VKKLAAYGAE VLRVCAEPG