Gene Pnap_4801 details

Gene Information

Locus tag: Pnap_4801
Symbol:
ID: 4685978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008761
Strand:
Start bp: 49675
End bp: 51168
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 58%
IMG OID: 639826790
Product: transposase, IS4 family protein
Protein accession: YP_973952
Protein GI: 121583526
COG category: [L] Replication, recombination and repair
COG ID: [COG3666] Transposase and inactivated derivatives
TIGRFAM ID:
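
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(49675, 51168)
print(length)                  # 1494
print(protein_length(length))  # 497
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.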


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value: 0.00542003
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 101
Fosmid unclonability p-value: 0.0251535
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGAT ACATTCAAGG AAGGGATCGC AGTCAAATCA CGTTACCTGG GCGCCTGGAT 
GACTACATTG GACAAGACAA TCCAGTGCGC GTGGTTGATG CATTCGTTGA TGCACTCGAT
TTGGCAGAGC TCGAATTTGC GCGGATGACG CCCGCAGTGA CCGGACGTCC GGGCTATCAC
CCCGCAGTGC TCCTCAAACT CTACCTTTAC GGCTACCTCA ACCGCATCCA GTCCAGCCGG
CGCCTGGAGC GGGAGTGCCA GCGCAACATT GAACTGATGT GGCTTATTGG CTGCTTGACG
CCTGACTTCA AGACCATCGC CGATTTCCGC AAAGACAATG GTGCGGGCAT CCGCAATGTG
TGCCGCCACT TTGTGATGCT GTGCCGGGAA CTGAAACTGC TGACGCAAGC TGTTGTGGCC
ATCGATGGCA GCAAATTCAA GGCGGTCAAC AACCGTGAGC GCAACTACAC CTCCGGCAAG
ATCGAGCGGC GTGAGCGCGA GATTGACGAA AGCATCCAGC GCTACCTGAA CGCACTGCAA
ACCCTCGATC GCACCCAGCC CGCCGAATTG CCAGCCAAAA CAGAGCGCTT GCAGGGCAAG
GTTCAGAAGA TGCGTCAGCG ACTGCAAGAA CTCAAAGAGA TCAAGGCGCA GGTAGAGATG
CAACCCGATA AACAGCTATC GTTGACAGAC TCGGATGCGC GGGCGATGAG CACCCACAGC
ATGAAGGGCA CCGCCCTGGT GGGCTACAAC GTGCAGACGG TGGTGGAGAC CCAGCACCAC
CTGATCGTGG CCCATGAAGT GACCAATACC GCCAGTGACC GGGCGCAGTT AAGCAAACAA
GCGCGGGCCG CACTCGAGGC CATGGGGGTG CGTCAACTGC AAGCCCTTGC CGATCGCGGC
TATTACAGCG GCCCCGAACT CAAGGCCTGC GAAGACGCGG GCATTGCCGC CTGTGTCCCC
AAGCCCATGA CTTCCAATGC CCGGGCGCAG GCGCGCTTTG GCAAGGACGA CTTTATCTAC
ATGGCGCGTG ATGATGAATA CCTGTGCCCG GCGCGTCAAC GGGCCATTCA CCGGTTCACC
AGGGAGGAAG ATGGCCTGCA GATTCACGTC TACTGGAGCA GCGCCTGCCC AGCATGCCCG
ATGAAAGCGC AATGCACCAC CAGCAACTAC CGGCGCATCA GGCGTTGGGA GCACGAAGCG
GTGATGGAGG CGGTGCAGCG CCGCCTGGAC CGCCAGCCCG AGGCGATGAA GGTGCGAAAG
AGCACCGTGG AGCATGTCTT TGGAACGCTC AAGCACTGGA TGGGCTGGAC GCACTTTCTC
ATGCGCGGCA AAGCCAAGGT GGCAACCGAA ATGAGTCTGC ATGTTCTGGC TTACAACCTC
AAGCGGGTGA TGAAAATTCT TGGCATTGCC GAGTTGCTCA AGGCCATCAC AGAGGAGGGC
TTGAAAGCCC TTTGTTCACT TCAATGCCGA CAGGCAATTC AAGCTCGGGC TTAA
 
Protein sequence
MKRYIQGRDR SQITLPGRLD DYIGQDNPVR VVDAFVDALD LAELEFARMT PAVTGRPGYH 
PAVLLKLYLY GYLNRIQSSR RLERECQRNI ELMWLIGCLT PDFKTIADFR KDNGAGIRNV
CRHFVMLCRE LKLLTQAVVA IDGSKFKAVN NRERNYTSGK IERREREIDE SIQRYLNALQ
TLDRTQPAEL PAKTERLQGK VQKMRQRLQE LKEIKAQVEM QPDKQLSLTD SDARAMSTHS
MKGTALVGYN VQTVVETQHH LIVAHEVTNT ASDRAQLSKQ ARAALEAMGV RQLQALADRG
YYSGPELKAC EDAGIAACVP KPMTSNARAQ ARFGKDDFIY MARDDEYLCP ARQRAIHRFT
REEDGLQIHV YWSSACPACP MKAQCTTSNY RRIRRWEHEA VMEAVQRRLD RQPEAMKVRK
STVEHVFGTL KHWMGWTHFL MRGKAKVATE MSLHVLAYNL KRVMKILGIA ELLKAITEEG
LKALCSLQCR QAIQARA
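
The sequences above are printed in blocks of ten characters separated by spaces, so they usually need to be normalized before any further analysis. A minimal sketch of one way to do this in plain Python follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from block-formatted sequence text."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence listed above.
fragment = clean_sequence("ATGAAGCGAT ACATTCAAGG\n")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.4
```

Applying `clean_sequence` to the full gene and protein listings gives strings whose lengths can be compared with the Gene Length and Protein Length fields, and `gc_fraction` on the full gene can be compared with the reported GC content.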