Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnap_4801 |
Symbol | |
ID | 4685978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008761 |
Strand | + |
Start bp | 49675 |
End bp | 51168 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639826790 |
Product | transposase, IS4 family protein |
Protein accession | YP_973952 |
Protein GI | 121583526 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 0.00542003 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 101 |
Fosmid unclonability p-value | 0.0251535 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAT ACATTCAAGG AAGGGATCGC AGTCAAATCA CGTTACCTGG GCGCCTGGAT GACTACATTG GACAAGACAA TCCAGTGCGC GTGGTTGATG CATTCGTTGA TGCACTCGAT TTGGCAGAGC TCGAATTTGC GCGGATGACG CCCGCAGTGA CCGGACGTCC GGGCTATCAC CCCGCAGTGC TCCTCAAACT CTACCTTTAC GGCTACCTCA ACCGCATCCA GTCCAGCCGG CGCCTGGAGC GGGAGTGCCA GCGCAACATT GAACTGATGT GGCTTATTGG CTGCTTGACG CCTGACTTCA AGACCATCGC CGATTTCCGC AAAGACAATG GTGCGGGCAT CCGCAATGTG TGCCGCCACT TTGTGATGCT GTGCCGGGAA CTGAAACTGC TGACGCAAGC TGTTGTGGCC ATCGATGGCA GCAAATTCAA GGCGGTCAAC AACCGTGAGC GCAACTACAC CTCCGGCAAG ATCGAGCGGC GTGAGCGCGA GATTGACGAA AGCATCCAGC GCTACCTGAA CGCACTGCAA ACCCTCGATC GCACCCAGCC CGCCGAATTG CCAGCCAAAA CAGAGCGCTT GCAGGGCAAG GTTCAGAAGA TGCGTCAGCG ACTGCAAGAA CTCAAAGAGA TCAAGGCGCA GGTAGAGATG CAACCCGATA AACAGCTATC GTTGACAGAC TCGGATGCGC GGGCGATGAG CACCCACAGC ATGAAGGGCA CCGCCCTGGT GGGCTACAAC GTGCAGACGG TGGTGGAGAC CCAGCACCAC CTGATCGTGG CCCATGAAGT GACCAATACC GCCAGTGACC GGGCGCAGTT AAGCAAACAA GCGCGGGCCG CACTCGAGGC CATGGGGGTG CGTCAACTGC AAGCCCTTGC CGATCGCGGC TATTACAGCG GCCCCGAACT CAAGGCCTGC GAAGACGCGG GCATTGCCGC CTGTGTCCCC AAGCCCATGA CTTCCAATGC CCGGGCGCAG GCGCGCTTTG GCAAGGACGA CTTTATCTAC ATGGCGCGTG ATGATGAATA CCTGTGCCCG GCGCGTCAAC GGGCCATTCA CCGGTTCACC AGGGAGGAAG ATGGCCTGCA GATTCACGTC TACTGGAGCA GCGCCTGCCC AGCATGCCCG ATGAAAGCGC AATGCACCAC CAGCAACTAC CGGCGCATCA GGCGTTGGGA GCACGAAGCG GTGATGGAGG CGGTGCAGCG CCGCCTGGAC CGCCAGCCCG AGGCGATGAA GGTGCGAAAG AGCACCGTGG AGCATGTCTT TGGAACGCTC AAGCACTGGA TGGGCTGGAC GCACTTTCTC ATGCGCGGCA AAGCCAAGGT GGCAACCGAA ATGAGTCTGC ATGTTCTGGC TTACAACCTC AAGCGGGTGA TGAAAATTCT TGGCATTGCC GAGTTGCTCA AGGCCATCAC AGAGGAGGGC TTGAAAGCCC TTTGTTCACT TCAATGCCGA CAGGCAATTC AAGCTCGGGC TTAA
|
Protein sequence | MKRYIQGRDR SQITLPGRLD DYIGQDNPVR VVDAFVDALD LAELEFARMT PAVTGRPGYH PAVLLKLYLY GYLNRIQSSR RLERECQRNI ELMWLIGCLT PDFKTIADFR KDNGAGIRNV CRHFVMLCRE LKLLTQAVVA IDGSKFKAVN NRERNYTSGK IERREREIDE SIQRYLNALQ TLDRTQPAEL PAKTERLQGK VQKMRQRLQE LKEIKAQVEM QPDKQLSLTD SDARAMSTHS MKGTALVGYN VQTVVETQHH LIVAHEVTNT ASDRAQLSKQ ARAALEAMGV RQLQALADRG YYSGPELKAC EDAGIAACVP KPMTSNARAQ ARFGKDDFIY MARDDEYLCP ARQRAIHRFT REEDGLQIHV YWSSACPACP MKAQCTTSNY RRIRRWEHEA VMEAVQRRLD RQPEAMKVRK STVEHVFGTL KHWMGWTHFL MRGKAKVATE MSLHVLAYNL KRVMKILGIA ELLKAITEEG LKALCSLQCR QAIQARA
|
| |