Gene Pnap_4449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4449 
Symbol 
ID4685644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008758 
Strand
Start bp9739 
End bp11325 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content62% 
IMG OID639826303 
Producttransposase IS66 
Protein accessionYP_973468 
Protein GI121583027 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.543683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0438734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCGCG TCGACCAACT CAGCGCCCAG GACTTTGTCG GTCTGAGTCC CGGTGCCGCT 
GCCGAATTGG CCGCCCGCGT GCTGGCACAC GTTGGCGAGC AGAGCCGGCA AATTGAATCC
CAGGCCCAGG CCATCGAGTT CAGGGATGCC AAGATCGCCA GCATCACCTT TGAGCTGCGC
CGCCTCAAGG CCCGGCAGTT TGCGGCCAGG ACCGAACGCA TGAATGCCGA ACAGCGCCAG
CTGTTTGAAG AAACGATGGC GGCCGACCAG GCGGATCTTG AAGCCCAGCT CGCAGCGCTG
AAGGCAGCAT TCAAAGTGCC GGGCAATGCT CCAGACACCG ACACGCCTCG CAAGCCCAAA
CGCCAAGCCT TGCCCGAGCA CCTCAAACGG GTGGACCACC ACCATGAGCC AGAGAACACG
ACCTGTGACT GCGGCCAGGC CATGGTGCGT ATCGGCGAAG ACGTGAGCGA ACGCCTGGAC
ATCATTCCAG CCGAATTCTT CGTGCACCGC CACATCCGCG GCAAGTGGGC CTGCAGGTGC
TGCCAGACCC TGGTGCAAGA ACCGGTGGAG CCGCACATCA TTGACAAGGG CATGCCCACG
GCCGGTCTGG TGGCGCACAC CACCGTGAGC CGCTTCGTTG ACCACATTCC GTATTACCGC
CAGGAACAAA TCAACGCCCG CTCCGGCGTT CATACACCTC GCTCCACGCT GGCGTCGTGG
TCTGGCCAGG CAGGCGCAGC TTTGGTGCCC CTGTATGCGG CGCACCGGGA GTTTGTTCTC
AGCTGCGCAG TGGTGCACGC CGACGAGACG CCGGTGGCCA TGCTGGACCC CGGCGCGGGC
AAGACCAAAC GGGCCTACGT CTGGGCTTAT GCTCGAAGCG GCTTTGATGT CTCACCCGGC
GTGGTGTATG AATTTTGCCT GGGCAGAGGG GCCAAATACC CGCTGGAATT CCTCAAGGGC
TGGTCGGGCA CGCTGGTCTC CGACGGGTAC GGGGTGTACG AGCAGGTCCT CAAGCAGGAA
ACCCGCATCG GCGCAGCTTG TTTCGCACAC GCACGTCGAA AGTTTGATGA ACTGGTCAAG
GACCGCTTGA GTCCGGTAGG CACGCAGGCC ATCCAGCGCA TGGCAGCGCT ATACAAAATC
GAACGCCAGG TCAAAAACTT CTCGCCTGAA GATCGGCAGG CCATCAGGCA ATCGAGCGCC
AAGCCGCTTT GCCAGGATCT GCACGCCTGG TTGAAGCTGG AGCGCCAACG AGTGCCCGAG
GGCAGCGCCA CGGCCAAGGC GATTGATTAC AGCCTGAACC GCTGGGAAGC GCTGACAACT
TACCTGGCCG ATGGCAACGT CCAGATTGAT AACAACCATC TTGAGAATTT GATTCGGCCA
TGGGCAATGG GACGCCGGGC ATGGCTGTTC GCAGGCAGTG AGCTGGCCGG CCAACGTGCT
GCTGTCGTGA TGAGCTTGCT GCAGTCGGCA AAGTTACATG GCCATGACCC ATGGGCTTAT
TTAAAGGACG TGCTGACGAG GCTGCCTGGC CACATGAACT CTCGCATCGA CGAACTGCTG
CCACACCGCT GGCAACCGCA ATCCTGA
 
Protein sequence
MLRVDQLSAQ DFVGLSPGAA AELAARVLAH VGEQSRQIES QAQAIEFRDA KIASITFELR 
RLKARQFAAR TERMNAEQRQ LFEETMAADQ ADLEAQLAAL KAAFKVPGNA PDTDTPRKPK
RQALPEHLKR VDHHHEPENT TCDCGQAMVR IGEDVSERLD IIPAEFFVHR HIRGKWACRC
CQTLVQEPVE PHIIDKGMPT AGLVAHTTVS RFVDHIPYYR QEQINARSGV HTPRSTLASW
SGQAGAALVP LYAAHREFVL SCAVVHADET PVAMLDPGAG KTKRAYVWAY ARSGFDVSPG
VVYEFCLGRG AKYPLEFLKG WSGTLVSDGY GVYEQVLKQE TRIGAACFAH ARRKFDELVK
DRLSPVGTQA IQRMAALYKI ERQVKNFSPE DRQAIRQSSA KPLCQDLHAW LKLERQRVPE
GSATAKAIDY SLNRWEALTT YLADGNVQID NNHLENLIRP WAMGRRAWLF AGSELAGQRA
AVVMSLLQSA KLHGHDPWAY LKDVLTRLPG HMNSRIDELL PHRWQPQS