Gene Pnap_1747 details

Gene Information

Locus tag: Pnap_1747
Symbol:
ID: 4688853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 1855409
End bp: 1856449
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 65%
IMG OID: 639834753
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_981978
Protein GI: 121604649
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
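
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1855409
end_bp = 1856449

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed 1041 bp and 346 aa.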


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.279205
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATTA CCGCACAACT GCCCGTCGTT GGAATGGACA TTGCCAAGAA TGTGTTTCAG 
ATTCATGCCG TTGATCCGGA AACAGGCGAG ATCGAGCGCA TCAAGCTCAA GCGCGCCAAG
GTGGCCGAGT TCTTCGCCAA CCGCCAGCCC TGCCTGGTTG CCCTGGAGGC TTGCGGCGGA
GCCCACCACT GGGGACGAAC GCTGGCAGCC CAGGGCCACC AAGTCAAGCT GCTGCCAGCC
AGACAGGTCA AGGCCTTCGT GCTGCGTGAC AAGACCGATG CGCGCGATGC CCAGGCGATC
TGGGTCGCGG CCCAGCAGCC GCACATCCAT GAGGTGCCCG TCAAGAGCGA GCCGCAGCAG
GCTTGCCTGG CCCTGCACCG CATGCGCGCC CAGCTGATGA AGATGCGCAT CATGCAAACC
AACGCCCTGC GTGGGTTGCT CTGCGAGTTC GGCATCGTGC TGCCCGAAGG CCATCGCATC
TTGCTGCAGC GCATTCCCGG TGAACTGGCC CAGGCGCAGG ACAAGCTGCC TGGCGTGCTC
ATCGAGAGCA TGCAGGAGCA GCTCAGCCGC ATTGAGCGGC TGCAGCAAGA CATCGATCAC
ATCGACAGGC GCCTGGCAGC GCTGGGCAAA CAAGACCAGC ATATGCTGGC CTTGCAGGCG
GTGCCGGGCA TTGGCCCTCT CACGGCCACG GCCTTGGCGG CCACAGCCAC CGACATCTCC
GGCTTTCACT CGGGCCGGCA GTTTGCCGCA TGGCTGGGCT TGACGCCCAG GCAGAGCGGC
ACGGGCGGCA AGATCCGCCA GCTGGGCATC TCCAAGCGAG GCGATCCATA TGTGCGAACG
CTGCTGATGC ACGGGGCCAG GGCCATTATT GCCAGGACGC AGCGAACAGG CTGGATCACC
GCGCTGCTGG CCCGCAGGCC CTACAGCGTG GTGGTCGCGG CCTTGGCCAA CAAGCTGGCG
CGCACCGCCT GGGCGGTGCT GACCAAAGGC AAGGCCTTTG ATCAGCTCAG ATGGAATCCG
GCCGCTGCGG TGGCTGCCTG A
 
Protein sequence
MKITAQLPVV GMDIAKNVFQ IHAVDPETGE IERIKLKRAK VAEFFANRQP CLVALEACGG 
AHHWGRTLAA QGHQVKLLPA RQVKAFVLRD KTDARDAQAI WVAAQQPHIH EVPVKSEPQQ
ACLALHRMRA QLMKMRIMQT NALRGLLCEF GIVLPEGHRI LLQRIPGELA QAQDKLPGVL
IESMQEQLSR IERLQQDIDH IDRRLAALGK QDQHMLALQA VPGIGPLTAT ALAATATDIS
GFHSGRQFAA WLGLTPRQSG TGGKIRQLGI SKRGDPYVRT LLMHGARAII ARTQRTGWIT
ALLARRPYSV VVAALANKLA RTAWAVLTKG KAFDQLRWNP AAAVAA
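
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons), and the helper name `translate` is hypothetical. Spaces in the 10-base blocks are stripped before translation.

```python
from itertools import product

# Codon assignments in T, C, A, G order. Translation table 11 uses the same
# amino-acid assignments as the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as listed in the record.
first_line = "ATGAAGATTA CCGCACAACT GCCCGTCGTT GGAATGGACA TTGCCAAGAA TGTGTTTCAG"
print(translate(first_line))  # MKITAQLPVVGMDIAKNVFQ
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function would be expected to reproduce the complete protein.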