Gene Pnap_3738 details

Gene Information

Locus tag: Pnap_3738
Symbol:
ID: 4686503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 3981425
End bp: 3983011
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 71%
IMG OID: 639836756
Product: hypothetical protein
Protein accession: YP_983955
Protein GI: 121606626
COG category:
COG ID:
TIGRFAM ID:
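
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Pnap_3738.
length = gene_length_bp(3981425, 3983011)
print(length, protein_length_aa(length))  # 1587 528
```

Under these assumptions the computed values agree with the listed gene length (1587 bp) and protein length (528 aa).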


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.071134
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACC CGAGCAATCC CTCACCCGCC ACGCACTGGC AAAGCCTGCT GCCCGCCGCC 
ATGGTCGGCA CCGAAAAAAT GGCCTTTACC GCCCCCGCCC TGAGCGGCCC AGTGGGCGCG
CTGCTGGCGC AGATCGAGGC GCAGCAGGCC GCGCAGCCGG CGCTGGCGCA GCCGGCGCTG
GCGCTGCTGC AGATGGCCGG CGTGCTGGCG GTGTGCGAGC GCGCCGCCCG GCAGGGCCAG
GCCAGCGCCG CGTCCGCCCC CAACAGCGCC GCGCCCGAAA CCGCGCCAGC GCTGCACGCC
AAAGGCCTGC AGCTCAGCCT GCGCTGGGCG CTGGCCGAAG CGCCGCCCCG GCTGCAGATT
GAACTGCTGC AGCGCCTGAG CGCCGCCGGC CTGCGCCTGC CCACCGGGCT GCTGGCGCTG
GCGCTCGAAG CCGGGCAGCG CAGCGTGGCC CTGCGCCCGG CGCTGCTGCC CGTGCTGGGC
GAGCGCGGCA TGTGGCTGGC GCGGCATAAC AGTTACTGGC GCTACGCCAG CGGCACGGCG
GCCAGCGCGC CGCTGGAAAC CCGCTGGTCC GAAGGCAGCC TGGCGCAGCG CGTCGAGTTG
CTGCGCGAGC AACGCCAGCT TGATCCGGCC AGCGCCCGCG AACGCCTAAA AACCGCCCTG
CCCGACCTGC CCGCCAAGGA CCGCGCCGAA CTTGCCGCCG TGCTGATCGA AGGCTTGTCG
ATGGACGATG AGGCGCTGCT CGTTGCCCTG TGCAAAGACC GGGGCGCCGA CGTGCGGCAA
ACCGCCCGCG CCCTGCTGGT GCAGCTGCCC GACAGCGCCC AGACCCAGCG CGCCATCGCC
CGGCTGCAGC CTTGCCTGGA AAAGCCCTCC ACCCTGAAAA GCCTGCTGGG CGCGAAATGG
AAAATCCAGG CCCCGGAGGC CGCCGGCGAC GACTGGAAAG CCGATGGCCT CGAAGCCGAG
CGCCCCAAGT TTGAGACCAT GGGCGAGCGC GCCTGGTGGC TCTACCAGCT GGCGCGGCAG
GTGCCGCTGG ACTGGTGGAC GCAGCACACC GGCCTGACGC CCGATGCGCT GGTGCAATGG
GCCGCCAAGG GAGACTGGAA CGAGGCGCTG GTGCGCGCCT GGCTCGACGT GCTGCGCACC
GCGCCCGACG CCGCCTGGTG CCGGGCATTT CTGGACCACT GGCCCGGCAA GGCACTGAAG
CAAACGTCCG CCGTCGTGCT GGCCCTGCTG CCGCCCGCCC AGCGCGAGCC TTACTGGATG
CAGACGCTGC AGCAGATCAG CCCCAAGAAT GTGGACGGCT TCACCCAGCT CATCAACCAG
ATTCTGCAGG CCTGCCCGCC CGGCGAACAG GTCTCGCCGG CCCTGTCCGG GCTGCTGCTG
GAAAAGCTGC CGCTCTACCT GAAAAAGTAC TACATCGACG CCTCGCTGGC CGAGCTGTGC
TGCGTGCTGC ATTCCGACAT GCTGCCCGGC CTGGTGCGGC CCCCGTCCGA AGACGACCCG
CTGGCCTACG CGCTGCGAAA CCATCAGCCG GTGATTGCGG CCCGCCGCGC CTTTTCCAAC
CTGACCCCCG CCGTAATCCC TTCCTGA
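
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and newlines from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from the record, as sample input.
first_line = "ATGAGCAACC CGAGCAATCC CTCACCCGCC ACGCACTGGC AAAGCCTGCT GCCCGCCGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 1587 bp sequence to `gc_fraction` would give a value to compare against the reported 71%.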
 
Protein sequence
MSNPSNPSPA THWQSLLPAA MVGTEKMAFT APALSGPVGA LLAQIEAQQA AQPALAQPAL 
ALLQMAGVLA VCERAARQGQ ASAASAPNSA APETAPALHA KGLQLSLRWA LAEAPPRLQI
ELLQRLSAAG LRLPTGLLAL ALEAGQRSVA LRPALLPVLG ERGMWLARHN SYWRYASGTA
ASAPLETRWS EGSLAQRVEL LREQRQLDPA SARERLKTAL PDLPAKDRAE LAAVLIEGLS
MDDEALLVAL CKDRGADVRQ TARALLVQLP DSAQTQRAIA RLQPCLEKPS TLKSLLGAKW
KIQAPEAAGD DWKADGLEAE RPKFETMGER AWWLYQLARQ VPLDWWTQHT GLTPDALVQW
AAKGDWNEAL VRAWLDVLRT APDAAWCRAF LDHWPGKALK QTSAVVLALL PPAQREPYWM
QTLQQISPKN VDGFTQLINQ ILQACPPGEQ VSPALSGLLL EKLPLYLKKY YIDASLAELC
CVLHSDMLPG LVRPPSEDDP LAYALRNHQP VIAARRAFSN LTPAVIPS
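
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, not the database's own method. It relies on translation table 11 sharing its codon assignments with the standard genetic code, and it ignores the alternative start codons of table 11 because this gene starts with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(raw: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("ATGAGCAACC CGAGCAATCC CTCACCCGCC"))  # MSNPSNPSPA
```

The output matches the first ten residues of the listed protein sequence.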