Gene Pnap_3942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3942 
Symbol 
ID4688624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp4208075 
End bp4209595 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content65% 
IMG OID639836960 
ProductO-antigen polymerase 
Protein accessionYP_984159 
Protein GI121606830 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00471657 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCACCT GGGGGGTGGC CGCGTTCCTG CTCCTCCCCT GGCTCAACCC CTACACTGCA 
GGCCCCACGC CCAACGTCTG GCCCCTGCTG CTTTCGACCC TGTGCGCGGT ATTCCTGTGG
ACTTTCCGGA ACCGGCTCAA TGCGAGGCTG GTCACCGCTG GCTGGCTGAT CGCCGCAGCG
ATCAGCGCGC TCATCGGCCT GGTGCAGTAC TTTGGCCTGG CATCGGCTTT GAGTCCGTTG
ATCAGCCAGA CGGAGGCCGG CGAGGCGTTT GCCAACTTGC GCCAGCGCAA CCAGTTCGCC
ACGCTGACGA GCATCGGTTT GCTCGTCCTG ATCGGCTGGC TGGCGCAACG GGACAAGCCT
GGAGGCCCGC AACCGGCTCA GCAATGGCTG ATGCCCTGGT GGGCCTGGCT GCTGGCCCTG
CTGCTGGCGC TGGGCAATGC CGTCAGCAGT TCGCGCACCG GCCTGCTGCA ATGGCTGCTG
ATTGCCGCGC TCACCGCCTG GTGGGCGCTG CCGGACCGCT GGTGGCGGCT GCCGGTGTTC
GCCGTGCTGG CCTTGCTGGC CTATGGCGTC GCCGTGCTGA TGCTGCCGTG GCTGCTGGAA
CTGGCGACCG GATTGCACAG CGATGGCCTG TTTGGCCGGC TGGTCAAATC GCCGGGCTGC
TCCAGCCGCA AGATATTGTG GTCCAACGTG CTGACGCTGA TTGCCCAAAA GCCGTGGCTG
GGCTGGGGCT GGGGCGAACT GGACTATGCC CACTTCATCA CGCTGTACCC AGGGCCGCGC
TTTTGCGAAA TCCTCGACAA CGCGCACAAC CTGCCGCTGC ACCTGGCAGT GGAACTGGGC
GTTCCGGCCG CCGTGGCGAT TTGCGGCGCC CTGGGCTGGC TGGTGTGGCG CGCCCAGCCG
TGGCGCGAAC GCAATCCGGT GCGGCAGACG GCCTGGGGCG TGCTGGCGGT CATCGGCCTG
CACAGCCTGC TGGAATATCC GCTGTGGTAC GGGCCGTTCC AGATCGCCGC CGGGCTGAGT
GCCTGGCTGC TGTTGTCAGG CGGGCGCGAA AACCAGCCAT TGCCTGATGC CGATGTTGAC
GGTCTAAATG AAAAATTCAA GCAAAAAAAG CCTTTTGCGC AATACCTGCG GGCGCTGGCA
GCTATCACTA TGATAGTAAT GTTGGCGTGC GTGGCCATCA GCTATCACCG CGTCAGCCAG
ATTTACCTGT TGCCCGACAA GCGCAGCGCC GCCTACCGCG AAGACACGCT GGACAAGGTG
CGCGGCAGCC GGCTGTTCCG CAATCAGGCG CGCTTTGCGG AGTTAAGCAT GACGCCGCTC
ACGCCTGCCA ATGCCGCCGC CGTGTACGGC CTGGCCAAGG ATATGCTGCA CTACTCGCCC
GAGCCGCGCG TGATTGAAAA GGTGATTGAA AGCGCGGTGA TGCTGGGGCG CGATGACGAA
GCCCTGCAGT ATCTGGCGCG CTACCGGGCG GCTTTTCCGC AAGACCATGC GCGCTGGGCC
AGCAAGAACG CGGGCCGGTA A
 
Protein sequence
MPTWGVAAFL LLPWLNPYTA GPTPNVWPLL LSTLCAVFLW TFRNRLNARL VTAGWLIAAA 
ISALIGLVQY FGLASALSPL ISQTEAGEAF ANLRQRNQFA TLTSIGLLVL IGWLAQRDKP
GGPQPAQQWL MPWWAWLLAL LLALGNAVSS SRTGLLQWLL IAALTAWWAL PDRWWRLPVF
AVLALLAYGV AVLMLPWLLE LATGLHSDGL FGRLVKSPGC SSRKILWSNV LTLIAQKPWL
GWGWGELDYA HFITLYPGPR FCEILDNAHN LPLHLAVELG VPAAVAICGA LGWLVWRAQP
WRERNPVRQT AWGVLAVIGL HSLLEYPLWY GPFQIAAGLS AWLLLSGGRE NQPLPDADVD
GLNEKFKQKK PFAQYLRALA AITMIVMLAC VAISYHRVSQ IYLLPDKRSA AYREDTLDKV
RGSRLFRNQA RFAELSMTPL TPANAAAVYG LAKDMLHYSP EPRVIEKVIE SAVMLGRDDE
ALQYLARYRA AFPQDHARWA SKNAGR