Gene Pnap_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_0149 
Symbol 
ID4686110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp159398 
End bp160423 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content68% 
IMG OID639833142 
Productlipopolysaccharide heptosyltransferase II 
Protein accessionYP_980395 
Protein GI121603066 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02195] lipopolysaccharide heptosyltransferase II 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA CTAAACCCGA AATGTCCAAC GCCCTCGTCA TCGCCCCGCA GTGGATCGGC 
GATGCGGTCA TGACCGAGCC GCTGCTGCGC CGGCTGCATG CGCGCGGCGA GCGCCTGACG
GTGGGCGCCT TGCCGTGGGT GGCGCCGGTG TACCGCGCCA TGCCGCAGGT GGCCGAGGTC
ATCGAGTTCC CGTTTGCCCA TGGCGGCCTG CAGTTCAAGG CGCGCCGCGC AATTGCCAAG
CGCATCGAAG GCCAGTTCGG CAAAGCCTAT GTGCTGCCCA ATTCGCTGAA AAGCGCCTTG
CTGCCGTTCC TGGCCAGCAT TCCCGAGCGC ATCGGCTACC TGGGCGAGGC GCGCGTCGGC
CTGCTGACGC ATCGGCTCAA GAACCCCAAG AACAAGCCGC CCATGGTGGC GTTTTATTCG
GCCCTGAGCG GCGAAGGCGA CCTGGCCAGC GACCGGCCAG AGTTGCACAT CAGCGCGGCG
GACATTGCGC TCACGCTGCA CGAACTGGGC TTGCGGCAAG GCGGCTATGT GGTGTTTGCG
CCGGGCGCCG AATTCGGCCC GGCCAAGCGC TGGCCGGCGC GCCATTTCGC CGAACTCGCC
GCCCGGCTGG ACCTGCCGGT GGTGCTGCTT GGCTCCGGCA AGGAAGCCGC GCTGTGCGAC
GAGATTGCCG CCCCCGTGAA TGCCGCGCAA GCCGGCAAGT GCCTGAACCT GGCCGGCAAA
ACCTCGCTGC CGCAGGCGCT GGCGCTGATC GCCGCCAGCC GCAGCATCGT CAGCAACGAC
TCCGGCCTGA TGCATGTCGC TGCCGCGCTG GGCGTGCCGC AGGTGGCGAT CTTCGGCTCG
TCGAGCCCGC TGCACACGCC GCCGCTGAGC GACAAGGCGC GCGTGCTCTG GCTCAAGGCC
GACCCGGCCT ACCAGCCGCC GCTGGACTGC GCGCCGTGTT TCGAGCGCGA ATGCCCGCTG
GGCCATACCC GCTGCCTGAA TGACATCGGT GCGCAGCAGG TGCTGCAGGC GCTTTCTTCC
CATTAG
 
Protein sequence
MSSTKPEMSN ALVIAPQWIG DAVMTEPLLR RLHARGERLT VGALPWVAPV YRAMPQVAEV 
IEFPFAHGGL QFKARRAIAK RIEGQFGKAY VLPNSLKSAL LPFLASIPER IGYLGEARVG
LLTHRLKNPK NKPPMVAFYS ALSGEGDLAS DRPELHISAA DIALTLHELG LRQGGYVVFA
PGAEFGPAKR WPARHFAELA ARLDLPVVLL GSGKEAALCD EIAAPVNAAQ AGKCLNLAGK
TSLPQALALI AASRSIVSND SGLMHVAAAL GVPQVAIFGS SSPLHTPPLS DKARVLWLKA
DPAYQPPLDC APCFERECPL GHTRCLNDIG AQQVLQALSS H