Gene Pnap_4235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4235 
Symbol 
ID4685282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008757 
Strand
Start bp107238 
End bp109205 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content53% 
IMG OID639826095 
Producttype II secretion system protein E 
Protein accessionYP_973260 
Protein GI121582818 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATGG TGAACCTTTT TAGAAAACTT TTTTTACGAC CCGCACTGGT TGAAAATCAA 
GTTATCGCTT CGGACATTGC AGCAGCCGTC AATTTTCGCC CGCGCAAGCC CCTGAAGGTT
TCGGTGGCTG ACGCCATGGC CGACAGTGAA GACATCCCTA GCAATATCGA AGCGTTCGCT
ACTTCCGTAG CTTCTACAAA TGTCCAGCAC GGGCCATTTG GAAAATCCAT TTCCGTGGAA
ACCATCAAAG ACTTTGCAAC GCTTCCTCTG TTGTTTGAAG GTCTCGTCAA TGACTTCCAG
CTCGGCAGTC ACCTTGGCTA TAACGTCCAG CCCGCACTGT TTGGAGGCGT TCAGGGAAGT
GAGGCCGCAA GGATTGCCAT CCTCGTCAAT CCTTTGTATG CCAACTCGGA TGAAGTGGTA
GAACTCCTTT CGATCTTGGC AAATCCCAAG CAACACAAGC ACGAATTCAA ACTCTGGGAA
CACGCCTCCA GAGTCATCAT TCCGCATTCG CTGATGCTCT CGCTAAGCCG CGGGGAAATT
GACAAAAACA GCTTGAGCAA TCGCAGAAAA ATAAATTCCG ATCCGAAGAA GCATTCCTTT
TACGTCGCGC TGGAAAACAT CATCCAGTGG GGAGTGCGGC AAAACGCATC TGACGTTCAC
TTCAACCTGA ACTTCAACGA AGAACACAGC GAGGTCCGCT TCACCATCAA CGGCCGTTAT
GTGGCTCCAC AGATGTGGCG CATGCCAACA CAAACCATGT TCCAGATTCT GAGCGTTGCC
TGGATGTCTG GCACGGGCGG CAATGGGCCT AGCTATGACA CGCGCGTCGA GCAGCAATGC
CGGATCGCAT TGAAAATTGA CGATAAGCCG ATCATGCTTC GATGGGCCTC AGCATCGACC
GAATCGCCTG GCCCCTCCGT CACGACGCGG ATCATCAAAT CGGATGCCAA CATCCTGACG
CTTGAGCAAC TCTCGTATTT GCCAGACCAG ATTGCCACCT ACAACCGGGC CATGAATGCC
GAGAAGGGCG CGATTATCCT GGCAGGTGTT GTGAACTCCG GCAAATCCGT CACGATCGCG
TCCCTCATGG GCAGCCTGCC TGATTACCGA AAAAAGCTGG GCATCGAAGA TCCGGTAGAA
ATCCTGATCC CCGGCATGCT GCAAAAGTCC ATCAATCGGC CGCTCGAAGG CGACACGGGC
AAGGAATTTG ACGCCATCGC AAAAACCGCC AAGCGCTTTG CGATGAATGA CATTTTGATC
GGTGAAATCC GTGACAAGGC TACCGGAATG CTGATGGTGG ACATTGTGCT GATGGGCACA
TCCGTGTACT GCACGACCCA CACACCTTCA GCACTGGGCG CCTTCGACAA GCTAGCGAGC
GACATGGTGG GAGTTTCGCG TGACTTTCTC TCGGTTCCGG GAAACGTCAA GCTCATCACA
TACCAGGCAT TGCTGCCCAA GCTGTGTACC TGTGCGCTGC CACTTAAATG CATTACCCAG
CAAGGCCATG CAGAGTACAA GGCCTCCGGT AATGACTACA TCAAACGGCT ACAAAGGCTC
TATGACCTGG ACGAATCCAT GCTGCGCATC GGCAACCGGG AGGGTTGCCC CAAATGCCGG
CATGAAGGAC TGCCCGAGCT CAATGGCTTT AAAGGAAGAA CACTTGCAGC AGAAATGGTG
GAGCCTGACG AAACGCTGCT CTCGATGGTC AAGCAGGCCG ACAGCATTGG AATGCGGCGC
CACGTAGCCG AGTTGCGGGG CAGCACCCGA TTTGACGACC CGTTGATGGT AGGAAAGAGC
GCAATGGAAT GCGCCGTTTA CAAAATGTCC ACCGGCATCC TGGACCCGCG CGAGATTGAG
CCCCGGTTTC ATTCGTTCGA ACGGGAAGAA ATGATGCGTG CGTCTTCGAA GCACTACGCT
GAACACGTTG AAAAGCAAGT TTACACAGCA CTTAAAAAGG TAAGGTAA
 
Protein sequence
MAMVNLFRKL FLRPALVENQ VIASDIAAAV NFRPRKPLKV SVADAMADSE DIPSNIEAFA 
TSVASTNVQH GPFGKSISVE TIKDFATLPL LFEGLVNDFQ LGSHLGYNVQ PALFGGVQGS
EAARIAILVN PLYANSDEVV ELLSILANPK QHKHEFKLWE HASRVIIPHS LMLSLSRGEI
DKNSLSNRRK INSDPKKHSF YVALENIIQW GVRQNASDVH FNLNFNEEHS EVRFTINGRY
VAPQMWRMPT QTMFQILSVA WMSGTGGNGP SYDTRVEQQC RIALKIDDKP IMLRWASAST
ESPGPSVTTR IIKSDANILT LEQLSYLPDQ IATYNRAMNA EKGAIILAGV VNSGKSVTIA
SLMGSLPDYR KKLGIEDPVE ILIPGMLQKS INRPLEGDTG KEFDAIAKTA KRFAMNDILI
GEIRDKATGM LMVDIVLMGT SVYCTTHTPS ALGAFDKLAS DMVGVSRDFL SVPGNVKLIT
YQALLPKLCT CALPLKCITQ QGHAEYKASG NDYIKRLQRL YDLDESMLRI GNREGCPKCR
HEGLPELNGF KGRTLAAEMV EPDETLLSMV KQADSIGMRR HVAELRGSTR FDDPLMVGKS
AMECAVYKMS TGILDPREIE PRFHSFEREE MMRASSKHYA EHVEKQVYTA LKKVR