Gene Pnap_3735 details

Gene Information

Locus tag: Pnap_3735
Symbol:
ID: 4686771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 3977368
End bp: 3978606
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 63%
IMG OID: 639836753
Product: hypothetical protein
Protein accession: YP_983952
Protein GI: 121606623
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2342] Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase
TIGRFAM ID:
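
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the "Start bp" and "End bp" fields of this record.
length = gene_length_bp(3977368, 3978606)
print(length, protein_length_aa(length))  # 1239 412
```

Under these assumptions the results agree with the listed gene length (1239 bp) and protein length (412 aa).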


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.756384
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0338258
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCC ACACACCTAC TTACACTCCC CTCCATGGCC ATCGCACAGG CTCCCGCCTG 
GCCACCACGA TGCGCTGCAC CACCTTGCTC GTTGCGATGG CAGGCGCCTT CACGGCCCAT
GGCCGCACCC TGACCGTCAT TGGCGACACG CAGCCCGCAG GCAGCACCTG GCAAGACAGT
TCCGCCACGG AATCCAGCTC CACCAGGACG TGGCAACGGA GAAAACACAC AAACACACAA
ACGCCAACAA CGCCGACTCC AACGCCAACG CCTACGCCTA CGCCTACGCC TACGCCTACG
CCTACGCCTA CGCCTACGCC TACGCCTACG CCTACGCCTA CGCCCACCCC AACGCCCCCA
GCAACGACTT CCAGCCGAGG CTTCCCCACC GCCGGTCCGT GGGCGTCCTT CTATGGGTCG
GCGGACAGCA TCGATCTGCC CAAATTGGCG GCGACGTACC GCATCCTGGA CATTGACGCC
GACCCGGACA TGGGCAACTT CAGCGTCAGC CAGATCAAGA CGCTCAAGAA CGGCGGCGCC
AACAAGGTGT TGAGCTACCT GAACCTGGGC TCGTGCGAAA ACTTCCGTGG CTACTGGTCA
AAGGTGCCGT CGGGATTCCT CTCATGCTCG GCCAACAAGG CGGCGCAATT GGGTACCTAC
TCGGGTTACA GCAACGAGGT CTGGATGAAT GTCGGCAATG CCGCTTACCA AAACCTGGTC
ATCAACTACA TCGTGCCCCG GCTCGCCGCC CAGGGCGTGG ACGGTTTTTA CTTCGACAAC
ATGGAAATCG TCGAGCACGG AACGAACACC AAGAACGGCC CCTGCGACGC CCAGTGCAGC
CAGGGCGGCC TCGACCTGAT TGCCAAGCTG CGCGACAAGT ACCCTTCGAT GCTCTTCGTG
CTGCAGAACG CGACCAGCGA CAAGACGCGC CTCGGCCGGG CAACGGGCGC ATCCGGCACA
GTCGCCTTCC CGAGCCTGCT CGACGGCATC GCGCACGAAG AGGTGTACAA GCCCGTCCAT
GACACGTCCG TCGAAGCCGA ACTGGTCAGC TGGTCGGGCA TGAACCTGAT GCCGGGCGGC
CGCAAGTTCT GGATCGGAAC GCTGGACTAC GCCAGCAGCT GCACCAACAC CAGCGCAGCC
CAGTCGGCCT TCCAGTCCAG CCGTGCGCGC GGCTTCTCGC CTTCAGTCTC CGACTCCAGC
GCGGGACAGC AGACCGTGTG TTACTGGCCT GCTTTTTAA
 
Protein sequence
MSTHTPTYTP LHGHRTGSRL ATTMRCTTLL VAMAGAFTAH GRTLTVIGDT QPAGSTWQDS 
SATESSSTRT WQRRKHTNTQ TPTTPTPTPT PTPTPTPTPT PTPTPTPTPT PTPTPTPTPP
ATTSSRGFPT AGPWASFYGS ADSIDLPKLA ATYRILDIDA DPDMGNFSVS QIKTLKNGGA
NKVLSYLNLG SCENFRGYWS KVPSGFLSCS ANKAAQLGTY SGYSNEVWMN VGNAAYQNLV
INYIVPRLAA QGVDGFYFDN MEIVEHGTNT KNGPCDAQCS QGGLDLIAKL RDKYPSMLFV
LQNATSDKTR LGRATGASGT VAFPSLLDGI AHEEVYKPVH DTSVEAELVS WSGMNLMPGG
RKFWIGTLDY ASSCTNTSAA QSAFQSSRAR GFSPSVSDSS AGQQTVCYWP AF
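
The sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code; table 11 differs only in the start codons it permits, and that does not matter here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, '*' = stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGTCCACCC ACACACCTAC TTACACTCCC"
print(translate(first_block))  # MSTHTPTYTP
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence checks that the two listings agree. Multiplying `gc_fraction` by 100 and rounding gives a figure comparable to the GC content field.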