Gene Pnap_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3501 
Symbol 
ID4689147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp3715798 
End bp3716922 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content66% 
IMG OID639836515 
Productputative zinc protease protein 
Protein accessionYP_983719 
Protein GI121606390 
COG category[R] General function prediction only 
COG ID[COG4324] Predicted aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0403427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCGA GACGATTGAG GCGCCTGCTG GCCGCAGGCC TGGCGGCCGC CGGCCTAACC 
GGCTGCGCCG ACCTGGGCTA TTACTGGCAG TCGGTCAACG GCCACCTGAC GGTGATGAAC
GCAGCCCGCC CGGTCAAGGA CTGGCTGGAC GATGCGCGCA CCCCGGCGCC GCTGAAAACC
CGGCTGGCCC TGGCGCAGCG CATCCGCCGC TTTGCCGTCA CCGAACTGCA GCTGCCCGAC
AACCCGAGCT ACCACCGCTA TGCCGACCTG CAGCGCAGCG CCGTGGTCTG GAACGTGGTC
GCCGCGCCCG AGTTCTCGCT GACGCTGAAG ACCTGGTGTT TTGCGCTGGC CGGCTGCGTC
GGCTACCGGG GCTATTTCAG TGAACCGGAT GCCCGGGCCG AGGCCGCGCA ACTCGCCGCC
CAGGGCTTTG AAACCAGCGT TCATGGGGTG CCGGCCTATT CCACGCTGGG CTGGATGAAC
TGGGCCGGCG GCGACCCGCT GCTGAGCACC TTCATCCGCT ACCCCGAGGG CGAGCTGGCG
CGACTGGTGT TTCACGAACT CGCGCACCAG GTGGCTTATG CGCAGGACGA CACGGTGTTC
AACGAGTCGT TTGCGACGGC CGTCGAGCGG CTGGGCGTGC AGCGCTGGCT GGATGCGCGG
AGCAGCCAGA GCACCGATGA AGCCCGCCAG GCCTATGCGG CGTTTGACGC ACGGCGCCAG
CAGTTCCGGG CACTGGCGCA GGCCACACGC CGGGAATTGA CCGCCATTTA TGAACCAAAC
AAGGCTTTAG TGCACGTCCC ACCTGCGCAA GCAGCTCTTA AAATGATAGC AATGCAGAAT
TTTCGTGAGC GCTATGCGCA GCTCAAGGCG TCATGGGACG GTTATGCCGG CTACGACCCG
TGGGTGGCGC GCGCCAACAA TGCGTCGTTT GGCGCGCAGG CAGCCTATGA CGAACTGGTG
CCCGGTTTTG AAGCCCTGTT CGAGCGCGAA GGACGTGACT GGCCACGGTT TTACGGCGCC
GTCAAACGGC TGGCCGGCAT GCCCAAGAGC GAGCGGCACG CCCTCCTGGA GATCAATCAC
GGGCCGGCAA TTGCCGGGAT GGCGGCAGCC CACGCCGGGC AGTAA
 
Protein sequence
MASRRLRRLL AAGLAAAGLT GCADLGYYWQ SVNGHLTVMN AARPVKDWLD DARTPAPLKT 
RLALAQRIRR FAVTELQLPD NPSYHRYADL QRSAVVWNVV AAPEFSLTLK TWCFALAGCV
GYRGYFSEPD ARAEAAQLAA QGFETSVHGV PAYSTLGWMN WAGGDPLLST FIRYPEGELA
RLVFHELAHQ VAYAQDDTVF NESFATAVER LGVQRWLDAR SSQSTDEARQ AYAAFDARRQ
QFRALAQATR RELTAIYEPN KALVHVPPAQ AALKMIAMQN FRERYAQLKA SWDGYAGYDP
WVARANNASF GAQAAYDELV PGFEALFERE GRDWPRFYGA VKRLAGMPKS ERHALLEINH
GPAIAGMAAA HAGQ