Gene PA14_66690 details


Gene Information

Locus tag: PA14_66690
Symbol:
ID: 4385530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas aeruginosa UCBPP-PA14
Kingdom: Bacteria
Replicon accession: NC_008463
Strand:
Start bp: 5956226
End bp: 5957665
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 66%
IMG OID: 639327980
Product: putative protease
Protein accession: YP_793516
Protein GI: 116053195
COG category: [R] General function prediction only
COG ID: [COG4784] Putative Zn-dependent protease
TIGRFAM ID:
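
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


gene_len = gene_length_bp(5956226, 5957665)
print(gene_len)                     # 1440
print(protein_length_aa(gene_len))  # 479
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.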


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACCG CCTGGCTCCT GCTGCTGTTG CTGGGAGTCG GCGCTCTGGG CGGCTGCGCG 
GTGAACCCGG CCACCGGCAA GAGCGATTTC GTGATGATGA GCGAGCAGCA GGAACTCGGC
ATGGGCGCTC GCTACAACCA GGAAATCCTC AAGCAGTTCC CCCGCTACAA CGATGAAAAG
CTCCAGGCCT ACGTGCAACG GGTAGGCGAG CGGGTCGCCC GCAGCAGCCA CCGCAGCAAC
CTGCAATATC ATTTCACCGT CATCGATTCG CCGGACATCA ACGCCTTCGC CCTGCCGGGC
GGCTACATCT ATATCCATCG CGGACTGATC GCCTACCTTG GCTCCGAGGC CGAACTGGCC
GCGGTGCTCG GTCATGAGGT CGGCCATGTC ACGGCGCGCC ACAGCGTGCG CCAGCAGAGT
CAGGCCAGCG CCTGGAACAT CCTTGGCCAG GCGGTGGCGA TCGGCACCGG GGTCGGCGCC
GCCGGCGACC TGGCCAACGT GCTCGGCACG GCCTTCGTCC GCGGCTACGG GCGTGACATG
GAACTGGAGG CCGATGGTCT CGGCGCCCAG TACCTGGCCC GCGCCGGCTA CGATCCGACG
GCGATGATCC AGGTGGTGCG GGTGCTGAAG AACCAGGAGG ACTTCGCTCG CGAAGAGGCC
GCGCGTAATG GCCAGGCGGT ACAGGCCGGC GGCTACCACG GGTTGTTCGA TACCCATCCG
GACAACGACC GGCGCCTGCA GGAAGTGGTC GGCCCGGCCC GGCAGCTGGC CAACGGACAG
CAGGAAGTGG GGCGCGAGGT CTTTCTTCGC CATCTGGAAG GCATGCCGTT CGGCGACTCG
GCATCGGCCG GGGTGCGTCG CGGGCAGAAC TTCTACCATG CCGAGCTGGA CTTCACCCTG
AGCTATCCGG CGGGCTGGAA GATCCTCAAC CAGCCCAGCG CCCTGCTTGG CTATCCGGCG
GACGAGCAGT CGTTCATCGG CATGAAGCTG GTGCCCCATG ATTCCCGCCT GACGCCCGCG
GAGTTCCTGC GCAAGAACGC CGGGCAACGG CTGGCCCAGG AAGAGTCGCT GAAGCAGGCC
GGGCTGAACG GCTACACGGC GGTGGTGCCA GGCAACCCGG CGCGACGGGT TGCTGTGATC
TACCAGGGCG ACCGCGCCTA CCTGTTCGTC GGCGTGGTCA AGGTCGGTTC CCTGGAGACC
CAGGACGACC GCTTCCTCAG CGTGATCCGC AGCTTCCGTC CGCTGCGCGA CAAGGAGCGG
GCCCTGGCGC AGCCGCGACG CCTGCATCTG GTGCAGGTCA AGGCGGGGCA GACGCTGGAG
CAATTGGCGG CTGGCGGCGA GGGGTCATTG AGTGACTCCG TGGCTCGTCT GCGTCTGCTG
AATGACCTTT ATCCGAGCGG TGAACCCCGT CCGGGCGATT GGCTTAAAGT CGTACGCTAG
 
Protein sequence
MKTAWLLLLL LGVGALGGCA VNPATGKSDF VMMSEQQELG MGARYNQEIL KQFPRYNDEK 
LQAYVQRVGE RVARSSHRSN LQYHFTVIDS PDINAFALPG GYIYIHRGLI AYLGSEAELA
AVLGHEVGHV TARHSVRQQS QASAWNILGQ AVAIGTGVGA AGDLANVLGT AFVRGYGRDM
ELEADGLGAQ YLARAGYDPT AMIQVVRVLK NQEDFAREEA ARNGQAVQAG GYHGLFDTHP
DNDRRLQEVV GPARQLANGQ QEVGREVFLR HLEGMPFGDS ASAGVRRGQN FYHAELDFTL
SYPAGWKILN QPSALLGYPA DEQSFIGMKL VPHDSRLTPA EFLRKNAGQR LAQEESLKQA
GLNGYTAVVP GNPARRVAVI YQGDRAYLFV GVVKVGSLET QDDRFLSVIR SFRPLRDKER
ALAQPRRLHL VQVKAGQTLE QLAAGGEGSL SDSVARLRLL NDLYPSGEPR PGDWLKVVR
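
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how one might work with them, the Python below strips that layout, computes a GC fraction, and translates DNA to protein. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which NCBI translation table 11 shares; the alternative start codons that table 11 allows are not handled here, which is an assumption that holds for this gene because it starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = clean_sequence("ATGAAGACCG CCTGGCTCCT GCTGCTGTTG")
print(translate(prefix))  # MKTAWLLLLL
```

Passing the full gene sequence through `clean_sequence` and then `translate` should, under the assumptions above, give a protein that can be compared with the listed protein sequence once it is cleaned the same way.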