Gene PA14_18630 details


Gene Information

Locus tag: PA14_18630
Symbol:
ID: 4381500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas aeruginosa UCBPP-PA14
Kingdom: Bacteria
Replicon accession: NC_008463
Strand:
Start bp: 1600241
End bp: 1603228
Gene Length: 2988 bp
Protein Length: 995 aa
Translation table: 11
GC content: 70%
IMG OID: 639324042
Product: putative serine protease
Protein accession: YP_789630
Protein GI: 116051534
COG category: [S] Function unknown
COG ID: [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
            [TIGR02601] autotransporter-associated beta strand repeat
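
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1600241, 1603228)
print(length_bp)                  # 2988
print(protein_length(length_bp))  # 995
```

Both results match the Gene Length (2988 bp) and Protein Length (995 aa) fields of the record.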


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.169589
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG ACCACTCCTT CCGCCCTCGC CCCACCTCGT TGTCAGCCGC CCTGCTGCTG 
GGCGCCTGGA TCGCCCAGCC GGCCACGGCC GCCTATGTCG AAGCCGGTCG GCCCGGCGAT
CCGGCCAGTT GGCGCTCCGC CGAATACCAG CAGGACTGGG GCCTGGAACG GATGCGGGCC
GACCAGGCCT ATGCCGCCGG CATCGACGGC CAGGGCGTGA AGATCGGCGA GATGGACTCC
GGTTTCGACC CGAGCCATCC GGATACCCCC GCCTCGCGCT ACCAGCCGGT GACGGCCAGC
GGCACCTATG TCGACGGCAC GCCGTTCAGC GTCAGCGGCG CGATGAACGG CAACAACGAC
TCCCACGGCA CCCACGTCGG CGGCACCCTC GGTGCCTCGC GCGACGGCGT CGGCATGCAC
GGGGTGGCCT ACGCGGCACA GGTGTACGTC GCCAACACCA ACCAGAACGA CAGCTTCCTG
TTCGGCCCGA CGCCCGACCC GAACTATTTC AAGGCCGCCT ACCAGGCGCT GGCCGACGCC
GGGGTGCGGG CGATCAACAA CAGTTGGGGC AGCCAGCCCA AGGACGTCAG CTACGAGACC
CTCGACGGCC TGCACGCCGC CTATGCCCAG CACTACGGGC GCTCCACCTG GCTGGACGCC
GCCGCCGGCG TCTCCCGCCA GGGCGTGATC AACGTCTTCA GCGCCGGCAA CAGCGGCTAC
GCCAACGCCA GCGTGCGCTC CGCCCTGCCC TACTTCCAGC CGGACCTGGA AGGCCACTGG
CTGGCCGTGT CCGGCCTCGA CCAGCAGAAC GGCCAGCGCT ACAACCGCTG CGGCATCGCC
AAGTACTGGT GCATCACCAC GCCCGGCCGC CTGATCAACA GCACCATGCC CGGCGGCGGC
TACGCCAACA AGTCCGGTAC CTCGATGGCC GCGCCCCACG CCACCGGCGC GCTGGCCCTG
GTCATGCAGC GCTATCCGTA CCTGAACAAC GAGCAGGCGC TGCAGGTTCT GCTGACCACC
GCCACCCAGC TCGACGGCAC GCCGACCGGC GCCCCCACCG ACACCGTCGG CTGGGGCGTG
CCGGATCTCG GTCGGGCGAT GCATGGGCCT GGACAATTGC TCGGCCGCTT CGAGGCCAAC
CTCCCGGCCG GCCTGCGCGA CGAATGGAGC AACCCGATTT CCGATAGCGC CCTGCTCCAG
CGCCAGGCCG AGGACGCCGC CGAGCACGCG GCCTGGCAGC GGACGCTGAA GGACAAGGGC
TGGGAAAACG GCTTGCCAGC CGGCGCCAGC CAGCAGGAAC GCACCGACTA TGCCATCGGC
ATGGCCCGCG ACCAGGCCGC CGCCCAGCGC CAGTACCAGG GCAGCCTGGT CAAGGCCGGT
GCCGGCAGCC TGGTCCTGAG CGGCGACAGC ACCTATCGCG GGCCGACCCT GGTCGATGGC
GGGCTGCTCA GCGTCGACGG TTCGCTGCTG TCCGCCGTCG AAGTCAATGC CGGCGGCACC
CTCGGCGGCA GCGGCAGGAT CGGCGGCCTG CTGGCGCGCT CCGGCGGCAC GGTGGCCGCG
GGCAACTCCA TCGGCACCCT GGAGGTCGCC GGGGACCTGC GCTTCGAATC CGGCTCGACC
TACGCGGTGG AGCTTTCGGA AAGCGCCAGC GACCGGATCG TCGCCAGCGG CAAGGCGAGC
ATCGCGGGCG GCAATGTCAC CCTGGCCATG GAAAACAGCC CCGACCTGCT CAGCCAGTCC
CAGGTCGAGA GCCTGGTCGG CCGCCGCTAC GACATCCTCG ACGCCGCCGG CGGCATCGAC
GGGCGCTTCG ACGCGGTATT GCCGAACTAC CTGTTCCTCG GCGGCACCCT GGACTACGCG
GCCAACGCCA TCCGCCTGGA TATCGGACGC AACGGCACGG CCCTCGCCAG CGTCGCGCAG
ACGCCCAACC AGGCGGCGGT CGCTGGTGCC GTGGAAGCGC TCGGCGCCGG CAACCCGGTC
TACGAAAGCC TGCTCCTGTC GGAAAACGCC GCAACCGCCC AACGGGCCTT CCAACAATTG
TCCGGGGAAA TCTACCCGGC GCTCGCCGGC CTGTTGCTCA ACGACAGCCG CTATCTGCGT
GACAGCGTCG GCGAACGCCT GCGCCAGGCC AGCGACGGCG AGGCCGGCAG GGAGGCTCCC
GAAGGCTGGT TCAAGGCGCT CGGTTCCTGG GGCAAGAGCG CCGATGGCAG CCACGGCAGC
GAAGGCTACC GGCATTCGGT CGGCGGCTTC CTGCTCGGCG TCGACAGCCA GGTCGCCAGC
GACACGCGCC TCGGCCTGGT GGCCGGCTAC AGCAACAGCT CGCTGAACAT GGACAGCAGC
CTGCAATCCT CCGCCAGCAT CGACAGCTAC CACCTCGGCG CCTACCTCGG CCGGCAATTG
CAGCAATGGC GCCTGAGCCT CGGCGCGGCG CACGCCTGGC ACCGCGCCGA GGTCAAGCGC
GACCTGCAAT ACGGCGCCGT GGCCGGCAAG CAGAAGGCCA AGCTCGACGC ACAGAGCAGC
CAGTTGTTCG CCGAGGCCGC CTACGCGCTG GGCTGGCGCA GCCTGGAGCT GGAACCCTTC
GCCGGGCTGG CCTACGTGCA CGTCGCCAGC GATGACTTCC GCGAACGCGG TAGTGCCGCG
GCCCTGGAGG GTGGCGACGA CAACCTGGAC GCCGCCTTCA CCACCCTGGG CCTGCGCGCG
AAACGGCATT TCGAGCTGGA TGCCGAACGC CGCCTGGCGC TCTCCGGCAC CCTCGGCTGG
CGCCACAACC TGAGCGACAC CACCCCGCAA CGCCACCTGG CGTTCGCCAG CGGCAGCCAG
CCATTCAACG TGGAAAGCGT GGCCCTGTCC CGCGACGCCG CGCTGCTCGG CGTCGACGCC
AGCCTCGCGG TGAATCGCGA AGTGAGCGTG CGGCTGGGCT ACAACGGCCT GCTGGGCAGC
CGCGAGAAGG ACCATGGCGT CGGACTGGCC GTCGACTGGC GTTTCTGA
 
Protein sequence
MTDDHSFRPR PTSLSAALLL GAWIAQPATA AYVEAGRPGD PASWRSAEYQ QDWGLERMRA 
DQAYAAGIDG QGVKIGEMDS GFDPSHPDTP ASRYQPVTAS GTYVDGTPFS VSGAMNGNND
SHGTHVGGTL GASRDGVGMH GVAYAAQVYV ANTNQNDSFL FGPTPDPNYF KAAYQALADA
GVRAINNSWG SQPKDVSYET LDGLHAAYAQ HYGRSTWLDA AAGVSRQGVI NVFSAGNSGY
ANASVRSALP YFQPDLEGHW LAVSGLDQQN GQRYNRCGIA KYWCITTPGR LINSTMPGGG
YANKSGTSMA APHATGALAL VMQRYPYLNN EQALQVLLTT ATQLDGTPTG APTDTVGWGV
PDLGRAMHGP GQLLGRFEAN LPAGLRDEWS NPISDSALLQ RQAEDAAEHA AWQRTLKDKG
WENGLPAGAS QQERTDYAIG MARDQAAAQR QYQGSLVKAG AGSLVLSGDS TYRGPTLVDG
GLLSVDGSLL SAVEVNAGGT LGGSGRIGGL LARSGGTVAA GNSIGTLEVA GDLRFESGST
YAVELSESAS DRIVASGKAS IAGGNVTLAM ENSPDLLSQS QVESLVGRRY DILDAAGGID
GRFDAVLPNY LFLGGTLDYA ANAIRLDIGR NGTALASVAQ TPNQAAVAGA VEALGAGNPV
YESLLLSENA ATAQRAFQQL SGEIYPALAG LLLNDSRYLR DSVGERLRQA SDGEAGREAP
EGWFKALGSW GKSADGSHGS EGYRHSVGGF LLGVDSQVAS DTRLGLVAGY SNSSLNMDSS
LQSSASIDSY HLGAYLGRQL QQWRLSLGAA HAWHRAEVKR DLQYGAVAGK QKAKLDAQSS
QLFAEAAYAL GWRSLELEPF AGLAYVHVAS DDFRERGSAA ALEGGDDNLD AAFTTLGLRA
KRHFELDAER RLALSGTLGW RHNLSDTTPQ RHLAFASGSQ PFNVESVALS RDAALLGVDA
SLAVNREVSV RLGYNGLLGS REKDHGVGLA VDWRF
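
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: it joins the blocks, computes a GC fraction, and translates a CDS. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and all helper names are hypothetical. Only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid assignments
# are those of the standard code, assumed here to apply to table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGACCGACG ACCACTCCTT CCGCCCTCGC CCCACCTCGT TGTCAGCCGC CCTGCTGCTG"
dna = join_blocks(first_line)
peptide = translate(dna)
print(peptide)  # MTDDHSFRPRPTSLSAALLL
```

The translated fragment matches the first 20 residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.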