Gene PA14_33340 details

Gene Information

Locus tag: PA14_33340
Symbol:
ID: 4380486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas aeruginosa UCBPP-PA14
Kingdom: Bacteria
Replicon accession: NC_008463
Strand:
Start bp: 2931455
End bp: 2934685
Gene Length: 3231 bp
Protein Length: 1076 aa
Translation table: 11
GC content: 66%
IMG OID: 639325249
Product: helicase
Protein accession: YP_790818
Protein GI: 116050365
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR02562] CRISPR-associated helicase Cas3
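
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2931455, 2934685)
print(length)                     # 3231
print(protein_length_aa(length))  # 1076
```

Both results agree with the Gene Length and Protein Length fields listed in the record.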


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid unclonability p-value: 0.0278581
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATCC TGCTGGTGTC GCAATGCGAA AAGCGCGCCC TGAGCGAAAC CCGCCGCATT 
CTCGACCAGT TCGCCGAGCG CCGCGGCGAA CGGACCTGGC AAACGCCCAT CACTCAAGCC
GGACTGGATA CCCTGCGACG CCTGCTGAAG AAAAGCGCAC GGCGCAACAC CGCCGTAGCC
TGTCACTGGA TCCGCGGCCG CGACCACAGC GAACTGCTGT GGATCGTCGG TGATGCCAGC
CGCTTCAACG CCCAGGGTGC GGTGCCGACC AACAGGACCT GCCGCGACAT CCTGCGCAAG
GAAGACGAGA ACGACTGGCA CAGCGCCGAG GACATCCGCC TGCTGACGGT GATGGCAGCG
CTGTTCCACG ATATCGGCAA GGCCAGCCAG GCCTTCCAGG CCAAACTGCG GAACCGCGGC
AAACCGATGG CCGATGCCTA TCGTCACGAA TGGGTATCAC TGCGCCTGTT CGAAGCCTTC
GTTGGCCCAG GCAGCAGCGA CGAGGACTGG CTGAGGCGCC TGGCGGACAA GCGAGAGACG
GGCGATGCCT GGCTGTCGCA ACTGGCCAGG GACGACCGGC AATCCGCGCC ACCCGGCCCG
TTCCAGAAAA GCCGGCTACC GCCGCTCGCC CAGGCGGTCG GCTGGTTGAT CGTCAGCCAT
CATCGCCTGC CCAACGGGGA CCATCGCGGC AGCGCCTCGC TGGCACGCTT GCCGGCCCCC
ATCCAGAGCC AATGGTGCGG CGCACGCGAC GCAGACGCAA AAGAAAAGGC CGCCTGCTGG
CAGTTCCCCC ACGGCCTGCC CTTCGCCAGC GCCCATTGGC GCGCCAGGAC AGCGCTATGC
GCGCAGAGCA TGCTCGAGCG TCCCGGCCTG CTGGCTCGGG GACCGGCCTT GTTGCATGAT
TCCTACGTCA TGCATGTGTC CCGACTGATC CTGATGCTCG CGGACCACCA CTATTCCAGT
CTCCCTGCCG ACTCCCGGCT GGGCGACCCG AACTTCCCCT TGCACGCCAA CACCGACCGG
GACAGCGGCA AACTAAAGCA GCGCCTGGAC GAACACCTGC TCGGCGTCGC CCTGCACAGT
CGCAAGCTCG CCGGCACCCT GCCACGCCTG GAGCGACAAC TACCGCGCCT TGCCCGGCAC
AAGGGCTTCA CCCGCCGGGT CGAGCAGCCG CGCTTCCGCT GGCAGGACAA GGCCTACGAC
TGCGCGATGG CCTGCCGCGA GCAGGCTATG GAGCATGGAT TCTTCGGCCT CAACCTGGCG
TCGACCGGTT GCGGTAAGAC CCTCGCCAAC GGCCGTATCC TGTATGCGCT GGCCGATCCG
CAACGCGGCG CGCGTTTCAG CATCGCTCTC GGCCTGCGCA GCTTGACCCT GCAAACCGGG
CAGGCCTACC GCGAGCGGCT CGGCCTGGGC GACGACGACC TCGCTATCCT GGTCGGCGGC
AGCGCCGCCC GCGAACTGTT CGAAAAGCAG CAGGAGCGCC TGGAGCGCAG CGGTAGCGAG
TCAGCCCAGG AGCTGCTGGC GGAAAACAGC CATGTACACT TCGCCGGCAC GCTCGAGGAC
GGCCCTCTAC GCGAGTGGCT CGGCAGGAAC AGCGCGGGAA ACCGCCTACT CCAGGCGCCC
ATCCTGGCCT GCACCATCGA CCACCTGATG CCCGCCAGCG AAAGCCTGCG CGGCGGACAC
CAGATAGCGC CACTGCTCCG CCTGATGACT TCCGACCTGG TGCTCGACGA GGTCGACGAC
TTCGATATCG ACGACCTGCC CGCCCTGTCG CGGCTGGTGC ACTGGGCCGG CCTGTTCGGC
AGCCGCGTGC TGCTCTCCTC CGCGACCCTG CCGCCGGCCT TGGTGCAGGG CCTGTTCGAG
GCCTATCGCA GCGGCCGGGA AATCTTCCAG CGCCATCGTG GCGCTCCCGG ACGCGCTACG
GAAATCCGCT GTGCCTGGTT CGACGAGTTC TCCAGCCAAT CCAGCGCCCA CGGCGCCGTA
ACCTCCTTCA GCGAAGCGCA TGCGACCTTC GTCGCCCAGC GTCTGGCGAA GCTCGAGCAA
CTGCCGCCAC GTCGCCAGGC GCAGCTATGC ACCGTGCATG CCGCTGGCGA GGCCCGTCCC
GCGCTGTGCC GCGAGTTGGC CGGGCAGATG AATACCTGGA TGGCTGACCT GCATCGCTGC
CATCACACCG AACACCAAGG ACGTCGCATC AGTTTCGGCC TGCTACGGCT GGCCAACATC
GAACCCCTGA TCGAACTGGC CCAGGCCATC CTCGCCCAGG GTGCGCCCGA GGGGTTGCAT
GTCCATCTGT GTGTCTACCA TTCGCGGCAT CCCCTTCTGG TCCGCTCGGC CATCGAGCGA
CAACTCGATG AACTGCTGAA GCGTTCGGAC GACGACGCCG CCGCGCTGTT CGCTCGTCCG
ACGCTGGCCA AGGCGCTCCA GGCCAGCACG GAGCGGGATC ATCTGTTCGT CGTACTCGCC
TCGCCGGTGG CGGAGGTCGG TCGCGACCAC GATTACGACT GGGCCATCGT CGAACCCTCC
TCCATGCGCT CGATCATCCA GTTGGCCGGG CGAATCCGCC GCCATCGCTC CGGCTTCAGC
GGCGAGGCCA ACCTATACCT GCTATCGCGC AATATCCGCT CGCTGGAAGG GCAGAATCCG
GCGTTCCAGC GGCCCGGCTT CGAGACCCCC GACTTCCCTC TTGACAGCCA CGACCTGCAC
GACCTGCTCG ACCCCGCCCT ACTCGCCCGC ATCGACGCCA GCCCACGAAT CGTCGAACCG
TTCCCACTGT TCCCACGCAG CCGGTTGGTC GACCTGGAAC ACCGACGCCT GCGCGCGCTG
ATGCTTGCCG ACGACCCACC GTCGTCCCTG CTCGGCGTAC CGCTCTGGTG GCAAACCCCG
GCATCGCTCA GCGGCGCCCT GCAAACCAGC CAACCATTTC GCGCAGGCGC CAAGGAGCGA
TGCTACGCCC TGCTGCCGGA CGAGGACGAC GAGGAGCGCT TGCATTTCAG CCGCTACGAA
GAAGGGACCT GGAGCAACCA GGACAACCTG TTGCGCAACC TCGACCTCAC CTATGGCCCG
CGCATCCAGA CATGGGGCAC GGTCAACTAT CGGGAGGAGC TAGTCGCAAT GGCCGGCCGC
GAGGACCTCG ACCTGCGTCA ATGCGCCATG CGCTACGGCG AGGTGAGATT GCGAGAAAAC
ACCCAGGGAT GGAGCTACCA CCCTTATTTG GGGTTCAAGA AATACAACTG A
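
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. As a minimal sketch, the GC content field of this record (66%) could be recomputed as follows. The helper names are hypothetical, and the example feeds in only the first line of the sequence, so its result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only; paste the full block to check the gene.
first_line = "ATGAACATCC TGCTGGTGTC GCAATGCGAA AAGCGCGCCC TGAGCGAAAC CCGCCGCATT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```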
 
Protein sequence
MNILLVSQCE KRALSETRRI LDQFAERRGE RTWQTPITQA GLDTLRRLLK KSARRNTAVA 
CHWIRGRDHS ELLWIVGDAS RFNAQGAVPT NRTCRDILRK EDENDWHSAE DIRLLTVMAA
LFHDIGKASQ AFQAKLRNRG KPMADAYRHE WVSLRLFEAF VGPGSSDEDW LRRLADKRET
GDAWLSQLAR DDRQSAPPGP FQKSRLPPLA QAVGWLIVSH HRLPNGDHRG SASLARLPAP
IQSQWCGARD ADAKEKAACW QFPHGLPFAS AHWRARTALC AQSMLERPGL LARGPALLHD
SYVMHVSRLI LMLADHHYSS LPADSRLGDP NFPLHANTDR DSGKLKQRLD EHLLGVALHS
RKLAGTLPRL ERQLPRLARH KGFTRRVEQP RFRWQDKAYD CAMACREQAM EHGFFGLNLA
STGCGKTLAN GRILYALADP QRGARFSIAL GLRSLTLQTG QAYRERLGLG DDDLAILVGG
SAARELFEKQ QERLERSGSE SAQELLAENS HVHFAGTLED GPLREWLGRN SAGNRLLQAP
ILACTIDHLM PASESLRGGH QIAPLLRLMT SDLVLDEVDD FDIDDLPALS RLVHWAGLFG
SRVLLSSATL PPALVQGLFE AYRSGREIFQ RHRGAPGRAT EIRCAWFDEF SSQSSAHGAV
TSFSEAHATF VAQRLAKLEQ LPPRRQAQLC TVHAAGEARP ALCRELAGQM NTWMADLHRC
HHTEHQGRRI SFGLLRLANI EPLIELAQAI LAQGAPEGLH VHLCVYHSRH PLLVRSAIER
QLDELLKRSD DDAAALFARP TLAKALQAST ERDHLFVVLA SPVAEVGRDH DYDWAIVEPS
SMRSIIQLAG RIRRHRSGFS GEANLYLLSR NIRSLEGQNP AFQRPGFETP DFPLDSHDLH
DLLDPALLAR IDASPRIVEP FPLFPRSRLV DLEHRRLRAL MLADDPPSSL LGVPLWWQTP
ASLSGALQTS QPFRAGAKER CYALLPDEDD EERLHFSRYE EGTWSNQDNL LRNLDLTYGP
RIQTWGTVNY REELVAMAGR EDLDLRQCAM RYGEVRLREN TQGWSYHPYL GFKKYN
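
The protein sequence can be reproduced from the gene sequence by codon translation. The record specifies translation table 11, which assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons. The sketch below therefore uses the standard codon assignments and does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAACATCC TGCTGGTGTC GCAATGCGAA"))  # MNILLVSQCE
```

The output matches the first block of the protein sequence listed above.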