Gene PA14_11100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_11100 
SymbolcupB5 
ID4382071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp961839 
End bp964895 
Gene Length3057 bp 
Protein Length1018 aa 
Translation table11 
GC content68% 
IMG OID639323449 
Productadhesive protein CupB5 
Protein accessionYP_789038 
Protein GI116052118 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.280418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAT GCTATGCACT GGTCTGGAAC GTATCCCAGG GCTGCTGGAA CGTCGTCAGC 
GAAGGCAGTC GCAGGCGCGG CAAGCCCGCC GGCGCCAAGG CGGCGATCGC CTCCGCCCTG
GCCCTGCTGG GCGCCACGGC CCTGGCTCCG GCCTATGCGC TGCCAAGCGG GGGAACGGTG
GTCGGCGGGA GCGCGAACGG GGAGATACAC CTGTCGGGCG GCAACAGCCT GTCGGTCAAC
CAGAAGGTCG ACAAGCTGAT CGCCAACTGG GACTCCTTCA GCGTTGCCGC CGGCGAGCGG
GTGATCTTCA ACCAGCCGAG CAGTAGCTCG ATTGCCCTGA ATCGGGTGAT CGGCACCAAG
GCCAGCGATA TCCAGGGCCG GATCGATGCC AACGGCCAGG TCTTCCTGGT CAACCCCAAC
GGCGTGCTCT TCGGTCGCGG CGCCCAGGTC AATGTCGGCG GCCTGGTGGC TTCCACGCTG
GACATCACGG ATGCCGAGTT CAACGGCAAC TCCTCCAGAT ACCGTTTCAC GGGTCCCTCT
ACCAACGGTG TCCTCAACCA CGGGGGCGCC ATCACCGCGG CGGAAGGCGG CAGCATCGCC
CTGCTGGGCG CGCAGGTCGA CAACCGCGGG ACGGTCCTGG CGCAGATGGG CGGTGTCGGG
CTCGGCGCGG GCAGCGACCT GACGCTGAAT TTCGACGGCA ACAAGCTGCT CGACATCCGC
GTCGACGCCG GGGTCGCCAA TGCGCTGGCA AGCAACGGCG GCCTGCTCAA GGCCGACGGG
GGGCGAGTCC TGATGGCGGC CAGGACCGCC AATGCGCTGC TCAACACGGT GGTGAACTCC
CAGGGTGCCA TCGAGGCCCG TTCGCTGCGC GGCAAGAACG GGCGGATCGT GCTCGATGGC
GGCCCGGACG GCAAGGTCAT GGTGGGAGGC GCCTTGTCCG CCAATGCGCT GAAAGGTCCG
GGGCACGGCG GCACGGTCGA GGTCCGGGGG CAGGCGGTGG AAGTGGCCCT GGGCACCCAG
GTGAACACGC TCGCCAGCAA TGGCCTCAAC GGCACCTGGA AGATTGCCGC CGACAAGATC
GACGTGCGCC CGTCGGCGGT GTCGGATGGC GTCACCGTTC ATGCCGACAC CCTGTCGCGG
AACCTGGCGA GCACCAATAT CGAACTGGTT TCGACCAAGG GCGACCTGGA CCTCGACGGC
TCGGTGAGCT GGGCATCGGG CAACCGGCTG GGGCTGGGCT CCGCGGCCGA CCTGACGCTG
AATGGCAGGC TGAATGCCAG TGGCGCCAAG GCTGGGCTGG AGCTGAAGGC CGAAGGCGCT
ATCGATATCA ATGACAAGAT CGTTCTCGGC GGGGCTGGCA GCGCGCTGGC CATGGATGCC
GGCGAAGGCC ACCGGGTGAA CGGCACGGCG TCGGTCTCCC TGGCCGGGGC CAACGCGACC
TACGTCTCCG GCGGCTATTA CTACACGGTG GTGCAGAACC TGGCGCAGTT GCAGGCGATC
AACAAGAACC TAGACGGCCT GTACGTGCTC GGCGGCAATA TCCTGGGCGG CAGCTATTAC
TGCACGGCAC TGCAGTCCAT CGGCGGGCCC GCCGGCGTCT TCAGCGGCAC CCTGGACGGT
CTCGGCAACA GCATCGGCAA TCTCTCGATC AGCAACACCG GGCCGAATGT CGGGCTGTTC
GCCCGCTCCT CGGGCACCCT GAGCAACCTG AAACTGAACA ACCTGCGGGT ATCCGATAAC
ACCTACGGCT CCGGTCCGTC TTCGCTCGGC GCCCTGGTCG GGATCAACAG CGGGCGTATC
GCCAACGTCA GCGCCAGCGG GGTCTCGGTC GTCGGCAGCC GACTGCGCTC CAACGCACTG
GGCGGCCTGG TCGGGCGCAA TATCAACGGG CAGATCACGA ACGCATCCGT CAGCGGCGGC
GTCACCGCTT ATGCGGCGAG CACAGCGGTC GGCGGTCTGG TGGGGGAGAA CTTCACCACC
GCCTGGGGGC CGGAGGCGGT CATCGAGAAC GCCCACAGCA ACGTCCATGT GGCTGCACAG
TCCACCGAGC GCAACAGCCT GGGCGGCGTC GGCGGCCTGG TCGGACTGAA TGCGAAGGCC
ACGATCAGGG CGTCCGGCAG CCAGGGGAAG GTCGAGACCT ACCGGCCCGG CCTGAACGTC
GGCGGCCTGG TCGGCTACAA CATGTTCGGC CACGTCTCCG ATAGTAGCGC CAGCGGCCAG
GTGGAGGCCG GCGGCGCGGG GTATACCGGC GGGCTGGTCG GCCTGAGTTC CGGCGGCGAG
ATATTCCGCT CGCAGGCGAG CGGGTCGGTG TACAGCAAGG GCGGCCTGGC GACCGGAGGG
TTGATCGGCA AGGCAGAAGG CAACGGCATG CTCGGAAACC TGAAAGCCAG CGGCAGCGTC
ATGGACCAGG GGGGCGCGGA TCTGGGCGGG CTGGTCGGCA ACAACAGCCA GGGTGCCATC
GAGACCGCCG AGGCGACGGG CAAGGTCAGC GGCGGCAGCA ACAGTCGCGT CGGCGGTCTG
ATTGGACACA ACCTCGGCGG TTCCGTCGCC CATGCGATCT CGCGCGGCGA CGTGAGCGGC
GGCTTCAACA GCCTGGTGGG CGGGCTCGTC GGCCACAACG GCGGCGAACT GTTCAACGTG
GATGCCAGCG GCAGGGTCAG CGCCGCTGCG AGTGCGTCGG TTGGCGGCCT GGTCGGCAGC
AACGCCGGTT CGATCCTGTC GGCGCGCAGC AGCAGTACCG TTAGCGGCGG CGGGCGCAGC
CGCATCGGCG GCCTGGTCGG CGAGAACCAG ATCCAGGGAC GCATCGTTTC GTCCATGTCG
GAAGGCACCG TCAGTGGCGA CTACTACGTC TCCATGGGCG GGCTGGCCGG CGTCAACCTG
GGATCGATCG AGTACTCCGG CGTCAGCGGC AAGATCGACT TCAAGCCTCA GTCCCATTAC
GGCCAGATCT ACGGTGCGCA GGTCGGCGAG AACCGTGGGG TCCTGGGCGG CAACTACGTG
ATCGGCGAGG CGGCGCTCCT GCCGCCTGCC GGTATCGACT ACGGCAACAT CTGGTAA
 
Protein sequence
MNKCYALVWN VSQGCWNVVS EGSRRRGKPA GAKAAIASAL ALLGATALAP AYALPSGGTV 
VGGSANGEIH LSGGNSLSVN QKVDKLIANW DSFSVAAGER VIFNQPSSSS IALNRVIGTK
ASDIQGRIDA NGQVFLVNPN GVLFGRGAQV NVGGLVASTL DITDAEFNGN SSRYRFTGPS
TNGVLNHGGA ITAAEGGSIA LLGAQVDNRG TVLAQMGGVG LGAGSDLTLN FDGNKLLDIR
VDAGVANALA SNGGLLKADG GRVLMAARTA NALLNTVVNS QGAIEARSLR GKNGRIVLDG
GPDGKVMVGG ALSANALKGP GHGGTVEVRG QAVEVALGTQ VNTLASNGLN GTWKIAADKI
DVRPSAVSDG VTVHADTLSR NLASTNIELV STKGDLDLDG SVSWASGNRL GLGSAADLTL
NGRLNASGAK AGLELKAEGA IDINDKIVLG GAGSALAMDA GEGHRVNGTA SVSLAGANAT
YVSGGYYYTV VQNLAQLQAI NKNLDGLYVL GGNILGGSYY CTALQSIGGP AGVFSGTLDG
LGNSIGNLSI SNTGPNVGLF ARSSGTLSNL KLNNLRVSDN TYGSGPSSLG ALVGINSGRI
ANVSASGVSV VGSRLRSNAL GGLVGRNING QITNASVSGG VTAYAASTAV GGLVGENFTT
AWGPEAVIEN AHSNVHVAAQ STERNSLGGV GGLVGLNAKA TIRASGSQGK VETYRPGLNV
GGLVGYNMFG HVSDSSASGQ VEAGGAGYTG GLVGLSSGGE IFRSQASGSV YSKGGLATGG
LIGKAEGNGM LGNLKASGSV MDQGGADLGG LVGNNSQGAI ETAEATGKVS GGSNSRVGGL
IGHNLGGSVA HAISRGDVSG GFNSLVGGLV GHNGGELFNV DASGRVSAAA SASVGGLVGS
NAGSILSARS SSTVSGGGRS RIGGLVGENQ IQGRIVSSMS EGTVSGDYYV SMGGLAGVNL
GSIEYSGVSG KIDFKPQSHY GQIYGAQVGE NRGVLGGNYV IGEAALLPPA GIDYGNIW