Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PA14_11100 |
Symbol | cupB5 |
ID | 4382071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | + |
Start bp | 961839 |
End bp | 964895 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639323449 |
Product | adhesive protein CupB5 |
Protein accession | YP_789038 |
Protein GI | 116052118 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.280418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAT GCTATGCACT GGTCTGGAAC GTATCCCAGG GCTGCTGGAA CGTCGTCAGC GAAGGCAGTC GCAGGCGCGG CAAGCCCGCC GGCGCCAAGG CGGCGATCGC CTCCGCCCTG GCCCTGCTGG GCGCCACGGC CCTGGCTCCG GCCTATGCGC TGCCAAGCGG GGGAACGGTG GTCGGCGGGA GCGCGAACGG GGAGATACAC CTGTCGGGCG GCAACAGCCT GTCGGTCAAC CAGAAGGTCG ACAAGCTGAT CGCCAACTGG GACTCCTTCA GCGTTGCCGC CGGCGAGCGG GTGATCTTCA ACCAGCCGAG CAGTAGCTCG ATTGCCCTGA ATCGGGTGAT CGGCACCAAG GCCAGCGATA TCCAGGGCCG GATCGATGCC AACGGCCAGG TCTTCCTGGT CAACCCCAAC GGCGTGCTCT TCGGTCGCGG CGCCCAGGTC AATGTCGGCG GCCTGGTGGC TTCCACGCTG GACATCACGG ATGCCGAGTT CAACGGCAAC TCCTCCAGAT ACCGTTTCAC GGGTCCCTCT ACCAACGGTG TCCTCAACCA CGGGGGCGCC ATCACCGCGG CGGAAGGCGG CAGCATCGCC CTGCTGGGCG CGCAGGTCGA CAACCGCGGG ACGGTCCTGG CGCAGATGGG CGGTGTCGGG CTCGGCGCGG GCAGCGACCT GACGCTGAAT TTCGACGGCA ACAAGCTGCT CGACATCCGC GTCGACGCCG GGGTCGCCAA TGCGCTGGCA AGCAACGGCG GCCTGCTCAA GGCCGACGGG GGGCGAGTCC TGATGGCGGC CAGGACCGCC AATGCGCTGC TCAACACGGT GGTGAACTCC CAGGGTGCCA TCGAGGCCCG TTCGCTGCGC GGCAAGAACG GGCGGATCGT GCTCGATGGC GGCCCGGACG GCAAGGTCAT GGTGGGAGGC GCCTTGTCCG CCAATGCGCT GAAAGGTCCG GGGCACGGCG GCACGGTCGA GGTCCGGGGG CAGGCGGTGG AAGTGGCCCT GGGCACCCAG GTGAACACGC TCGCCAGCAA TGGCCTCAAC GGCACCTGGA AGATTGCCGC CGACAAGATC GACGTGCGCC CGTCGGCGGT GTCGGATGGC GTCACCGTTC ATGCCGACAC CCTGTCGCGG AACCTGGCGA GCACCAATAT CGAACTGGTT TCGACCAAGG GCGACCTGGA CCTCGACGGC TCGGTGAGCT GGGCATCGGG CAACCGGCTG GGGCTGGGCT CCGCGGCCGA CCTGACGCTG AATGGCAGGC TGAATGCCAG TGGCGCCAAG GCTGGGCTGG AGCTGAAGGC CGAAGGCGCT ATCGATATCA ATGACAAGAT CGTTCTCGGC GGGGCTGGCA GCGCGCTGGC CATGGATGCC GGCGAAGGCC ACCGGGTGAA CGGCACGGCG TCGGTCTCCC TGGCCGGGGC CAACGCGACC TACGTCTCCG GCGGCTATTA CTACACGGTG GTGCAGAACC TGGCGCAGTT GCAGGCGATC AACAAGAACC TAGACGGCCT GTACGTGCTC GGCGGCAATA TCCTGGGCGG CAGCTATTAC TGCACGGCAC TGCAGTCCAT CGGCGGGCCC GCCGGCGTCT TCAGCGGCAC CCTGGACGGT CTCGGCAACA GCATCGGCAA TCTCTCGATC AGCAACACCG GGCCGAATGT CGGGCTGTTC GCCCGCTCCT CGGGCACCCT GAGCAACCTG AAACTGAACA ACCTGCGGGT ATCCGATAAC ACCTACGGCT CCGGTCCGTC TTCGCTCGGC GCCCTGGTCG GGATCAACAG CGGGCGTATC GCCAACGTCA GCGCCAGCGG GGTCTCGGTC GTCGGCAGCC GACTGCGCTC CAACGCACTG GGCGGCCTGG TCGGGCGCAA TATCAACGGG CAGATCACGA ACGCATCCGT CAGCGGCGGC GTCACCGCTT ATGCGGCGAG CACAGCGGTC GGCGGTCTGG TGGGGGAGAA CTTCACCACC GCCTGGGGGC CGGAGGCGGT CATCGAGAAC GCCCACAGCA ACGTCCATGT GGCTGCACAG TCCACCGAGC GCAACAGCCT GGGCGGCGTC GGCGGCCTGG TCGGACTGAA TGCGAAGGCC ACGATCAGGG CGTCCGGCAG CCAGGGGAAG GTCGAGACCT ACCGGCCCGG CCTGAACGTC GGCGGCCTGG TCGGCTACAA CATGTTCGGC CACGTCTCCG ATAGTAGCGC CAGCGGCCAG GTGGAGGCCG GCGGCGCGGG GTATACCGGC GGGCTGGTCG GCCTGAGTTC CGGCGGCGAG ATATTCCGCT CGCAGGCGAG CGGGTCGGTG TACAGCAAGG GCGGCCTGGC GACCGGAGGG TTGATCGGCA AGGCAGAAGG CAACGGCATG CTCGGAAACC TGAAAGCCAG CGGCAGCGTC ATGGACCAGG GGGGCGCGGA TCTGGGCGGG CTGGTCGGCA ACAACAGCCA GGGTGCCATC GAGACCGCCG AGGCGACGGG CAAGGTCAGC GGCGGCAGCA ACAGTCGCGT CGGCGGTCTG ATTGGACACA ACCTCGGCGG TTCCGTCGCC CATGCGATCT CGCGCGGCGA CGTGAGCGGC GGCTTCAACA GCCTGGTGGG CGGGCTCGTC GGCCACAACG GCGGCGAACT GTTCAACGTG GATGCCAGCG GCAGGGTCAG CGCCGCTGCG AGTGCGTCGG TTGGCGGCCT GGTCGGCAGC AACGCCGGTT CGATCCTGTC GGCGCGCAGC AGCAGTACCG TTAGCGGCGG CGGGCGCAGC CGCATCGGCG GCCTGGTCGG CGAGAACCAG ATCCAGGGAC GCATCGTTTC GTCCATGTCG GAAGGCACCG TCAGTGGCGA CTACTACGTC TCCATGGGCG GGCTGGCCGG CGTCAACCTG GGATCGATCG AGTACTCCGG CGTCAGCGGC AAGATCGACT TCAAGCCTCA GTCCCATTAC GGCCAGATCT ACGGTGCGCA GGTCGGCGAG AACCGTGGGG TCCTGGGCGG CAACTACGTG ATCGGCGAGG CGGCGCTCCT GCCGCCTGCC GGTATCGACT ACGGCAACAT CTGGTAA
|
Protein sequence | MNKCYALVWN VSQGCWNVVS EGSRRRGKPA GAKAAIASAL ALLGATALAP AYALPSGGTV VGGSANGEIH LSGGNSLSVN QKVDKLIANW DSFSVAAGER VIFNQPSSSS IALNRVIGTK ASDIQGRIDA NGQVFLVNPN GVLFGRGAQV NVGGLVASTL DITDAEFNGN SSRYRFTGPS TNGVLNHGGA ITAAEGGSIA LLGAQVDNRG TVLAQMGGVG LGAGSDLTLN FDGNKLLDIR VDAGVANALA SNGGLLKADG GRVLMAARTA NALLNTVVNS QGAIEARSLR GKNGRIVLDG GPDGKVMVGG ALSANALKGP GHGGTVEVRG QAVEVALGTQ VNTLASNGLN GTWKIAADKI DVRPSAVSDG VTVHADTLSR NLASTNIELV STKGDLDLDG SVSWASGNRL GLGSAADLTL NGRLNASGAK AGLELKAEGA IDINDKIVLG GAGSALAMDA GEGHRVNGTA SVSLAGANAT YVSGGYYYTV VQNLAQLQAI NKNLDGLYVL GGNILGGSYY CTALQSIGGP AGVFSGTLDG LGNSIGNLSI SNTGPNVGLF ARSSGTLSNL KLNNLRVSDN TYGSGPSSLG ALVGINSGRI ANVSASGVSV VGSRLRSNAL GGLVGRNING QITNASVSGG VTAYAASTAV GGLVGENFTT AWGPEAVIEN AHSNVHVAAQ STERNSLGGV GGLVGLNAKA TIRASGSQGK VETYRPGLNV GGLVGYNMFG HVSDSSASGQ VEAGGAGYTG GLVGLSSGGE IFRSQASGSV YSKGGLATGG LIGKAEGNGM LGNLKASGSV MDQGGADLGG LVGNNSQGAI ETAEATGKVS GGSNSRVGGL IGHNLGGSVA HAISRGDVSG GFNSLVGGLV GHNGGELFNV DASGRVSAAA SASVGGLVGS NAGSILSARS SSTVSGGGRS RIGGLVGENQ IQGRIVSSMS EGTVSGDYYV SMGGLAGVNL GSIEYSGVSG KIDFKPQSHY GQIYGAQVGE NRGVLGGNYV IGEAALLPPA GIDYGNIW
|
| |