Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PSPA7_1018 |
Symbol | |
ID | 5353995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa PA7 |
Kingdom | Bacteria |
Replicon accession | NC_009656 |
Strand | + |
Start bp | 1018039 |
End bp | 1021068 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640810072 |
Product | adhesive protein CupB5 |
Protein accession | YP_001346404 |
Protein GI | 152989378 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.284554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAT GCTATGCACT GGTCTGGAAC GCATCCCAGG GCTGCTGGAA CGTCGCCAGC GAAGGTAGCC GTCGGCGTGG CAAGACCGCT GGCGTACGGG CGGTGGCGGC CTCGCTCCTC GCCCTGATGG GCGCGGGCGC CCTGGCTCCG GCCTATGCAC TGCCGACCGA CCCCAAGGTG GTGGCCGGCA GCGGAGACAT CCAGTTGGTC GGCGCCAAGA GCATGTCGAT CAACCAGCAC ACCGAGAAGC TGATCACCAA CTGGGATTCC TTCAGCGTCG GCGCCGGCGA GCGAGTAACC TTCAACCAGC CAACCTCGGC TTCGATTGCC CTGAACCGGG TGCTCGGCAC CAGGGCCAGC GATATCCAGG GCAATATCGA CGCCAACGGC AAGGTATTCC TGGTCAACCC CAACGGCGTG CTCTTCGGCC GTAATGCCCA GGTCAACGTC GGTGGCCTGG TGGTTTCGAC GCAGGACATC AAGGACGCCG ATTTCCTGGC CAGCAAGTTC CAGTTCTCCG GCTCGTCGAC CAGCGAAGTG CTCAACATGG GGCGCCTGAG CGCGGCGGAA GGGGGATCCA TCGCCTTGCT GGGCGCCCGG GTCGACAACC AGGGGACGAT CCTGGCGCAG ATGGGCACCG CCGCTCTCGG CGCAGGTGGC GATTTCACCC TGAACTTCGA TGGCAACCGG CTGCTCGATA TCCAGGTCAA CGGCGCCGTC GTCGGTGCGC TGGCGAAGAA CGGCGGCCTG CTGAAGGCCG ATGGCGGCCA GGTGCTGATG ACCGCCAGGG CAACCACCAC GGTCCTCGGC TCGGTGGTGA ACAACCAGGG CGCCATCGAG GCCAGGAGTC TGCGCGGCCA GTCCGGCAAG ATCATCCTCG ATGGCGGTTC GGGCAAGGTG CTCGTGGCCG GCGCGCTGTC CGCCAACGCG CTGAACGAGC CGGGCCACGG TGGCACGGTG GAGATGAAAG CGGCCGAGGT CGAGGTCAAC CTGGCGACCC AGGTGAACAC CCTGGCCAGC AACGGCAACA ATGGCATCTG GAAGATCAGC GCCGACAAGG TCGACGTCCA TCGCACGGCG CTGGCCAGCG GCGGCACCGT GCATGTCGAT ACCCTGTCGC GCAACCTGGC GACCACCAAT ATCGAACTGA ACTCGACCAA GGGCGACCTG AACCTGAATG GTCCGGTGGC CTGGGCCTCG GGCAACCGGC TGGCGCTGAA CTCCGCCGGC GACCTGAACC TGAACGGCAA GCTGACCGCT ACCGGCGTCA ATGCCAGGCT GGGTCTGCAG GCCAAGGGCG GGATCGAGAT CAACGACAAG ATCGTGATCA GCGGCGCCTC CAGCGGCATG AGCCTGGAGG CCGGCAACGG ACATCGGGTG AACGGCAATG CGTCGGTCAC GCTGTCCGGC GCCAATGCGA GCTATGTCTC GGGCGGTTAT TACTACACGG TGGTCCAGAC CCTGGCGCAA CTGCAGGCGA TCAACACCAA CCTGGACGGC CTCTACGTCC TCGGCAACAA CATCCTGGGC AGCTACTACT GCACGGCATT GCAGTCCATT GGCGGTCCCG CCGGCGTCTT CACCGGAACC CTGGATGGTC TGGGCAACAC CATCGGCAAC CTCTCGATCA CCAACACCGG GGCCAACGTC GGCCTGTTCG CCCGCTCTTC CGGCACCCTG TCCAACCTGA AGCTGGACAA CCTGCGGGTC TCCGACAGCA ACTACGGCGC CGGGCCGTCC TCCCTCGGCG CGCTGGTGGG TATCAACAAC GGCCTGGTTT CCAATGTCAC CGCCACCCGG ATCAGCGTCA ATGGCAGCAG TTCGCGCTCC AACGCCGTGG GCGGCCTGGT CGGCCGCAAC AACGGCGGGC GGATTCGCGC CGCCAGCGTC AGCGGCACGG TCAGCGGCTA CGCGCCCACC ACCGCGATCG GCGGTTTGGT CGGCGAGAAC GCTTCCAATG GTCGGCAGGC GCTGATCGAG GACAGTAATT CCAGCGTGCA GATCGCCGCC AGGTCCACCG AGCGCAACAG CCTGGGCGGC GTCGGCGGCC TGGTCGGGTT GAATTCGCGC GGCGTGATCA ACAACTCCCA TAGCCAGGGC CGGGTCGATG CCTCCAGGGC CGGCCTCAAC GTGGGTGGCC TGGTCGGTTT CAACCTGATG GGCGAGATAT TCCGCAGCAG CGCCAGCGGC CAGGTGGTGG CGGGCAGCGC CGGCTATACT GGCGGCCTGA TCGGCTTGAA CAGCAACGGT GTGATTTCGC AGTCCCAGGC CAGCGGCATG GTTTTCAGCA GCAGCGGCCT GGCGACCGGC GGGTTGGTGG GCAGGAACGA GGGCGACAGC GAGCTGAGGA ACGTCAAGGC CAGCGGCAAT GTCACGGACA GCTACGGTAC GGACATCGGC GGCCTGATCG GAGTCAATAC CGCCGCGCGG GTGGATACCG CCGAGGCGAC CGGCAAGGTC ACCGGGGGCG CCAACAGCCG GGTCGGCGGT CTGGTCGGCA GCAACATCAG CGCCTCGCTC GATCATGTGA TCGCCCGCGG CGCCGTGTTC GGCGGCATCA ACAGCCAGGT CGGCGGAATC GCCGGCAGCA ACAGCGGCGA GATCTCCAGC GCGGATACCC GCGGCATCGT CTCCGCTGGC TCGAACAGTT CGCTCGGCGG CCTGGTAGGG ACCAACTTCG GCAGCATCAT GGCATCCAGC AGCAAGAACG ACGTGCGGGG CGGGAGCCGC AGCCGCATCG GCGGTCTGGT GGGCGAGAAC CAGATCCAGG GACGAATCAG CTCTTCCAGC TCGGAGAGCA CCGTCAGCGG CGACTACTAC ACCACCATGG GCGGTATTGC CGGAGTCAAC CTGGGGCGGA TCGAGTACTC CGGCGTCAGC GGCAGGATCA ACTTCAGGCC GCAATCCAAC TATGGGCAGA TCTACGGCTC CCTGGTGGGC GAGAACCGTG GCGTCCTCGT CGGCAACTAC GTGACCGGCG AGGCGGCGGT CCTGCCGCCT GCCGGCGTCG ACTACGGGCA AATCTGGTAA
|
Protein sequence | MNKCYALVWN ASQGCWNVAS EGSRRRGKTA GVRAVAASLL ALMGAGALAP AYALPTDPKV VAGSGDIQLV GAKSMSINQH TEKLITNWDS FSVGAGERVT FNQPTSASIA LNRVLGTRAS DIQGNIDANG KVFLVNPNGV LFGRNAQVNV GGLVVSTQDI KDADFLASKF QFSGSSTSEV LNMGRLSAAE GGSIALLGAR VDNQGTILAQ MGTAALGAGG DFTLNFDGNR LLDIQVNGAV VGALAKNGGL LKADGGQVLM TARATTTVLG SVVNNQGAIE ARSLRGQSGK IILDGGSGKV LVAGALSANA LNEPGHGGTV EMKAAEVEVN LATQVNTLAS NGNNGIWKIS ADKVDVHRTA LASGGTVHVD TLSRNLATTN IELNSTKGDL NLNGPVAWAS GNRLALNSAG DLNLNGKLTA TGVNARLGLQ AKGGIEINDK IVISGASSGM SLEAGNGHRV NGNASVTLSG ANASYVSGGY YYTVVQTLAQ LQAINTNLDG LYVLGNNILG SYYCTALQSI GGPAGVFTGT LDGLGNTIGN LSITNTGANV GLFARSSGTL SNLKLDNLRV SDSNYGAGPS SLGALVGINN GLVSNVTATR ISVNGSSSRS NAVGGLVGRN NGGRIRAASV SGTVSGYAPT TAIGGLVGEN ASNGRQALIE DSNSSVQIAA RSTERNSLGG VGGLVGLNSR GVINNSHSQG RVDASRAGLN VGGLVGFNLM GEIFRSSASG QVVAGSAGYT GGLIGLNSNG VISQSQASGM VFSSSGLATG GLVGRNEGDS ELRNVKASGN VTDSYGTDIG GLIGVNTAAR VDTAEATGKV TGGANSRVGG LVGSNISASL DHVIARGAVF GGINSQVGGI AGSNSGEISS ADTRGIVSAG SNSSLGGLVG TNFGSIMASS SKNDVRGGSR SRIGGLVGEN QIQGRISSSS SESTVSGDYY TTMGGIAGVN LGRIEYSGVS GRINFRPQSN YGQIYGSLVG ENRGVLVGNY VTGEAAVLPP AGVDYGQIW
|
| |