Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PSPA7_5707 |
Symbol | |
ID | 5356806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa PA7 |
Kingdom | Bacteria |
Replicon accession | NC_009656 |
Strand | + |
Start bp | 5875443 |
End bp | 5878418 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640814752 |
Product | adhesive protein CupB5 |
Protein accession | YP_001351026 |
Protein GI | 152986855 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA CCTACGCGCT GGTCTGGAAC CAGGCCACGG GATGCTGGAA CGTCGCCAGC GAAGGCACCC GCCGGCGCGG CAAGAGCGGC CGCGGCACCC TGCTGGCGGT CGCCGGCGCC TCGCTGCTGA GCCTGCTCGG CCTGCCCGAG GCCTTCGCCC TGCCCAGCGA TGGCAAGGTC ATCCATGGCG AGGCCGGCCT TCACACCTCC ACCGACGGCA AGCACCTGGC CATCGACCAG CAGAGCCAGA AACTGATCAC TCAGTGGAAC GGCTTCGATA TCGCCGGCGA TGAAAGCGTC CGCTTCAACC AGCCCAACAG CACCGCCGTC GCCCTCAACC GTGTGGTCGG CACCAACGGC AGCCAGATCC TCGGCAAGCT CGACGCCAAC GGCAAGGTGT TCCTGGTCAA CCCCAACGGG GTGGTGTTCG GCAAGACCGC CCAGGTCAAC GTCGGCGGCC TGGTCGCCAG CACCCTGGAC ATCTCCGACA AGGACTTCCT CGACGGCAAC TACCGCTTCA GCGGCAAGTC CACCGCCCAG GTGAGCAACG CCGGCCGCCT CAACGCCAGC GAAGGCGGCA GCGTCGCCCT GCTCGGCGCG CGGGTCGACA ACAGCGGCGT GATCCAGGCC CGCCTGGGCA GCGTCGCCCT CGGCGCCGGC GAGGACGTCA GGCTCAACTT CGACGGCAAC GGCCTGCTCA ACCTGCAGGT GAACGCCGGC GCGGTCGACG CCCTGGCGCA CAACGGCGGC CTGCTCAAGG CCGACGGCGG CCAGGTGCTG ATGACCGCGC GCAGCGCCGA CAGCCTGCTC AAGACCGTGG TCAGCAACCA GGGCGTGATC GAGGCGAAAA CCCTGCAGAA TCGAGACGGC CGCATCGTCC TCGACGCCGG CAACGGCACC CTCCAGGTCG CCGGCCGCCA GGACGCCAGC GCCAGCGGCC AGGGCAATGG CGGAGTGGTG GAAAACCGCG GCGCCAAGGT TGAAGTGCAC CAGTACGCCA AGGTCGATAC CCGTTCCAAG CAAGGCCAGA CCGGTACCTG GAAGATCGCC GCGAACAACC TGGAAGTCGC CAGCAGCGTG CTGCGCGACG CCGCAACGCT GAAGGCCAGC ACCCTGGCGG ACAACCTGGA AACCACCAGC ATCGAACTGG CCAGCACCCA GGGCGACCTC AAGGTCGACG CACCGCTGTC CTGGAACAGC GGCAACAAGC TGGGCCTGAG CGCCGAGCGC GGCAACGTCG AGGTGAACGG CAACCTGCGC GCCAGCGGCG ACCAGGCCGA ACTGGCGCTG AACGCGCGCG ACCAGGTACG CCTGAACGCC GACCTGTCCC TCACCGGGCG CAACGCCAGG CTGGAGCTGA ACAGCGGCAA AGGCCACAAG CTCGCCGACG GCGTGCGCGT GACCCTTTCC GGCGCCGGTG CCGGCTACCG TGCCAACGGC GAGGATTACC GGGTGATCCA GAACCTCGCG CAGCTGCGCG AGGTCGAAAA CGACCTGAAT GGCCGCTATG TCCTCGGCAC CCGTATCGAC GGCGGCCAGA CCCGCTTCGA AACCCTCGGC AGCCGTTCGA ACCGCGCCTT CTCCGGCACC TTCGACGGCC TGGGCAACAG CATCGCCAAC CTGGCGCTAT ACAGCAACGG CAACTGGGTG GGGTTGTTCA ACCTGAACAC CGGGCGCCTC GCCAACCTCA CCCTCGAAAG CATCGGCGCC AGCGTGGCAC ATCCCGCCAA CATGGAGGCA GGCAGCCTGG CCGCGCTCAA CCTGGGGCAC ATCGACAATG TACGGATTCG CAACGGCCAG GTCAGCGGCG CCGCACAGCG CAACCTACTC GGTGGCCTCG TCGCGCGGAA CTACGGCAGC ATTGCCAACA GCACCTTCCA GGGACGCATC AACGGCAGCC GCAACACCTA CGCGATGGGT GGGCTGGCCG GCTACAACGG CAGCCAGGCC CTGATCGCAG CCAGCTCGGC CGACATGGTG CTGGGCGGCC AACCGGCAGA AGCGAGCGCC GGCGCGCTGG TGGGCGTCAA CGCCGGTGGC CGTCTCCTCG ACAGCCATGC GCGCGGCAGC ATCGACCTGG CCGGCGACCG GCTGAACATC GGCGGACTGG TCGGCCACAA CCAGGGCGGC TTGCTGCGCG ACCTCGAATC CTCAGTGAAC GTTACCGCGC GCGGACGCGA CGGCCTGCTG GGCGGGCTGG TCGGTCTGAA CGAACAAGGC ACTCTGGAAC ACGGCTCGGC CAGCGGCAAT GTCAGCGGCT ACGGCAGCCA GGCGATCGGT GGCCTGGTCG GCAGGAACCT GAAGGGTACC CTGCGCAACA GTTCCGCCAG CGGCAAGGTG AACGACCTGA TCAGCACCCA GGTGGGCGGG CTGGTCGGAC ACAACCGCGA CGGCAACCTC GACTCGGTAT CGGCCTCCGG CAACGTCAGC GGCGGTGTGC GAGCCATGAT CGGCGGACTG GTCGGGCTCA ACCAGGGCGG CAGGCTCAGC GCCATCTCGG CCACCGGCAA CGTCAGCGGC CTTGGTGCGG GCAACGTCGG CGGACTGATC GGAGAAAGCC AGGACGCCCT GATCGTCGGC GCCCTGGCGC AGGGCAAGGT CTCGGGTGGC GCGCAGAGTT CGACGGGCGG GCTGATCGGT CGCCACGCCA GCGGCCGGGT GGAGCAGGCG ACAGCCCACG GCGATGTCGC GACCGGCCCC GGCGGCATTG CCGGCGGGCT GATCGGCTGG AGTGGCGGAG AGGTCGTCGC CACCTCCGCC TCCGGCGACG TTTCCAGCGA CCACGGCATG GTCCTCGGCG GGCTGGTGGG CTGGAACCAG GGCTCTGTGA GTCACTCGTC CGCCAGCGGC AAGGTCGATA CCGACGTCAG CATCCATGGC GGCCTGATCG GCCTGAACCT CGGCCAGCAG CACGCCAACA GCACCGAGGG CGAAGCGGCG AAGACCCGCC TGCTGGGCTA CAACGCACAA CGCTGA
|
Protein sequence | MNKTYALVWN QATGCWNVAS EGTRRRGKSG RGTLLAVAGA SLLSLLGLPE AFALPSDGKV IHGEAGLHTS TDGKHLAIDQ QSQKLITQWN GFDIAGDESV RFNQPNSTAV ALNRVVGTNG SQILGKLDAN GKVFLVNPNG VVFGKTAQVN VGGLVASTLD ISDKDFLDGN YRFSGKSTAQ VSNAGRLNAS EGGSVALLGA RVDNSGVIQA RLGSVALGAG EDVRLNFDGN GLLNLQVNAG AVDALAHNGG LLKADGGQVL MTARSADSLL KTVVSNQGVI EAKTLQNRDG RIVLDAGNGT LQVAGRQDAS ASGQGNGGVV ENRGAKVEVH QYAKVDTRSK QGQTGTWKIA ANNLEVASSV LRDAATLKAS TLADNLETTS IELASTQGDL KVDAPLSWNS GNKLGLSAER GNVEVNGNLR ASGDQAELAL NARDQVRLNA DLSLTGRNAR LELNSGKGHK LADGVRVTLS GAGAGYRANG EDYRVIQNLA QLREVENDLN GRYVLGTRID GGQTRFETLG SRSNRAFSGT FDGLGNSIAN LALYSNGNWV GLFNLNTGRL ANLTLESIGA SVAHPANMEA GSLAALNLGH IDNVRIRNGQ VSGAAQRNLL GGLVARNYGS IANSTFQGRI NGSRNTYAMG GLAGYNGSQA LIAASSADMV LGGQPAEASA GALVGVNAGG RLLDSHARGS IDLAGDRLNI GGLVGHNQGG LLRDLESSVN VTARGRDGLL GGLVGLNEQG TLEHGSASGN VSGYGSQAIG GLVGRNLKGT LRNSSASGKV NDLISTQVGG LVGHNRDGNL DSVSASGNVS GGVRAMIGGL VGLNQGGRLS AISATGNVSG LGAGNVGGLI GESQDALIVG ALAQGKVSGG AQSSTGGLIG RHASGRVEQA TAHGDVATGP GGIAGGLIGW SGGEVVATSA SGDVSSDHGM VLGGLVGWNQ GSVSHSSASG KVDTDVSIHG GLIGLNLGQQ HANSTEGEAA KTRLLGYNAQ R
|
| |