Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PSPA7_0043 |
Symbol | |
ID | 5356995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa PA7 |
Kingdom | Bacteria |
Replicon accession | NC_009656 |
Strand | + |
Start bp | 42719 |
End bp | 44503 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640809097 |
Product | adhesin/hemagglutinin |
Protein accession | YP_001345439 |
Protein GI | 152984928 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATCC GCAGCCCCCT GAACCAGTGC ATCGCCCTGA CCCTGGCCGG CATCCTGTTC CTCGACCCGA TCGTCGCGGC GGCGGCGCAG TTGGCGGTGG ACGCGGCGGC CGGCGGCAAT ACCAGCCTCG GCCAGGCGGG CAACGGCGTG CCCATCGTCA ACATCGCCAC GCCCAATGGC GCCGGGCTGT CGAACAACCA TTTCCGCGAC TACAACGTCG GCGCCAACGG GCTGATCCTC AACAACGCCA CCGGCAAGAC CCAGGGCACC CAGCTCGGCG GGATCATCCT CGGCAACCCG AACCTGCAGG GCCAGGCGGC GCAGGTGATC CTCAACCAGG TCACCGGCGG CAACCGCAGC ACCCTGGCCG GCTACACCGA GGTGGCCGGG CAGTCGGCGC GGGTCATCGT CGCCAACCCG CACGGCATCA CCTGCCAGGG CTGCGGCTTC ATCAACACGC CGCGCGCCAC CCTCACCACC GGCAAGCCGA TCATGGACGG CCAGCGCCTG GAGCGCTTCC AGGTGGACGG CGGCGACATC GCCATCGAAG GCGCCGAACT GAACGTCAGC AACCTCGAAC AGTTCGACCT GATCACCCGC AGCGCCACGC TCAACGCCAA GCTCTACGCG AAGAACCTGA ACATCGTCAC CGGCCGCAAC GACGTCCAGG CCGACAGCCT GCAGGCCACT CCGCGCGCCG CCGACGGCAG CGAAAAGCCG CGACTGGCGA TCGACAGCTC GGCGCTGGGC GGGATGTACG CCGGGGCGAT CCGCCTGGTC GGCACCGAGC AGGGCGTGGG GGTGAAGCTG GCCGGCGACA TGGCCGCCAG CGGCGGCGAC ATCCGCATTG ACGCCAGCGG CAGGCTGAGC CTGGCCCAGG CCTCCAGCCA GGGCGACCTG AAGATCGCGG CCCAGGCCGT GGAGCTGAAC GGCAAGACCT ACGCCGGCGG TAGCGCCGAG ATCCGCGGCG CGGAGGAGCT GGTCAACCGG CAGAGCCTGG CGGCGCGCGA GCGCATCGCG CTGGAGGCGG CGCATATCGA CAATGCCGGG GTGATCGAGG CCGGCGTCGA TCCGGACAAC CGGCGCAACG CCCGCGGCGA CCTCGGCCTG GACAGCCGCA CCCTGCGCAA CGCCGGCAGC CTGGTAGCCA GCCGCACGCT GGAGGCCAGG GCCAGCCAGG CGCTGGACAA CCAGGGCGGC AGCTTGAAAG GCGCGACCAC CCGGGTGGCC GCCGGAGACC TGGACAACCG TGGCGGCAAG TTGCTCGCCG AGGGCGAACT GCGGGTCGCG GCGGCGAACC TGGACAACCG CCGGGGCGGG CTGTTGCACA GTCGCGACCG CACCGTGGTC GCGGCGGCCG GCAAGCTCGA CAACCGCGGC GGCCAGGTGA TCGGCCTGAA CGACCTGGAA GTCGGCGCGG GGACGCTCGA CAACAGCCAG GAAGGCCTGC TCGGCAGCCA GCAGCGACCC GCGTCAGCGC CCAGGCGCTG GACAACCGGG CGGATGGCGA AATTTCCGGC AAGCGCGCCG AGGTGCGCCT CGGCAGCCTG GACAACCGTG GCGGCAAGCT GCTCGGCGAC GACCTGCTGG TGGTCGCCAG CGGCGCCATC GACAACCGCC TCGGCCTGTT CTCGGCGGCC AACCGCCTCG ACCTGCAGGC CGGCAGCCTG GACAACAGCG GCAAGGGTAC GCTGGCCAGC CAGGGCGGGC TGGTCGCCAG CGTCGCCGGC CTGCTGGACA ACCGCGACCA GGGCAACCTG CTCAGCCAGG GCGCGCAGCG CGTGA
|
Protein sequence | MDIRSPLNQC IALTLAGILF LDPIVAAAAQ LAVDAAAGGN TSLGQAGNGV PIVNIATPNG AGLSNNHFRD YNVGANGLIL NNATGKTQGT QLGGIILGNP NLQGQAAQVI LNQVTGGNRS TLAGYTEVAG QSARVIVANP HGITCQGCGF INTPRATLTT GKPIMDGQRL ERFQVDGGDI AIEGAELNVS NLEQFDLITR SATLNAKLYA KNLNIVTGRN DVQADSLQAT PRAADGSEKP RLAIDSSALG GMYAGAIRLV GTEQGVGVKL AGDMAASGGD IRIDASGRLS LAQASSQGDL KIAAQAVELN GKTYAGGSAE IRGAEELVNR QSLAARERIA LEAAHIDNAG VIEAGVDPDN RRNARGDLGL DSRTLRNAGS LVASRTLEAR ASQALDNQGG SLKGATTRVA AGDLDNRGGK LLAEGELRVA AANLDNRRGG LLHSRDRTVV AAAGKLDNRG GQVIGLNDLE VGAGTLDNSQ EGLLGSQQRP ASAPRRWTTG RMAKFPASAP RCASAAWTTV AASCSATTCW WSPAAPSTTA SACSRRPTAS TCRPAAWTTA ARVRWPARAG WSPASPACWT TATRATCSAR ARSA
|
| |