Gene Information |
Locus tag | PSPTO_3210 |
Symbol | |
ID | 1184867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas syringae pv. tomato str. DC3000 |
Kingdom | Bacteria |
Replicon accession | NC_004578 |
Strand | - |
Start bp | 3604010 |
End bp | 3606985 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637394562 |
Product | filamentous hemagglutinin family protein |
Protein accession | NP_792996 |
Protein GI | 28870377 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) |
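The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the gene span includes the terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for PSPTO_3210.
length = gene_length_bp(3604010, 3606985)
print(length, protein_length_aa(length))  # 2976 991
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.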
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.379337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGCAAG CCAGTGCCAT TACCTCGGCC GTTTCTGCCG CAGCGCAAAC TGTTACCCGA GTAGAAGGCC TGCCCAGCAG CAACTTTGTT TCCAAACCGC AAAAATACCT GATCGAAACC AACCCGGTCC TCACCGAACT CAAGCAGTTC CTGAGCTCGG ATTACCTGCT GGCAGGTCTG GGCTACGACC CGGAAGTCAG CGCCAAGCGT CTGGGCGATG GTTTGTATGA ACAGCGCCTC GTCCAGCAGG CAGTCGTAGC CCGAACCGGG CAGGCGTTCA TCGACGGCCA AACCTCAAAC GAGGCCCAGT TCAAGTACCT GATGAACAAC GCCATCGCCA GCAAACAGCA GTTGAACCTG GCGGTCGGCG TATCGCTGAG TTCACAACAA GTCGCTGCAC TGACCCACGA CATCGTCTGG CTCGAAGAGC ATGAAGTGAA TGGCGAAATG GTACTGGTGC CTGTTCTCTA TCTGGCTCAG GCCGACGGTC GCCTTGGTCC GACCGGTGCG TTGATTGCCG GTAACGACGT CTCGCTGATT GCCGGGCAAA ACCTCGACAA CGTCGGCACC TTGCGTGCAG CCAATAATCT GTCGGCAGTG GCGGGCAACA ACCTGGTCAA CACAGGGCTG ATAGAGGCTG GCAACCGTCT GGACCTGCTG GCGGGTAATG ACCTGATCAA CACCGCTGGC GGCATCATCA AAGGCCGTGA TGTCTCACTG ACCGCCATCA ATGGCGATGT GATCAATGAG CGCAGCATTA CTTCGATGGA CAACAGTGCG CGCGGTCAGC GCCACAACGA GTTCGCCGAC AGTGCCGCGC GCATCGAAGC CGCCAATGAC ATGAGCATTT CGGCGGGTCG CGACGTCATC AACAAAGGCA GCGTGCTGGA AAGCGGCCGC GACATGAGCA TCCAGGCCGG ACGCGACGTC ACCATCGCCC CGACCGAAGT CACCAACAGC CTGTTCTCGG ACAGCAAACA CAACAGCAGC GACATCACCC AACTGGGCTC CACCGCCAGT GCCGGTCGCG ACCTGACTGT TCAGGCCGGT CGCGACATCT CCGTCATCGC CAGCCAGATC GACGCCAAAC GCGACATCGC CATGGCCGCG ACCGAAAACC TCACCATCAG CTCGGCGGCG GACGAAGAAC ACTCCCTGTC GAAAAGCAAA AAACTGACCC GCCAGGAAGA CCACGTCAGC CAGATCGCTG CTGATCTTGA TGCAGGCGGC AGCGTTGCCT TGCAGGCTGG GCAGAACCTT GCGGTGATCT CCAGCCGTAT TACGGCCGGG AAAGAGGCGT ATCTGGTAGC CGGCGATCAG CTGGATATTT TGGCTGCGCA GGACAGTGAT TATTCGCTGT ACGACAAAAA GAAGAAGGGG AGTTTTGGGG CAAAGAAAAC CAAGCGTGAC GAAATCACTG ATGTGAAAAA CATCGGCAGT GAGATCACCA CGGGTGGTGA CTTGTTGCTG GCGAGCGGCG GAGATCAGAA GTATCAAGTT GCGAAGCTTG AGAGCGGCAA GGACCTGACG ATTGATAGTG GCGGTGCGGT TACGTTTGAG GGCGTGAAGG ACCTGCATCA GGAGAGTCAC GAGAAGAGCA AAAGTGACCT CGCCTGGAAC TCGAGTAAAG GCAAGGGTAA CACCGACGAA ACCCTGCGCC AGAGCGAATT GGTGGCCAAG GGCGAGTTGG CAATCCGGGC CGTGGAAGGT CTGAAAATCG ACATCAAGCA GGTTGACCAG CAAAGCGTTA GTCAGACTAT TGATGCAATG GTCAAAGCGG ATCCGCAACT GGCGTGGCTC 
AAGGAGGCAG AGCTGCGCGG TGACGTGGAT TGGCGGCAGG TCAAAGAAGT ACATGATTCG TTGATAGGTA CTGCTGTAAA CGCTGCAGTT GTAGCAGGGT TAACGTCCGC CACCTCTCAG GCAGTGATAA GCACAGTTAA TAATAAAGGT AACCTTGGCG CAGCGCTGAA AGATGTGACA TCTGCTGAAA GCATGAAAGG CTACTTGGTA AGTGGGCTGG CGGCTGGGTT TGCCGCGGGA ATTCTTGACC CTGCATATGG TGTTTCTCCA GAAAACACAG CGAAAGCAAC TCATGGCTTT GATCTGGGAA GTGTCGATGG TTTTACAAAC TATGCAGGTT ATACGCTTGC GCAAGGAGGG TTCAGCGCTG CAGCAAATAC TGCAATAAAT GGGGGAAGCC TGACGGATAA TCTTGCCCAA GCTGCAATCA GTTCTGCCGC AGATGCTATG TCAGCAGGTA TCTACAACAA GCTGGGAACA AAATTGGAGT TTTCAGGGCT TCCCTCAAAG CTGGCTGCGC ACGCTCTTGT TGGCGGATTG ATCGCTGAGC TTGCTGGCGG TGATTTCCGT TCTGGAGCCT TGGCTGCTGG AGCGAATGAA GCATTTGTAA ATTTGGTGGG CGACAAGATT TTTGTAGGGG AATCCCACGA TAAACTATTG GCAATGACCT CACAGCTAGT TGGCTTAACA GTTGCGGCCG CCGCGGGTGG GACTGATAAA GACCAAGCCG TCGCCGGTTG GGTTGCTCAG CAGGCCACGA CCTTCAATTA CCTTGAGCAC AGTGAGAAAG AAGCGTTTAT AAAAGAAATG CTTGGCTGCG ATACTGACAA GTGTGCGAGA GAAAAATGGG AGCAGGGCAA GTTTGATGAG GACAGCCAAG CAAACGTTCA ATACGCCAAC GACATAGCTG GATCCCAGCG TGCAAGAGAG ACTAGAGATC GAGTTCTGGA TTCGCTGGGT TCAATCCTGG ACATGAATTG CCCTACGAGT GCTTGCGAAG GTTACAAACA GCTCTTGATG GAGCGCTCAC TGGGCACGCT GAAAAACCTG AACCAGGTTA TCAAGGACTG GGCTTCCGTC GATCAGCGAC TCGGGTTGAT GGCAGGCGCG GCAGTGGGGG GGCAAGCCAA CTACAAATGC GGTTCTACAG CCAGCAGAAG CGCTGGCGTT GACTAG
|
Protein sequence | MAQASAITSA VSAAAQTVTR VEGLPSSNFV SKPQKYLIET NPVLTELKQF LSSDYLLAGL GYDPEVSAKR LGDGLYEQRL VQQAVVARTG QAFIDGQTSN EAQFKYLMNN AIASKQQLNL AVGVSLSSQQ VAALTHDIVW LEEHEVNGEM VLVPVLYLAQ ADGRLGPTGA LIAGNDVSLI AGQNLDNVGT LRAANNLSAV AGNNLVNTGL IEAGNRLDLL AGNDLINTAG GIIKGRDVSL TAINGDVINE RSITSMDNSA RGQRHNEFAD SAARIEAAND MSISAGRDVI NKGSVLESGR DMSIQAGRDV TIAPTEVTNS LFSDSKHNSS DITQLGSTAS AGRDLTVQAG RDISVIASQI DAKRDIAMAA TENLTISSAA DEEHSLSKSK KLTRQEDHVS QIAADLDAGG SVALQAGQNL AVISSRITAG KEAYLVAGDQ LDILAAQDSD YSLYDKKKKG SFGAKKTKRD EITDVKNIGS EITTGGDLLL ASGGDQKYQV AKLESGKDLT IDSGGAVTFE GVKDLHQESH EKSKSDLAWN SSKGKGNTDE TLRQSELVAK GELAIRAVEG LKIDIKQVDQ QSVSQTIDAM VKADPQLAWL KEAELRGDVD WRQVKEVHDS LIGTAVNAAV VAGLTSATSQ AVISTVNNKG NLGAALKDVT SAESMKGYLV SGLAAGFAAG ILDPAYGVSP ENTAKATHGF DLGSVDGFTN YAGYTLAQGG FSAAANTAIN GGSLTDNLAQ AAISSAADAM SAGIYNKLGT KLEFSGLPSK LAAHALVGGL IAELAGGDFR SGALAAGANE AFVNLVGDKI FVGESHDKLL AMTSQLVGLT VAAAAGGTDK DQAVAGWVAQ QATTFNYLEH SEKEAFIKEM LGCDTDKCAR EKWEQGKFDE DSQANVQYAN DIAGSQRARE TRDRVLDSLG SILDMNCPTS ACEGYKQLLM ERSLGTLKNL NQVIKDWASV DQRLGLMAGA AVGGQANYKC GSTASRSAGV D
|
| |
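The gene sequence is displayed in space-separated blocks of 10 bases and begins with GTG, while the protein begins with M. A minimal sketch of how the two fields relate, assuming the sequence is shown on the coding strand and that translation table 11 uses the standard codon assignments with GTG among its permitted initiator codons (read as methionine at position 1). The helper names are hypothetical, and only the first three blocks of the sequence are used here.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 shares them.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons listed for translation table 11.
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the display."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS_11:
            aa = "M"  # an alternative start codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


prefix = "GTGGCGCAAG CCAGTGCCAT TACCTCGGCC"  # first 30 bases of the gene
print(translate(prefix))  # MAQASAITSA
```

Running `gc_fraction` on the full gene sequence, rather than on this short prefix, is one way to compare against the GC content field of the record.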