Gene Information |
Locus tag | PSPTO_2231 |
Symbol | |
ID | 1183882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas syringae pv. tomato str. DC3000 |
Kingdom | Bacteria |
Replicon accession | NC_004578 |
Strand | - |
Start bp | 2459655 |
End bp | 2462603 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637393612 |
Product | YD repeat protein |
Protein accession | NP_792052 |
Protein GI | 28869433 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.016497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCAA TCCATACTGC CCGATACAAC GCCTGGAATC AGTTGGAGCA GGAGACCGCC CATGACTGGC TGGGGGCCAA ACCCTTGGCC AGCAGCACCC TTGGCTACCG CTACGATGAC TGGAACCAGC GATGCTGCAC CACGACCGAT GACAACGTAC AGACTTATGA GTATTCAGAC CCGATCGGCA GCGACGTACA TAAAGGCCCA ATCCAGAAAA CCTGGAAACA GAGTGGCGAC CCGGAGGGCC GCATCAGTGG CCGCAGCGAA ACCTGGCTGA ATCTGTTCGG CAAACCGGAC CGGATCCGGA CGCTGACCGC TGGTAAAACG GGTCGCAGCC GCACGCACAG CATGAGCCGC AGCCGGAACC TGACCACGAC TGAGCAGGAA CTGAGCAGGC AGACCTTTCT GTACGACGGG CTGGGACGCT GCACCGAGCA GCGCGATGCA CTCCAGCAAA GCACCCTGTT CAGCTACGAC AACTGGTCAC GCATGGTCTC CTCCACGCTT GCAGACGGCA GCGTCATCAA CCGGAGTTAT GCGCCGCAAA GCAGCAGTGA GCTGGCAACG ATGCTCGAGG TCGTGCACCA GAACGGCACC ACCAGAACCG TGGCAGGTAC ACAGAAATTT GACGGGCTTG AGCGTGTGAC GCAGACCAAA ACAGGTGACC GCGTCGAACA GTTCAACTAC GACGCCGGTG AGATGCAGCC CAGGTCGCGC ACAACAGCCG GGCTGGACAA CATCAACTTT ACCTACACTC GGGCGCTCAC TGATCAGATT TTTTCCAGCA CGGCTCCGGA TGAAACGGCC AAATTCGATT ATGACAAGAC CAGTGCCCGC CTCATCGAAG CGACGAACCC GCAAGGCACG CGCACTTACC GCTATGACGT GCACAATCAA CTGACGGGAG AGACTTGGGA CAATCTGCTG GGTCAGGCTT GGGAAACCCG ACACCAATCA TCGCTGCTGG GTCGGCCGAT CAAGCGCACC GATCTCAAAA AAGGCGAGGC GGCGGGCGCA GAGACCCGTT ACGACTACGA CACGCTCGGC AGAATCAGGT TTATCAACCA GAGCAACCTG CGCACCACAA TCGACTATGA CGTGCTGGGC CAGCTCTGCA AGGTGGCCAC CGAGGACCTG CAGGCCGGAA CTGGCGTGAT CATCGACATG GAATACGACG ACCAGGGACA GGAAATTCTC AGAACCCAGA CCGCAAGCAA CCAAGCGGCG TTGACCTTGA CTCAAACGTG GGCAGTGGAC GGGCTTTTGA AAACCCGCGA CCTGCAACAG GCGGGTAGCC CCCTGCTGCA CGAAACGTTT AGCTACGACC CCAGAGGCCG CCTGACACTG GTGAATTACC TGGGTAGCAG CTTGCCGAGA GACGAACTGC AAAGGGAGAT GACCAGACAA ATATTCAGCT TCGACGAGCT GGACAACATT ACGCTATGCC AGACCAGGTT TACCGATGGC ACCTCTGAGC GAGCAGCTTT CAAATACGGC AGCCCCGGCG ACGATAAGCA TAAAGACCGC TGCCAGCTTT TGAGTATTGC CTACACGCCG CCCAGAAAAA CACCGGACCC GACATTCAGT TACGACGCCA ACGGTAACCA GCTTAAAGAC GAGCATGGCA ACAGTCTGCA TTACGATAGC CAGAGCCGCC TGCTGCAGGT CGCAGAAACC GGCGGTGCCC CTATCAGCCA ATACCGTTAT GACGGCCACA ATCAACTGGT CGCCACCAGG GATGGCAATG AAAGCGAGAT TTTGCGGTTC TATGAGGGTC ATCAACTGAG CAGCACGGTG 
CAGGAAGATC AACGCACTCA GTACCTGCAT CTCGGCGAAC AGCCGCTGGG CCAGCAGATT GTGGACGACG CCGAGCAAAC CCTGTTGCTA CTGACTGACG CAAACCAGAG CGTTATGGGT GAATTTCAAC AAGGCCAGCT GCGCAAGGCG GTCTACAGTG CCTACGGGGA GCGCCACAGC GAGGAGGCGC TGCTGAGCAC TGCCGGGTTT AACGGTGAAG TACGCGAAGC CGCCAACGGC TGGTATCTGT TGGGCAATGG CTACCGGGCC TACAACCCTC TCCTGATGCG CTTCCACAGC CCGGATTTTC TCAGCCCCTT CGCCGAAGGC GGCGTCAACC CCTACACCTA CTGCCTGGGC AACCCCATCG CCCTGCGCGA CCCGACAGGA CATGATGCCA GCGGTCAGAC TGGCCGGTTG AGACGGCCCG ATGAGGGGGC TTTGCCAATG CAACAAGGTG GCGGAGATAT CATGGGTTGG GTGGGTGTAG GAATAGGCGT TGTTTTCACC GTATTGGGCG TTGCCGCTAC CATAGCCACG TTAGGAACAG CCACACCGGT TACCGGCCCG GTAACTGTCC TGGGCATTTC CATGACCGCC AGCGCTGCCG CGGCCGTTTC GACAGTCTCG ACCGGTGCGT TGATCGTCGG TACGGCATTG ACAGCGGCTT CAACTACGGC CAATACAGTT GCCATTGTAA ATAACGATCA GACGGCCGGA GAAGTCGGCG GCTGGTTGGG TATTGCCGCT GTGCCCGTTG GCTTGGTAGG GTTTGGCGCG GGGGCTGTGG TGGCGAGGGC AGTTGCGGCT GCGGCTAAAG TTGCGGCTGC CAACGCTGGT ACGATCGGTG TCCGCAGCGT CAGCAGAATA GGCCTCGCTG CTGCTGGTGC CCGCAGAACC ATTTCCAGCG CTGCCAGCAG CGCTCGGCGC CAAATCAGCA ACATGTTAGG CAGAATCTTA CCCCGTGCTC TAAACAGGAC TGCTGCTACT GCACGCCGGA TTCCAAGCGT TACAAGTGGC GGATCAGGAC CAGGGCCATC ATTATTTACA CAGACTACCT TTAACGAATC GATTGGGATG ACGCAGACCA CTATTTTTTC AACGAATGCG AGCGGAATCC CACCGGCCAC GCAGGTAACT CGAATCTAG
|
Protein sequence | MKPIHTARYN AWNQLEQETA HDWLGAKPLA SSTLGYRYDD WNQRCCTTTD DNVQTYEYSD PIGSDVHKGP IQKTWKQSGD PEGRISGRSE TWLNLFGKPD RIRTLTAGKT GRSRTHSMSR SRNLTTTEQE LSRQTFLYDG LGRCTEQRDA LQQSTLFSYD NWSRMVSSTL ADGSVINRSY APQSSSELAT MLEVVHQNGT TRTVAGTQKF DGLERVTQTK TGDRVEQFNY DAGEMQPRSR TTAGLDNINF TYTRALTDQI FSSTAPDETA KFDYDKTSAR LIEATNPQGT RTYRYDVHNQ LTGETWDNLL GQAWETRHQS SLLGRPIKRT DLKKGEAAGA ETRYDYDTLG RIRFINQSNL RTTIDYDVLG QLCKVATEDL QAGTGVIIDM EYDDQGQEIL RTQTASNQAA LTLTQTWAVD GLLKTRDLQQ AGSPLLHETF SYDPRGRLTL VNYLGSSLPR DELQREMTRQ IFSFDELDNI TLCQTRFTDG TSERAAFKYG SPGDDKHKDR CQLLSIAYTP PRKTPDPTFS YDANGNQLKD EHGNSLHYDS QSRLLQVAET GGAPISQYRY DGHNQLVATR DGNESEILRF YEGHQLSSTV QEDQRTQYLH LGEQPLGQQI VDDAEQTLLL LTDANQSVMG EFQQGQLRKA VYSAYGERHS EEALLSTAGF NGEVREAANG WYLLGNGYRA YNPLLMRFHS PDFLSPFAEG GVNPYTYCLG NPIALRDPTG HDASGQTGRL RRPDEGALPM QQGGGDIMGW VGVGIGVVFT VLGVAATIAT LGTATPVTGP VTVLGISMTA SAAAAVSTVS TGALIVGTAL TAASTTANTV AIVNNDQTAG EVGGWLGIAA VPVGLVGFGA GAVVARAVAA AAKVAAANAG TIGVRSVSRI GLAAAGARRT ISSAASSARR QISNMLGRIL PRALNRTAAT ARRIPSVTSG GSGPGPSLFT QTTFNESIGM TQTTIFSTNA SGIPPATQVT RI
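The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive, and that the gene sequence includes its stop codon (the listed sequence ends in TAG). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but is not translated.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    # Values taken from the Gene Information table of this record.
    length = gene_length_bp(2459655, 2462603)
    print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (2949 bp) and Protein Length (982 aa). The strand (-) does not affect these lengths, only the orientation in which the sequence is read.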