Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PSPTO_1947 |
Symbol | |
ID | 1183592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas syringae pv. tomato str. DC3000 |
Kingdom | Bacteria |
Replicon accession | NC_004578 |
Strand | + |
Start bp | 2131534 |
End bp | 2134440 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637393325 |
Product | glycosyl transferase, group 2 family protein |
Protein accession | NP_791770 |
Protein GI | 28869151 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAGCC ATTCTGGAAA TGAAACCACC CCGTTGAGCC AGCGTCTGAC CCTGCTGTTG CTTGCCGAGA ATCAGTCGGA CTTTCTGCGT CGTGCCCTGA AGTACTACAG CAGCTACCCG TGCAATCTGA TTGTGGTCGA TGCTTCGCCT GCGCCCGACG CGCAGGTTGC CGAGCGTACC GGCCTGCAGT ACATCCACAA TGCCGCACTC AGTGAAGCCG GACTCAGCGC CAGAATTGCA GAAGGCCTCA AGCAGGTGTC CACCCCCTTT GTGGCGTACG CACAGGTCGA CAGTTTCCTG CTGCCTGATG CGCTCACCGA AGCTGTGAAT TTCCTTGAAG CCAACGAGCA TTATGGCGCT TGCCAAGGCT ATAGCCTGGG TTATCAGGCC TACGTCGATC GCGTGGATTA TTTTCGCCGC GACCGCAAGG TTCATGAGGA CTACGCGTCT GATCAGGCCG ATGAGCGGCT TCTGAGCTTT ATGGGTCAGA GCGTGTCACT GCTCAACGCC GTGACCCGGA CCGGGCTGGC GCGTCAGTGG TTCACGTCGG TGCCTGAGGG CACCGAGCTG CACTGGCAGG ACATCGGCCA GATGAGCTTT CTGGTTGCGG CTGCTGCGCT ACGGATCCTG CCCATTCCCT ATGCGTTGCA CGTCAACGCT GAAAAGGAGG CAGAGCATCG CTATGGCGCT GCCATTGCAG GGGCGGTCAA ACATATCGAC CCCAAGGCGA AAGCCGAGCG CGAAACGCTT GCGCGCAGGG TTGTCTCGAC GCTGGGCAAT GTGTCGGGTT TTGACGGCGA GCAGGGCGCG CAGCACATAT TGGCCGGCCT CGCAGCCATG GCTGAATGCC TCGAAACCCA GCCCTATCAG GCGGGCGAAA AAATTATCAG TTCAGTGTGG AACGTCGCAC TGGCGCAGGC CGATGCGCTG TTCGAGCCTC GTCAGTTCGT TGAACTGCCG TTCTACAACC AGCCACTGTT CGATGAACTG GCGAGAATCG AGTTTTTGAT CCATCTGTTG CCGGCAGGTC ATGCGCAACT GGAAGGGCTT GAGGCGGTGC TGGTCAAACA GGCCGAGCTG CTGCGTGTGC AGAGCAACCC GGACGCCGAG TCACTGTTGA GCCGTCTGTG GCAAGCCTAC GCAAGATACG CCTTCAGCCA CAGTGTCGTG CAAAGTCTGG CCGATGAGTT GGGCAAGTCC GATGACGAGG AAGACATTGA ATGCGCCGAG CGCATCGTTG CCTGGGGCCA GCGTTTGCAG TCTGATTCCG CATTCGATAA CAGCCAGTTG CTGAAAAGTA TGGCTTCCGG CAAGTTGCTC GACTGGCTCG ATTCACGTGA TCCGGCCCCG GCACAACTGA AAAAGCTGAC TGCGCAGCTG GCGCGTCGGC CAGCAGGTTC GCAGATCGGT ATTTTGCTGC TTGATCTGGA AGCCGATGTG TTCAAGCTGC AGGCAACTTT CGACAGCCTG ATCAACGGCC ATTATCGGGG TTTCAAAGTC GTGGTGTTCA CCACCGGTGA CCTGCCGGCA ACCACCACCT TGAATGACAC CCTGCACTTC GTCAAAGTCG CCGAAAACAA TTATGTCGAC AAGATCAATC AGGTCGTCAA ACAGGCGTCC AGCGACTGGC TGATGCTGGC GCAGGCGGGT GAAGAGTTCA CCCGCAGCGG TCTGTTGCTG GCCAGCGCCG AGCTGATTGA CGCTGCGCAA TGCCGGGCGG TCGCGGTGGA TGAAATTCAC CGTCAAGCCA ATGGCACCCT GGCACCGTTG TTCCGTCCGG GCTTCAACCT GGATCTGTTG CAAAGTTTGC CGACGCTGGC CGCTCGGCAC TGGCTGGTTC GTCGTGATTT GCTGGTCGAG GCGGGCGGTT ATTCGCGAGA GTTTCCAAAG GCGCTGGAAT TCGACCTGCT GCTGCGCTTG ATCGAACAGG GCGGCATGAG CGGTCTGGCG CACTTGAGCG AGCCGGTATT GATCTGCGAT GCGCCCGAGC TTGAAGACAA TCAGGACGAG CAAAAAGCGC TGACCCGTCA TCTGGGCCAG CGTGGTTATC AGGCGGAAGT CAGTTCGGCA TTGCCCGGCA CCTACAAGAT TGATTATCGC CATGCTCATC GGCCGATGGT CTCGATTCTG CTGCACAGCC AGGACAATCT GCCGCAATTG CAGCGTTGCC TGCACAGCAT CCTGCAGCGC ACCCGCTATC AGCGCTATGA AGTGTTGATT GGCGATAACG CCAGTACCTC TGCCGAGCTG TCGACCTGGC TTGACCAGCA GCAACAGTTG TCCAGCCGGG TGCGCGTATT CCGCGCCGAT CAGCGTGTGA GTACGGCAGC GTTGCGCAAC CTGGTCAGCC AGGAAGCCAA GGGTGAGTAC CTGATTCTGC TGGATGCCGA GAGCCAGATC GTCAACGTGG GCTGGATCGA GTCACTGCTC AATCAGGTAC AGCGCCCGGA AGTCGGCGTG GTCGGCGCAC GTCTGGTGGA CCGCGAGGGC ACGGTGACCC AGGCGGGCCT GATTCTGGGG CTCAATGGTG GCGTGGGCTC GGGCTTTGTC GGCGAGCCGA AGACGTCCCG AGGTTACATG CAACGCCTGG TGGTGGAGCA GAACTACTCT GCCGTGTCAT CAGCGTGCCT GATCATTGCC AAGGAGCTGT TCGACGCGCT GGGTGGCCTG GATGAAGAGG TGTTTGCCGA GTCGCTGGGT GACGTCGATT TGTGTCTCAA GGCCGCGCAG GCAGGCTATC TGACTGTCTG GACCCCGCAT GTTCAAGTGG TGCATTCGGG TGTGGTGCAT GCGCCCGAAC AAACCCTTGG CGCGTTGATC GGCAAATGGT CGGCGCAGTT CGCGCAGGAC GAAGCCTATA ACGCCAATCT CGACCGCAAT GACCGTGGGT TCACGCTGGC GGTTTAA
|
Protein sequence | MQSHSGNETT PLSQRLTLLL LAENQSDFLR RALKYYSSYP CNLIVVDASP APDAQVAERT GLQYIHNAAL SEAGLSARIA EGLKQVSTPF VAYAQVDSFL LPDALTEAVN FLEANEHYGA CQGYSLGYQA YVDRVDYFRR DRKVHEDYAS DQADERLLSF MGQSVSLLNA VTRTGLARQW FTSVPEGTEL HWQDIGQMSF LVAAAALRIL PIPYALHVNA EKEAEHRYGA AIAGAVKHID PKAKAERETL ARRVVSTLGN VSGFDGEQGA QHILAGLAAM AECLETQPYQ AGEKIISSVW NVALAQADAL FEPRQFVELP FYNQPLFDEL ARIEFLIHLL PAGHAQLEGL EAVLVKQAEL LRVQSNPDAE SLLSRLWQAY ARYAFSHSVV QSLADELGKS DDEEDIECAE RIVAWGQRLQ SDSAFDNSQL LKSMASGKLL DWLDSRDPAP AQLKKLTAQL ARRPAGSQIG ILLLDLEADV FKLQATFDSL INGHYRGFKV VVFTTGDLPA TTTLNDTLHF VKVAENNYVD KINQVVKQAS SDWLMLAQAG EEFTRSGLLL ASAELIDAAQ CRAVAVDEIH RQANGTLAPL FRPGFNLDLL QSLPTLAARH WLVRRDLLVE AGGYSREFPK ALEFDLLLRL IEQGGMSGLA HLSEPVLICD APELEDNQDE QKALTRHLGQ RGYQAEVSSA LPGTYKIDYR HAHRPMVSIL LHSQDNLPQL QRCLHSILQR TRYQRYEVLI GDNASTSAEL STWLDQQQQL SSRVRVFRAD QRVSTAALRN LVSQEAKGEY LILLDAESQI VNVGWIESLL NQVQRPEVGV VGARLVDREG TVTQAGLILG LNGGVGSGFV GEPKTSRGYM QRLVVEQNYS AVSSACLIIA KELFDALGGL DEEVFAESLG DVDLCLKAAQ AGYLTVWTPH VQVVHSGVVH APEQTLGALI GKWSAQFAQD EAYNANLDRN DRGFTLAV
|
| |