Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PXO_00223 |
Symbol | talC2A |
ID | 6305131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthomonas oryzae pv. oryzae PXO99A |
Kingdom | Bacteria |
Replicon accession | NC_010717 |
Strand | - |
Start bp | 1650351 |
End bp | 1653557 |
Gene Length | 3207 bp |
Protein Length | 1068 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642638902 |
Product | TAL effector AvrBs3/PthA |
Protein accession | YP_001912778 |
Protein GI | 188575849 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.973214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCCA TTCGTTCGCG CACGCCAAGT CCTGCCCGCG AGCTTCTGCC CGGACCCCAA CCGGATAGGG TTCAGCCGAC TGCAGATCGG GGGGGGGCTC CGCCTGCTGG CGGCCCCCTG GATGGCTTGC CCGCTCGGCG GACGATGTCC CGGACCCGGC TGCCATCTCC CCCTGCGCCC TCGCCTGCGT TCTCGGCGGG CAGCTTCAGC GATCTGCTCC GTCAGTTCGA TCCGTCGCTT CTTGATACAT CGCTTCTTGA TTCGATGCCT GCCGTCGGCA CGCCGCATAC AGCGGCTGCC CCAGCAGAGT GCGATGAGGT GCAATCGGGT CTGCGTGCAG CCGATGACCC GCCACCCACC GTGCGTGTCG CTGTCACTGC CGCGCGGCCG CCGCGCGCCA AGCCGGCCCC GCGACGGCGT GCGGCGCAAC CCTCCGACGC TTCGCCGGCC GCGCAGGTGG ATCTACGCAC GCTCGGCTAC AGTCAGCAGC AGCAAGAGAA GATCAAACCG AAGGTGCGTT CGACAGTGGC GCAGCACCAC GAGGCACTGG TGGGCCATGG GTTTACACAC GCGCACATCG TTGCGCTCAG CCAACACCCG GCAGCGTTAG GGACCGTTGC TGTCACGTAT CAGGACATAA TCACGGCGTT GCCAGAGGCG ACACACGAAG ACATCGTTGG CGTCGGCAAA CAGTGGTCCG GCGCACGCGC CCTGGAGGCC TTGCTCACGG AGGCGAGGGA GTTGAGAGGT CCGCCGTTAC AGTTGGACAC AGGCCAACTT CTCAAGATTG CAAAACGTGG CGGCGTGACC GCAGTGGAGG CAGTGCATGC ATGGCGCAAT GCACTGACGG GTGCCCCCCT GAACCTGACC CCGGACCAAG TGGTGGCCAT CGCCAGCAAT ATTGGCGGCA AGCAGGCGCT GGAGACGGTG CAGCGGCTGT TGCCGGTGCT GTGCCAGGAC CATGGCCTGA CCCCGGACCA GGTCGTGGCC ATCGCCAGCA ATGGCGGCGG CAAGCAGGCG CTGGAGACGG TGCAGCGGCT GTTGCCGGTG CTGTGCCAGG CCCATGGCCT GACCCCGGAC CAGGTCGTGG CCATCGCCAG CAATAACGGC GGCAAGCAGG CGCTGGAGAC GGTGCAGCGG CTGTTGCCGG TGCTGTGCCA GGACCATGGC CTGACCCCGG ACCAGGTCGT GGCCATCGCC AGCAATGGCG GCGGCAAGCA GGCGCTGGAG ACGGTGCAGC GGCTGTTGCC GGTGCTGTGC CAGGACCATG GCCTGACCCC GGACCAAGTG GTGGCCATCG CCAACAATAA AGGCGGCAAG CAGGCGCTGG AGACGCTGCA GCGGCTGTTG CCGGTGCTGT GCCAGGCCCA TGGCCTGACC CCGGACCAGG TGGTGGCCAT CGCCAGCAAT GGCGGCGGCA AGCAGGCGCT GGAGACGGTG CAACGGCTGT TGCCGGTGCT GTGCCAGGAT CATGGCCTGA CCCCGGCCCA GGTCGTGGCC ATCGCCAGCA ATATTGGCGG CAAGCAGGCG CTGGAGACGG TGCAGCGGCT GTTGCCGGTG CTGTGCCAGG ACCATGGCCT GACCCCGGAC CAAGTGGTGG CCATCGCCAA CAATAACGGC GGCAAGCAGG CGCTGGAGAC GGTGCAGCGG CTGTTGCCGG TGCTGTGCCA GGACCATGGC CTGACCCCGG ACCAGGTCGT GACCATCGCC AGCAATATTG GCGGCAAGCA GGCGCTGGAG ATGGTGCAGC GGCTGTTGCC GGTGCTGTGC CAGGACCATG GCCTGACCCC GGACCAAGTG GTGGCCATCG CCAACAATAA CGGCGGCAAG CAGGCGCTGG AGACGGTGCA GCGGCTGTTG CCGGTGCTGT GCCAGGCCCA TGGCCTGACC CCGGACCAGG TCGTGGCCAT CGCCAGCAAT ATTGGCGGCA AGCAGACGCT GGAGACGGTG CAGCGGCTGT TGCCGGTGCT GTGCCAGGAC CATGGCCTGA CCCCGGACCA GGTCGTGGCC ATCGCCAGCC ACGATGGCGG CAAGCAGGCG CTGGAGACGG TGCAGCGGCT GTTGCCGGTG CTGTGCCAGG ACCATGGCCT GACCCTGGAC CAGGTGGTGG CCATCGCCAG CAATGGCGGC AAGCAGGCGC TGGAGACGGT GCAGCGGCTG TTGCCGGTGC TGTGCCAGGA CCATGGCCTG ACCCCGGACC AGGTCGTGGC CATCGCCAGC AATAGTGGCG GCAAGCAGGC GCTGGAGACG GTGCAGCGGC TGTTGCCGGT GCTGTGCCAG GACCATGGCC TGACCCCGAA CCAGGTGGTG GCCATCGCCA GCAATGGCGG CAAGCAGGCG CTGGAGAGCA TTGTTGCCCA GTTATCTCGC CCTGATCCGG CGTTGGCCGC GTTGACCAAC GACCACCTCG TCGCCTTGGC CTGCCTCGGC GGACGTCCTG CCCTGGATGC AGTGAAAAAG GGATTGCCGC ACGCGCCGGA ATTGATCAGA AGAATCAATC GCCGTATTCC CGAACGCACG TCCCATCGCG TTCCCGACCT CGCGCACGTG GTGCGCGTGC TTGGTTTTTT CCAGAGCCAC TCCCACCCAG CGCAAGCATT CGATGACGCC ATGACGCAGT TCGGGATGAG CAGGCACGGC TTGGTACAGC TCTTTCGCAG AGTGGGCGTC ACCGAATTCG AAGCCCGCTA CGGAACGCTC CCCCCAGCCT CGCAGCGTTG GGACCGTATC CTCCAGGCAT CAGGGATGAA AAGGGCCAAA CCGTCCCCTA CTTCAGCTCA AACACCGGAT CAGGCGTCTT TGCATGCATT CGCCGATTCG CTGGAGCGTG ACCTTGATGC GCCCAGCCCA ATGCACGAGG GAGATCAGAC GCGGGCAAGC AGCCGTAAAC GGTCCCGATC GGATCGTGCT GTCACCGACC CCTCCACACA GCAATCTTTC GAGGTGCGCG TTCCCGAACA GCGCGATGCG CTGCATTTGC CCCTCAGCTG GAGGGTAAAA CGCCCGCGTA CCAGGATCGG GGGCGGCCTC CCGGATCCTG GTACGCCCAT CGCTGCCGAC CTGGCAGCGT CCAGCACCGT GCTGTGGGAA CAAGATGCGG CCCCCTTCGC AGGGGCAGCG GATGATTTCC CGGCATTCAA CGAAGAGGAG CTCGCATGGT TGATGGAGCT ATTGCCTCAG TCAGGCTCAG TCGGAGGGAC GATCTGA
|
Protein sequence | MDPIRSRTPS PARELLPGPQ PDRVQPTADR GGAPPAGGPL DGLPARRTMS RTRLPSPPAP SPAFSAGSFS DLLRQFDPSL LDTSLLDSMP AVGTPHTAAA PAECDEVQSG LRAADDPPPT VRVAVTAARP PRAKPAPRRR AAQPSDASPA AQVDLRTLGY SQQQQEKIKP KVRSTVAQHH EALVGHGFTH AHIVALSQHP AALGTVAVTY QDIITALPEA THEDIVGVGK QWSGARALEA LLTEARELRG PPLQLDTGQL LKIAKRGGVT AVEAVHAWRN ALTGAPLNLT PDQVVAIASN IGGKQALETV QRLLPVLCQD HGLTPDQVVA IASNGGGKQA LETVQRLLPV LCQAHGLTPD QVVAIASNNG GKQALETVQR LLPVLCQDHG LTPDQVVAIA SNGGGKQALE TVQRLLPVLC QDHGLTPDQV VAIANNKGGK QALETLQRLL PVLCQAHGLT PDQVVAIASN GGGKQALETV QRLLPVLCQD HGLTPAQVVA IASNIGGKQA LETVQRLLPV LCQDHGLTPD QVVAIANNNG GKQALETVQR LLPVLCQDHG LTPDQVVTIA SNIGGKQALE MVQRLLPVLC QDHGLTPDQV VAIANNNGGK QALETVQRLL PVLCQAHGLT PDQVVAIASN IGGKQTLETV QRLLPVLCQD HGLTPDQVVA IASHDGGKQA LETVQRLLPV LCQDHGLTLD QVVAIASNGG KQALETVQRL LPVLCQDHGL TPDQVVAIAS NSGGKQALET VQRLLPVLCQ DHGLTPNQVV AIASNGGKQA LESIVAQLSR PDPALAALTN DHLVALACLG GRPALDAVKK GLPHAPELIR RINRRIPERT SHRVPDLAHV VRVLGFFQSH SHPAQAFDDA MTQFGMSRHG LVQLFRRVGV TEFEARYGTL PPASQRWDRI LQASGMKRAK PSPTSAQTPD QASLHAFADS LERDLDAPSP MHEGDQTRAS SRKRSRSDRA VTDPSTQQSF EVRVPEQRDA LHLPLSWRVK RPRTRIGGGL PDPGTPIAAD LAASSTVLWE QDAAPFAGAA DDFPAFNEEE LAWLMELLPQ SGSVGGTI
|
| |