Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xfasm12_2267 |
Symbol | |
ID | 6120031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xylella fastidiosa M12 |
Kingdom | Bacteria |
Replicon accession | NC_010513 |
Strand | - |
Start bp | 2403935 |
End bp | 2407006 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641650206 |
Product | type I restriction-modification system endonuclease |
Protein accession | YP_001776746 |
Protein GI | 170731313 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAGGA CTGACACCAG TGAGCGCTGG TTCGAGGCGC GCGTGGTGCG TGGCCTGACT GGCGTGCCGC AGCCCGAGTA CAGCCATGAG CTGGCGCCCA CGGACTTTGC CGCCACCCAC AACGGCTATG TGCAGGGCAA GCCCACTGAC TACAACCGCG ACGTGGCGTT GGATGTGACG CAGTTGCTGG CCTTCTTGCA GGCCACTCAG CCGAAGGTGG TGGAGACATT GGAGCTGGCG GCGGACGGTA TCAAACGCAC TCAGTTTCTG CACCGCCTGC AAGGGGGGAT CACTAAGCGC GGCGTGGTGG ACGTGCTGCG CAAGGGCGTA AGCCACGGGC CGGTACACGT GGATCTGTAC AAGTTGCTGC CCACGCCGGG CAATGCCGCT GCGGCGGATG CCTTCGGCAA GAACATGTTC AGTGTCACTC GGCAGGTGCG CTACAGCAAT GATTCCGGCA ATGAACTGGA CTTGGTGATC TTCATCAACG GCCTGCCGGT GTTGACCTTT GAGTTGAAGA ATTCGCTGAC CAAGCAGACT CTGGCCGACG CCATCGTCCA GTATCAGACC ACGCGTAGCC CGAAGGAGTT GCTGTTCCAG TTGGGCCGTT GCGTGGCGCA CGTCGCGGTC GATGATGCGG AGGCCGCGTT CTGCACCGAG CTGAAAGGCA AGGCGTCGTG GTTCCTGCCG TTCAATCAGG GCTGGAACAG CGGTGCGGGC AATCCGCCCA ACCCGGATGG TCTGAAGACC GACTACCTGT GGAAGCAGGT GCTGACCTGC GAGTCGCTGG CCAACCTCAT CGAGAGCTAC GCGCAGGTGG TGGAAGAGGA AGAAGCGGAT GCCAGCGGCA AGAAGCGTAA GACGCGTAAG CAGATCTTCC CGCGCTTCCA TCAGTTGCGC ACGGTACGTG CTCTGCTGCG CCGGACCCGT ACCGACGGAG TAGGCAAGCG TTATTTGATC CAGCATTCGG CAGGCAGCGG CAAGAGCAAC ACGATTGCGT GGCTAGCGCA CCAGCTGGTG GAGCTGCGCC GCAAGGACGA CCCACTGAGG GTGCAGTTCG ATTCCATCAT CGTCATCACC GACTGGCGCA CGCTGGACAA GCAGATCGCC GACATCATCA AGGGGTACGA CCACGTGGCG GCGATCTTCG GCCACTCCGA CAACGCGCAG GAGCTGCGCG AGTATCTGCG CCGGGGCAAG AAGATCATTG TCACCACGGT GCAGAAGTTC CCGTTCATCC TTGATGAGCT GGGCGATCTC TCTGGCAAGA GGTTCGCGCT GTTGATCGAT GAGGCGCATT CCAGCCAGGG CGGCAAGACC ACGGCGCGGA TGCACGAAGC TCTCGGCGGC AAGGTGGCCG AGGAGGAGTT CGAGGAGGAC AGTACGCAGG ATGCGGTCAA CGCGGAAATC GAGAGGCGCA TCGCCTCGCG CAAGCTGCTC GCCAACGCCA GCTACTTCGC GTTCACGGCC ACACCCAAGA ACAAGACGCT GGAGTTGTTC GGTGAGAGGA CTCTTGTTGG CGACAAGGTG CAGTTCCGCT CGCCTGAAGA GCTGACCTAT ACCACGAAGC AGGCGATCCA GGAGAAGTTC ATCCTCGACG TGGTGGAGAA GTACACCCCC TACGACAGTT TCTATCAAGT CGCCAAGACG GTGGCGGACG ATCCGGAATT CGACAAGGTG AAGGCGCTGA AGAAGATCCG CCTCTACGTC GAGTCGCACG ACAAGGCGAT CCGCCGCAAG GCCGAGATCA TGGTGGATCA CTTCATCGCA TGTGTCGCAG GCAAGCAGAT AATTGGTGGC CAGGCGCGGG CGATGATTGT GTGCAACGGC ATCGCGCGGG CCATTGATTA CTGGCGTGAG GTGTCGGACT ACCTCACGCA GATCAAAAGC CCGTACAAGG CCATCGTGGC GTACTCGGGC AGCTTCGAGA TTGGCGGGCA GAAGAAGACG GAGGCTGATC TCAATGGGTT TCCGAGCAAG AAGATTCCGG AGAATCTCAA GAAAGACCCG TATCGCTTTC TGATCGTTGC CAACAAGTTT GTCACCGGCT TCGATGAGCC TTTGTTGCAC ACCATGTACG TGGATAAGCC GCTGGCGGGC GTGCTGGCGG TTCAGACTCT GTCACGTCTG AACCGCGCGC ACCCGCAAAA GCGCGATACG TTCGTGCTCG ACTTTGCCGA CAACGCCGAG GCGGTGAAGG CGGCGTTCCA GGACTACTAC CGCGCCACCA TCCAGATGGG CGAAACCGAC CCCAACAAGC TGCATGATCT GAAGGCCGAA CTCGATGGGC AGCAGGTGTA TAGCTGGCCG CAGGTGGAAG ATTTGGTGGC GCTGTACGTG AGTGGCGCGG ATCGGGACAA GCTTGACCCC ATCTTGGATG TGTGCGTGGC CGAATACACC GACAGGCTCG GCGAGGACGA TCAGGTCAAG TTCAAGGGCA AGGCCAAGGC GTTCGTGCGC AGCTACGGCT TCTTTGCTGC GATCTTGAGT TACGGTCATC CTGGTTGGGA AAAGCTGTCG ATTTTTCTGA ACTTTCTCAT CCCCAAGCTG CCTGCGCCCA AGGAGGAGGA CTTCTCCAAG GGGGTGCTGG AGACCATCGA TATGGACAGC TACCGGGTGG AGGCCAAGGC GGCGCTGAAG ATGGCGATGG ACGACGCAGA TGCCTCTGTC GAGCCAGTGC CTTCGGGAGG CGGTGGCGGC AAGGGTGAGG CAGTCATCGA TAGACTCTCG GTGATCATCA AAACCTTCAA CGATCTGTTC GGCAACATCC AGTGGAAGGA CGAGGACAAG ATCCGCAAGG TCATCGCCGA GGAAATTCCG GCGCGGGTGG CGCAGGACAA GGCTTACCAG AATGCGCAGG TGAATTCCGA CAAGCAAAAC GCCAAGCTGG AGCACGACAA GGCGCTCAAT CGCGTGGTGC TGGAGTTGTT GTCTGACCAC ACTGAGCTGT TTAAGCAGTT CAGTGATAAC CCTAACTTCA AGCGCTGGCT GACGGACACG GTGTTTGATG CGACTTATCA GACTGGGGCG GTTCCTCCGA AGGTACCGCC ACAGGTGGGG GCGTCGGCAT GA
|
Protein sequence | MTRTDTSERW FEARVVRGLT GVPQPEYSHE LAPTDFAATH NGYVQGKPTD YNRDVALDVT QLLAFLQATQ PKVVETLELA ADGIKRTQFL HRLQGGITKR GVVDVLRKGV SHGPVHVDLY KLLPTPGNAA AADAFGKNMF SVTRQVRYSN DSGNELDLVI FINGLPVLTF ELKNSLTKQT LADAIVQYQT TRSPKELLFQ LGRCVAHVAV DDAEAAFCTE LKGKASWFLP FNQGWNSGAG NPPNPDGLKT DYLWKQVLTC ESLANLIESY AQVVEEEEAD ASGKKRKTRK QIFPRFHQLR TVRALLRRTR TDGVGKRYLI QHSAGSGKSN TIAWLAHQLV ELRRKDDPLR VQFDSIIVIT DWRTLDKQIA DIIKGYDHVA AIFGHSDNAQ ELREYLRRGK KIIVTTVQKF PFILDELGDL SGKRFALLID EAHSSQGGKT TARMHEALGG KVAEEEFEED STQDAVNAEI ERRIASRKLL ANASYFAFTA TPKNKTLELF GERTLVGDKV QFRSPEELTY TTKQAIQEKF ILDVVEKYTP YDSFYQVAKT VADDPEFDKV KALKKIRLYV ESHDKAIRRK AEIMVDHFIA CVAGKQIIGG QARAMIVCNG IARAIDYWRE VSDYLTQIKS PYKAIVAYSG SFEIGGQKKT EADLNGFPSK KIPENLKKDP YRFLIVANKF VTGFDEPLLH TMYVDKPLAG VLAVQTLSRL NRAHPQKRDT FVLDFADNAE AVKAAFQDYY RATIQMGETD PNKLHDLKAE LDGQQVYSWP QVEDLVALYV SGADRDKLDP ILDVCVAEYT DRLGEDDQVK FKGKAKAFVR SYGFFAAILS YGHPGWEKLS IFLNFLIPKL PAPKEEDFSK GVLETIDMDS YRVEAKAALK MAMDDADASV EPVPSGGGGG KGEAVIDRLS VIIKTFNDLF GNIQWKDEDK IRKVIAEEIP ARVAQDKAYQ NAQVNSDKQN AKLEHDKALN RVVLELLSDH TELFKQFSDN PNFKRWLTDT VFDATYQTGA VPPKVPPQVG ASA
|
| |