Gene Xfasm12_2271 details


Gene Information

Locus tag: Xfasm12_2271
Symbol:
ID: 6120035
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xylella fastidiosa M12
Kingdom: Bacteria
Replicon accession: NC_010513
Strand:
Start bp: 2411476
End bp: 2414505
Gene Length: 3030 bp
Protein Length: 1009 aa
Translation table: 11
GC content: 51%
IMG OID: 641650210
Product: type I restriction-modification system endonuclease
Protein accession: YP_001776750
Protein GI: 170731317
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
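
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon excluded from the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(2411476, 2414505)
print(gene_len, protein_len)  # 3030 1009
```

Under these assumptions the result agrees with the Gene Length (3030 bp) and Protein Length (1009 aa) fields.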


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAAT ATATCCCGCC CTTTCGTTAT GGACCCATTG TGTGCTCCGA GCAGAGCACC 
ATCGTTGCCG AGTTTGTACC CGAGGCCTCG GGTGTGCGTG AGGCGGGATA TCAGTCCGAA
GCGGAGTTGG AAGCAGCTTT CATCAAGCAG TTGCAGGGGC AGGCCTACGA ATATCTGCCG
ATCACCTCGG AGGCCGCGTT GCTGGCCAAT CTTCGGCGCC AGTTGGAAAA GCTCAACAAG
GTCACGCTGT CGGATGCAGA GTGGGAAGGT TTTTTCAGTA CCTGCATCGC GGGTGCACAT
GACGGCATTG TGCAAAAAAC GGCACGCATT CAAGAAGACC ATGTGCAGGT TTTGAAGCGC
GACGATGGCA CCACTAAAAA CATGTGTCTG CTTGACAAGG TCAACATCCA CAACAACACC
TTGCAGGTGA TTAACCAGTA CGCAGTGGAG GGCGCTCGTG CCAATCGCTA TGACGTCACG
GTGTTGGTGA ACGGTTTGCC GATGGTGCAT GTTGAGTTGA AGCGCCGGGG CGTGGACATT
CGCGAGGCTT TCAATCAGAT CAACCGTTAT CAGCGTGATA GCTTCTGGGC GGGTTCGGGG
TTGTTCGATT ATGTGCAGCT TTTTGTCATC AGCAATGGCA CGTTGACCAA GTATTACAGT
AATACGGTGC GTGCCGGGCA TGTGGAGGAG CAATCCAACA AGCGCAGTAA GAGTAAGACT
TCCAACAGTT TCTCTTTTGC CATTTGGTGG GCAGATGCGA AAAATCAGCC GATCACCGAG
TTGACTAGTT TTACCAAAAC GTTTTTCGCC AAGCATTCGC TGCTTAACGT GCTGACCAAA
TACTGCGTGT TTAATGCCAA TCGTAAGTTG TTGGTGATGC GGCCTTATCA GATTGTGGCT
GCCGAGCGCA TTTTGCAACG CATTGTCACC GCCACACATC AGCAGCAGTT GGGTACGGTG
GCGGCAGGTG GCTATATTTG GCACACCACG GGCAGCGGCA AAACGCTGAC TAGTTTCAAG
GTGGCGCAAT TGGCATGCGG TCTGGGCGTG ATCGACAAGG TGTTGTTTGT GGTGGATCGT
AAGGACTTGG ACTACCAGAC GATGCGTGAG TACGAGTCTT TTGAGAAAGG TGCGGCGAAC
TCCAACACGT CCACGGCGGT ATTGCAACGA CAGTTGGAAG ATTCCGACGT CCGCATCATC
ATCACCACCA TTCAGAAGCT TTCTCGCTTT GTCGCCAAGC ACAAGCAGCA CCCTGTGTAT
GGGGCGCATG TGGTGGTGAT TTTTGATGAG TGTCATCGTA GTCAATTCGG CGATATGCAC
ACCAAAATCA CCAGGGAGTT TCAGCGTTAT CACTTGTTTG GTTTCACTGG TACGCCCATT
TTTGCTGAGA ATGCAGGCAG TACACAGAAC CCGATGCGGC GCACGACGCA GCAGACGTTT
GGCGATACTT TGCATACGTA TACGATTGTT GATGCGATCA ATGATAAGAC GGTGTTGCCA
TTTCGCATTG ATTACATCAA TACGATCAAA TCACAGCCGA ATATCAAAGA TAAGAAGGTG
GCAGCTATCG ACACCGAGCG GGCGTTGTTG GCACCAGAGC GGATCAGGCA GATTGTGAGC
TATATCCGGG AGCACTTCGA CCAAAAAACC AAGCGTGCCA GCATCTACCG TCACGAGGGT
AAGCGACTGG CGGGGTTTAA TTCGCTGTTG GCCACTGCTT CCATTGATGC GGCTAAACGT
TATTACGCCG AATTTATGGC GCAGCAAAAG GAGCTGCCCG AGGCACGACG GTTGAAGGTG
GGGTTGATCT ACAGTGTTGC TGCCAATGAG GGGGGAGGTG ATGGGGGGCT GGGTGAGGAG
GAGTTTGAAA CTGAGGGCTT GGATCAAGAT TCGCGGAATT TTCTGGAAGC CGCGATCCAG
GATTACAACA GGTTTTTTGA TCCCAATTCC AGTTTTGATA CGACGGAAGA TAAGTTCCAA
AATTATTACA AGGATGTGTC GAAACAGCTG AAAAACCGTA AGTTGGACAT TCTCATCGTG
GTCAATATGT TCCTGACCGG GTTTGACGCG AGCACGCTCA ACACGCTGTG GGTGGACAAG
CGTCTGAAGG CGCACGGTTT GATACAGGCT TATTCGCGCA CCAATCGCAT TCTCAATTCG
GTTAAGAGCT ACGGCAACAT CGTTTCTTTC CGCAATCTGG AGCAGGAGAC GAATGAGGCG
CTGGCTTTGT TTGGCAACAA GGACGCCAAG GGCATCGTGT TGTTGCGGCC TTACGCTGAG
TATTACAAGG AGTATGAGGA GCGGGTGGGG GAGCTGGTGG CCGCATTTCC GCTGGGGAAG
GCGATCGTCG GGGAGGCGGC ACAAAAGGCA TTTATCACGT TATTCGGTTC GATATTGAGG
CTGAAAAATA TTCTGACTGC TTTTGATGAT TTCGGTGGTC AGGAAATTCT CACGGAGCGG
GAGTTTCAGG ATTATCAGAG TCTGTATTTG AATCTGTATG CGGAATTCCG CAGTGCATCG
GCTGCTGAGA AGGAATCGAT TAATGATGAT GTGGTCTTCG AGATTGAACT CATCAAGCAG
GTGGAAATCA ATGTTGATTT CATTTTGTTG TTGGTCGAGC AGTATGTGAA GAAAAAGGGG
ACGGGGGAGG ATAAGGACAT CCGTGCGACG ATCGAGCGTG CCATTAATTC CAGCCCCAGT
TTGCGTAATA AGAAAGATTT GATTGAGCAG TTTGTCGATT CGGTGAGCAT GAAGGCCAAG
GTGGATGCGC AGTGGCAGGC TTTTGTGGCG GTGAAGCAGA GCCAGGAGTT GGAGGGCATC
ATCGCGGAGG AGAATCTCAA CGCCGAGGCG GCACGTGAGT TCATTGGAAA TGCGTTCCGC
GATGGCAGCA TCCCTGTCAC AGGCACGGCA ATCACCAAAG TCTTGCCGCC GGTATCGCGA
TTTTCCAAGA ACAACGGCCA TGCAGCCAAG AAGCAGGCGG TGCTGGACAA GCTTGCTATC
TTCTTTGAGC GATATTTTGG GCTGGTTTGA
 
Protein sequence
MSEYIPPFRY GPIVCSEQST IVAEFVPEAS GVREAGYQSE AELEAAFIKQ LQGQAYEYLP 
ITSEAALLAN LRRQLEKLNK VTLSDAEWEG FFSTCIAGAH DGIVQKTARI QEDHVQVLKR
DDGTTKNMCL LDKVNIHNNT LQVINQYAVE GARANRYDVT VLVNGLPMVH VELKRRGVDI
REAFNQINRY QRDSFWAGSG LFDYVQLFVI SNGTLTKYYS NTVRAGHVEE QSNKRSKSKT
SNSFSFAIWW ADAKNQPITE LTSFTKTFFA KHSLLNVLTK YCVFNANRKL LVMRPYQIVA
AERILQRIVT ATHQQQLGTV AAGGYIWHTT GSGKTLTSFK VAQLACGLGV IDKVLFVVDR
KDLDYQTMRE YESFEKGAAN SNTSTAVLQR QLEDSDVRII ITTIQKLSRF VAKHKQHPVY
GAHVVVIFDE CHRSQFGDMH TKITREFQRY HLFGFTGTPI FAENAGSTQN PMRRTTQQTF
GDTLHTYTIV DAINDKTVLP FRIDYINTIK SQPNIKDKKV AAIDTERALL APERIRQIVS
YIREHFDQKT KRASIYRHEG KRLAGFNSLL ATASIDAAKR YYAEFMAQQK ELPEARRLKV
GLIYSVAANE GGGDGGLGEE EFETEGLDQD SRNFLEAAIQ DYNRFFDPNS SFDTTEDKFQ
NYYKDVSKQL KNRKLDILIV VNMFLTGFDA STLNTLWVDK RLKAHGLIQA YSRTNRILNS
VKSYGNIVSF RNLEQETNEA LALFGNKDAK GIVLLRPYAE YYKEYEERVG ELVAAFPLGK
AIVGEAAQKA FITLFGSILR LKNILTAFDD FGGQEILTER EFQDYQSLYL NLYAEFRSAS
AAEKESINDD VVFEIELIKQ VEINVDFILL LVEQYVKKKG TGEDKDIRAT IERAINSSPS
LRNKKDLIEQ FVDSVSMKAK VDAQWQAFVA VKQSQELEGI IAEENLNAEA AREFIGNAFR
DGSIPVTGTA ITKVLPPVSR FSKNNGHAAK KQAVLDKLAI FFERYFGLV
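
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used to produce this record. It builds the codon table from the compact NCBI layout (bases ordered T, C, A, G). The amino acid assignments of translation table 11 are the same as those of the standard code, since the two tables differ only in which start codons they allow, so one table serves here. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Compact codon layout: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
first_groups = "ATGAGTGAAT ATATCCCGCC CTTTCGTTAT"
print(translate(first_groups))  # MSEYIPPFRY
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should give the complete protein, with the terminal TGA codon read as the stop.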