Gene Xaut_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_1098 
SymbolhsdR 
ID5421511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp1266535 
End bp1269909 
Gene Length3375 bp 
Protein Length1124 aa 
Translation table11 
GC content64% 
IMG OID640880343 
Producttype I restriction enzyme EcoKI subunit R 
Protein accessionYP_001416004 
Protein GI154245046 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTCGG TCAACTTCGG CTTTCTCGAC GCGCACGACG CCAAGCTGTC GGCTTTGGGG 
GCACTCGGGG AACGCTATTT CCGAGAGGAC CCGTCTACGT CGATCGTGAA GCTGCGCCAG
TTCGCCGAGC TGACAGCCAA GATCATCGCC GCCCGTCATG CCGCCTATCG CGGGGAACGC
GAGACGTTCG ATGAGACGCT GAGGCGCCTA TCCTACGATC GTGTCATCCC CAAAGAAGTT
GCCGATGTGT TCCACGCCTT GCGCAAGGTG GGCAACGCCG CCGTGCACGA GGCCAAAGGC
GGCCATGCCG ACGCACTGAC AGCGCTGAAG CTCGCCCGCT CGCTCGGCGT CTGGTTCCAC
CGCACCTACG CGCGGCTGCC CAATTTCAAT CCCGGCCCTT TTGTGCCTCC GCCGGAGCCG
ACAGATGCCT CGGCGCCCCT GCGTGACGAG ATCGAGGCGC TACGCCGCAG AGTGGCCGAA
AGCGAAGATG CTGCCGCGGC TGCACGCCGG GAGGCCGAAG AGCATGCCCG CGCCCGCGAG
ACGCTGGATG AGCGACTGAG GCGGGAGGCC GAGGAGAGGG CGGCCTGGGA GCAACTCGCC
CAGGAGACGG AGGCTCAAAA GACCGCCATT GCCGAAAAGC TGGCGAGCCT TCAGTCGCAA
GCCGAGGCCG CCCCGCGACA AGAGGCAGTG GAGCTTGTGG AACGCGGCGA AGAGGCCGCC
GCCCGGCTTG AACTCGACGA AGCGGAGACC CGCGCCCTTA TCGACCAGCA GCTTCGGGAC
AGGGGCTGGG AGGCCGATAC GAAGGCCTTG CGCCACAGCG CGGGCGTTCG TCCGGCAAAG
GGCCGCAACA CGGCCATCGC CGAATGGCCG ACTGCAAACG GACGGGCTGA TTATGCCTTG
TTTGTCGGCA CCACTCTCGT TGGTGTGGTG GAGGCCAAGC GCCAGCGCAA GAGTGTCTCG
GGTGCCATCG ACCAATCCGA GCGCTATTCC ATCGGCATCC GCGATGGCGA CGAATTCATT
TATGCCGGCG GCCCGTGGAA CGACCACCGC GTGCCATTCG TCTTCGCGGC AAACGGCCGC
TCCTACCTGA AGCAGATCGA GACCGAGAGC GGCATCTGGT TTCGCGACAC GCGACGGAGC
GCGAACCACC GCCGTGCCCT GGTGGACTGG CCCACGCCCG AAGGCCTCAC CGGCCAACTG
GAGGTCGATC AAGACACGGC AGACGCTGCC CTCAATGCTC AGCCCTTCGT CTTCGGCTTC
CCTCTGCGCC ACTATCAGGA GAGCGCCATC CGGGCGGTGG AGGAGGCGCT GGCGAACGAG
CGTCGCTCCA TGCTGGTGGC CATGGCGACG GGGACCGGCA AGACCAAGCT CGCTATTGCC
CTGCTCTATC GGCTGTTGTC GGCCAAGCGC TTCCGCCGCA TCTGCTTCGT GGTGGACCGT
TCCGCGCTCG GGCACCAGAC CGAGGGCGAG TTCTCAACCA CCAAGGTTGT GTCGGGTAAG
ACGTTCGCGG AAATCTTCGG GCTGAAAGGC CTCGACGACG TGACACCCGA TCTGGAAACG
AAGGTCCACA TCTGCACCAT CCAGGGGCTG GTCAAGCGGG TGCTCTACGC GGCGGACGGG
TCCGAGGCGC CGCCTATCGA CCAGTATGAC CTCATGGTCA TCGACGAATG CCACCGCGGC
TACTTGCTCG ATCGTGAGCT GTCCGACCCC GAACTGGGCT TCCGGGACGA AACCGACTAC
ATATCCAAAT ATCGGCGCGT GCTGGAATAT TTCGACGCGG TGAAGATCGG CCTCACCGCC
ACCCCGGCAC TGCACACCAC GGACATCTTC GGCGAGCCGA TCTTTCGCTA TTCCTATCGT
GAGGCGGTTG TCGACGGCTT CCTCATCGAC CATGAGCCCC CGGTGCGGAT CGAGACCGCG
TTGGCGCGGG CCGGCATTGT CTTCGTGCGG GACGAGCAGC TCGACCTTCT GAACACCCGG
ACCGGAGAGG TCGACACCGC CACCCTCCCG GACGAAATCC GGTTCGAGGT GGAGCAGTTC
AACAAGCAGG TCATCACGCC GGAGTTCAAC CGAGTGGTCG CCGAGGAGCT GGCCCGCCAC
ATCGATCCCG CCTTCGAGGG CAAGACCCTG ATCTTCGCCG CCACCGATGC CCATGCGGAC
ATGGTGGTGA ACGCCATCAA AACCGCCTTT GCGAAGGCCT ATGGCGAGAT CGACGATGCG
GCGGTGAAGA AGATCACCGG CAAAGTGGAC AAGGTGCAGG ACCTTATCCG CTCATTCCGG
AACGACGCCA ATCCGAAGAT CGCGGTTACG GTGGACCTGC TCACCACCGG CATCGACGTG
CCGAAGATCG TGAACCTCGT CTTCCTGCGC CGGGTGAACA GCCGCATTCT CTATGAGCAG
ATGATCGGCC GGGCCACGCG CCGCTGCGAC GAGATCGGCA AGGAGGTTTT CCGCATCTTC
GACGCGGTGG ACCTCTATCC CCACCTTCAG AACCTCACGG ACATGAAGCC CGTGGTGGTC
AATCCATCCA TCAGCTTCGC CCAGCTCGTC GCGGAGATGC TCGGCGCCAC GGATGATGCC
CAGCGTGAAA CGATCCGCGA GCAAATCGCC GTCAAGCTGC GTCGTCGGCT GCGGAAGATG
CCCGAAGAAG CGCGGCAACG CTTCGAGGCG GTGGCGGGCG AGACGCCGCA GCAGATGCTC
GATCGCGTCC TGAACGGGGA CGCCCCCGCC CTCGCAGCTT GGCTCAAGGA TCACTCGGCC
ATCGGCCCGA TCCTCGATTG GCAAACGGAC GGCGGCACCC CACCGCTCAT TCCCATCTCA
CCCCATGCAG ACGAGGTCGT CGCCGTCACC CGTGGTTATG GTACCGCCGC ACGGCCCGAG
GATTTCCTCG ACAGCTTCAC CGCCTTCGTC CGCGACAACA TGAACACCAT CGCCGCGCTG
AAGCTGGTGG TGCAACGGCC CCGCGACCTC ACCCGCGCCG ACCTCAAGGA ATTGCGCCTC
GCCCTTGATC GCAAGGGCTA TTCGGAAGCG AACCTGCGCC GCGCCTGGGC CGATGCCAAG
AATGAGGAGA TCGCCGCCTC CATCATCGGC TTCGTGCGAC AGGCGGCGAT CGGCGACCCC
TTGGTGCCCT ACACCACCCG CGTGAAGGCG GCGATGCGCA CCATCCTGGC GAGCCGCGCC
TGGACGGAGC CGCAGAAGAC GTGGTTGAAG CGCATCGGCG AGCAGATCGA GAAGGAGGTC
GTGGTGGATC GCGAGGCCAT CGACACCGGC CAGTTCTCGT CCCATGGCGG GTTCAGCCGA
TTGAACAAGG TGTTCGGCGG GGAGCTTGAG AGCATCCTCG CTGGGATCAA CGAAAAGATG
TGGAGTGTTG CATGA
 
Protein sequence
MGSVNFGFLD AHDAKLSALG ALGERYFRED PSTSIVKLRQ FAELTAKIIA ARHAAYRGER 
ETFDETLRRL SYDRVIPKEV ADVFHALRKV GNAAVHEAKG GHADALTALK LARSLGVWFH
RTYARLPNFN PGPFVPPPEP TDASAPLRDE IEALRRRVAE SEDAAAAARR EAEEHARARE
TLDERLRREA EERAAWEQLA QETEAQKTAI AEKLASLQSQ AEAAPRQEAV ELVERGEEAA
ARLELDEAET RALIDQQLRD RGWEADTKAL RHSAGVRPAK GRNTAIAEWP TANGRADYAL
FVGTTLVGVV EAKRQRKSVS GAIDQSERYS IGIRDGDEFI YAGGPWNDHR VPFVFAANGR
SYLKQIETES GIWFRDTRRS ANHRRALVDW PTPEGLTGQL EVDQDTADAA LNAQPFVFGF
PLRHYQESAI RAVEEALANE RRSMLVAMAT GTGKTKLAIA LLYRLLSAKR FRRICFVVDR
SALGHQTEGE FSTTKVVSGK TFAEIFGLKG LDDVTPDLET KVHICTIQGL VKRVLYAADG
SEAPPIDQYD LMVIDECHRG YLLDRELSDP ELGFRDETDY ISKYRRVLEY FDAVKIGLTA
TPALHTTDIF GEPIFRYSYR EAVVDGFLID HEPPVRIETA LARAGIVFVR DEQLDLLNTR
TGEVDTATLP DEIRFEVEQF NKQVITPEFN RVVAEELARH IDPAFEGKTL IFAATDAHAD
MVVNAIKTAF AKAYGEIDDA AVKKITGKVD KVQDLIRSFR NDANPKIAVT VDLLTTGIDV
PKIVNLVFLR RVNSRILYEQ MIGRATRRCD EIGKEVFRIF DAVDLYPHLQ NLTDMKPVVV
NPSISFAQLV AEMLGATDDA QRETIREQIA VKLRRRLRKM PEEARQRFEA VAGETPQQML
DRVLNGDAPA LAAWLKDHSA IGPILDWQTD GGTPPLIPIS PHADEVVAVT RGYGTAARPE
DFLDSFTAFV RDNMNTIAAL KLVVQRPRDL TRADLKELRL ALDRKGYSEA NLRRAWADAK
NEEIAASIIG FVRQAAIGDP LVPYTTRVKA AMRTILASRA WTEPQKTWLK RIGEQIEKEV
VVDREAIDTG QFSSHGGFSR LNKVFGGELE SILAGINEKM WSVA