Gene Rxyl_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_2233 
Symbol 
ID4115184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2243219 
End bp2244853 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content59% 
IMG OID638037020 
Producthypothetical protein 
Protein accessionYP_644983 
Protein GI108805046 
COG category 
COG ID 
TIGRFAM ID[TIGR02987] type II restriction m6 adenine DNA methyltransferase, Alw26I/Eco31I/Esp3I family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAGC GTAACTCGAA AGACTTGTTA ACTAGGGCTA CCGGACGTTT TTATACTCAT 
GAGTTCATCG GGCGACGCCT AGCGGATGTG GTCGTGCGGA CAGCCCAACT GGAGGGCCGC
CGTTCAGTCA GCGTGATCGA CCCGTTCTGT GGCGACGGCC GGCTTATCGG CTGGTTCCTT
GAGGCTGTCT CAAAGAGAGA CGGGGCGCGT CCGCGGATGT GGAGCGTCGA GCTCTGGGAC
TGCGACGCAG CGGCGCTCGA CACAGCGAGA AAGCGGGTGC TTGAGACAGC TGCTCAGCTC
GGCGTGGAGG TCGAGGTAGA AACTGTCCTA GGCGACACCT TTGCTCACGC TCCGAGTCGC
TTCGGAGAGT TCGTGGTGTG CCTCACTAAT CCACCGTGGG AGCTCCTCAA GCCTGACCGT
CGGGAGCTAG ACCAGCTCGA CGAGGACGTT GCCAGAGAGT ACGTGGCGCA ACTCCGGCAG
CAGAGTTCGA GGCTAGTCGA GTATTATCCG CTGTCGGCAC CCCGACGTAG ATATGCGGGG
TGGGGCATGA ACTTGGCCAG AGCAGGTACG GAGGTGGCTT TGCGACTCAC CGCCAACGAT
GGTGTGTGCG GTGTGGTGTC CCCGGCATCG CTCCTAGCGG ATCAGATGTC CGAGTCCTTG
CGCCGCTGGG TATTCGAAGA ATATGTGGTG CACGACATCG CCTACTACGT GGCAGAGGCG
AAGCTCTTCG ATGAGGTGGA TCAACCTAGC GTGACACTAG TCGTGTCCCC AGGCGTCACG
AGTCATAGCC CTACAATGCT TGTCGTTCAC GATCGAGCTC TCCGGGGTAA AGAGGTGCGC
TTCAGTGAGC AAGAGTGGCA CTCTCTCCGG TCAGACGGTT ATGTCTTCCC TCTGCAGTTC
GGGCCAGAAC TGCTGGGACT CCGGTCGAAG TGGGACAACC TACCAAGCTT CGCGGATCTT
GAGGGAGAGT CACAAGACGA TCTTTGGGCT GGTCGTGAGT TGGATGAAAC TGGTCACAGG
CGCTTCCTTG GCGACAAAGG AGACTACCTC TTCGTCAAAG GACGTATGGT GAGTCGCTTC
AGGATGTGCG AAGCTCCGAG CCAATACGTT AATCGCGACG GGCCTCGTAT TCCATCTTCA
GCGGATCACT ACCGACTCGC GTGGCGCGAC ATCTCGCGAC CTAACCAGAG ACGCAGAATC
CAAGCCGCTA TAGTTCCTCC CGGTATGGTC ACAGGTAACT CACTGCATGT CGCTTACTTC
AGGGACGACG ACCTTGAGAG ACTGAAGGCC CTCCTTGCGG TTATGAACTC CTTCGTCTTT
GAGGCTCAGG TACGCATGCG CTTAGCCACG GCTCACGTCT CGCTCGGCGT GGTCAGAAAG
GCCCGTGTAC CACCGTTAAA TGACAGGGGC CTAACGGCCG AGCTTGCTCA ACTGGTGGCC
CGCTGCGAGA AAGGTGACGA GGAGGCACTT ACCACAGTAG AGGTGCGGGT GGCCCAGTTA
TACGGGTTGT CACACGAGGA CTTCGCGCTC CTACTCTCGG CGTTCGAGAA GGTAGAGGAG
GAAGAGAAGA GGGCCTTGCT CACTAGCCCC GCCTGGCGCT GTCCCTCTGT AGGCTCACAG
CAACTCGCGA GGTGA
 
Protein sequence
MRKRNSKDLL TRATGRFYTH EFIGRRLADV VVRTAQLEGR RSVSVIDPFC GDGRLIGWFL 
EAVSKRDGAR PRMWSVELWD CDAAALDTAR KRVLETAAQL GVEVEVETVL GDTFAHAPSR
FGEFVVCLTN PPWELLKPDR RELDQLDEDV AREYVAQLRQ QSSRLVEYYP LSAPRRRYAG
WGMNLARAGT EVALRLTAND GVCGVVSPAS LLADQMSESL RRWVFEEYVV HDIAYYVAEA
KLFDEVDQPS VTLVVSPGVT SHSPTMLVVH DRALRGKEVR FSEQEWHSLR SDGYVFPLQF
GPELLGLRSK WDNLPSFADL EGESQDDLWA GRELDETGHR RFLGDKGDYL FVKGRMVSRF
RMCEAPSQYV NRDGPRIPSS ADHYRLAWRD ISRPNQRRRI QAAIVPPGMV TGNSLHVAYF
RDDDLERLKA LLAVMNSFVF EAQVRMRLAT AHVSLGVVRK ARVPPLNDRG LTAELAQLVA
RCEKGDEEAL TTVEVRVAQL YGLSHEDFAL LLSAFEKVEE EEKRALLTSP AWRCPSVGSQ
QLAR