Gene Rxyl_2064 details

Gene Information

Locus tag: Rxyl_2064
Symbol:
ID: 4115776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 2094156
End bp: 2095766
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 67%
IMG OID: 638036851
Product: malate synthase
Protein accession: YP_644821
Protein GI: 108804884
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01344] malate synthase A
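
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS
    ends with exactly one stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Values taken from the record above.
gene_len, protein_len = cds_lengths(2094156, 2095766)
print(gene_len, protein_len)  # 1611 536
```

Under those assumptions the result agrees with the listed Gene Length (1611 bp) and Protein Length (536 aa).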


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.889002
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACGAA AGCGTTCCTT CCCCGAGGGC GTGGAGTTCA CCGCTCCCAT CCCTGAGGAG 
TATGCGGACG TGCTCTCGCC CGACGCGGTG AGCTTTGTGG CCGGGCTGGC CCGGGAGTTC
ACGGGCCGGG TGGACGAGAT TCTGGCGGCG CGGCAGGAGC GTCAGCTGCG GATCAACGCC
GGCGAGATGC CGGACTTCCC CTCGGAGACG CGGGAGGTGC GGGAGTCCGA GTGGCGGGTG
GCGCCGGCGC CGCCCGACCT GCAGGACCGC AGGGTGGAGA TCACCGGCCC GCCGGACAGG
AAGATGCTGA TCAACGCGCT GAACTCCGGG GCCAGCACCT ACATGACCGA CCTGGAGGAT
GCCAACTGCC CCACCTGGCG GAACATGCTG GAGAGCCAGT ACAACATCCG GGACGCGGTG
AGGGGGACCA TCACCTACGA CGACCCCAAC ACCGGCAAGC ACTACGAGCT GGGGGAGGAG
CTGGCCACCC TCATCGTCCG GCCGCGGGGC TGGCATCTCT TCGAGAAGCA CATGCTGGTG
GACGGCCGGC AGGTACCGGG GGCCCTCTTC GACTTCGGGC TGGCGTTCTT CCACAACGCG
GGGCGGCTCA TCGAGAACGG CAGCGGCCCC TACTACTACC TGCCCAAGCT CGAGGGCTAC
CGGGAGGCCC GGCTCTGGAA CGACGTCTTC AACATGGCGC AGGACGAGCT CGGCATCCCG
CGGGGGACCA TCAGGGCCAC CGTGCTGGTG GAGACCATCC TGGCCACCTT CGAGATGGAC
GAGATCCTCT ACGAGCTGCG CGACCACTCC TCGGGCCTCA ACGCCGGGCG CTGGGACTAC
ATCTTCAGCT ACATAAAGAA GTTCCGCGAG CACGAGGACC GGCTGCTGCC CGACCGGGCG
CAGGTGACCA TGACCGTGCC GTTCATGCGC GCCTACACCC AGCTTCTGGT GAAGGTCTGC
CACCGGAGGG GGGCGCACGC CATAGGCGGG ATGGCCGCCC AGATACCGGT CAAGGACGAC
CCGAAGAAGA ACGAGGAGGC CTTCGCCAAG GTCCGCGCCG ACAAGGAGCG CGAGGCCCGC
GACGGGCACG ACGGCACCTG GGTGGCCCAC CCGGCGCTCG TCCCGGTCGC CAAGGAGGTC
TTCGACGAGT ACATGCCGCA GCCCAACCAG ATCGAGACCA AAAAGCGCGA GGACGTGCAC
GTCACGGCCG AAGACCTGCT GGAGCGCCCG GAGGGCACCA TCACCATGGA CGGCTTCCGC
AACAACATCA GCGTGGGCAT CCAGTACCTC GGCGCCTGGT TCTCCGGCCG CGGCGCGGTG
CCCGTCTTCA ACCTCATGGA GGACACGGCG ACCGCCGAGA TCAGCCGGGC CCAGGTGTGG
CAGTGGATCC ACCACCCCAA GGCCGTCCTC GACGACGGCA CCAAGGTCAC GAAGGAGCTC
TTCCACAGGG TGATGGCCGA GGAGGTCGAG AGGATCCGGG AGGAGATCGT CGGCCCCGAG
CGCTTCCGGC GCGACCGCTT CGATACCGCC ATCGAGTTCT TCGACCGGAT CTCCACCCAG
GACGAGTTCG TCGAGTTCCT GACCCTGCCC GGTTACGACT ACCTGGAGTA G
 
Protein sequence
MARKRSFPEG VEFTAPIPEE YADVLSPDAV SFVAGLAREF TGRVDEILAA RQERQLRINA 
GEMPDFPSET REVRESEWRV APAPPDLQDR RVEITGPPDR KMLINALNSG ASTYMTDLED
ANCPTWRNML ESQYNIRDAV RGTITYDDPN TGKHYELGEE LATLIVRPRG WHLFEKHMLV
DGRQVPGALF DFGLAFFHNA GRLIENGSGP YYYLPKLEGY REARLWNDVF NMAQDELGIP
RGTIRATVLV ETILATFEMD EILYELRDHS SGLNAGRWDY IFSYIKKFRE HEDRLLPDRA
QVTMTVPFMR AYTQLLVKVC HRRGAHAIGG MAAQIPVKDD PKKNEEAFAK VRADKEREAR
DGHDGTWVAH PALVPVAKEV FDEYMPQPNQ IETKKREDVH VTAEDLLERP EGTITMDGFR
NNISVGIQYL GAWFSGRGAV PVFNLMEDTA TAEISRAQVW QWIHHPKAVL DDGTKVTKEL
FHRVMAEEVE RIREEIVGPE RFRRDRFDTA IEFFDRISTQ DEFVEFLTLP GYDYLE
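
The sequences above are printed in blocks of 10 residues, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction for the nucleotide sequence; the helper names `join_blocks` and `gc_fraction` are hypothetical, and this is not necessarily how the GC content value in the record was derived.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and newlines) from a blocked sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGGCACGAA AGCGTTCCTT CCCCGAGGGC GTGGAGTTCA CCGCTCCCAT CCCTGAGGAG"
last_line = "GACGAGTTCG TCGAGTTCCT GACCCTGCCC GGTTACGACT ACCTGGAGTA G"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(len(head))               # number of bases on a full line
print(head.startswith("ATG"))  # does the CDS begin with an ATG start codon?
print(tail[-3:])               # final codon of the CDS
print(gc_fraction(head))       # GC fraction of the first line only
```

To work with the whole gene, paste all sequence lines into one string and pass it to `join_blocks` before calling `gc_fraction`.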