Gene Rxyl_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_2052 
Symbol 
ID4115764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2080376 
End bp2081947 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content70% 
IMG OID638036839 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_644809 
Protein GI108804872 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAGA AGGTCTACCG CACGGAGCTC ACGCCGGTGA GCTTTCTGCG CCGCAGCGCC 
TTTATGTTCC CGGAGAAGAC CGCCGTGGTC TACGGCGACA GGCGCTACAC CTACCGGGAG
TTCGAACGGC GGGTCGACCG GCTCGCCTCC GGCCTGAGGG AGGCGGGCCT GCGCGGGGGC
GACCGGGTGG CGTTCATCTG CCCCAACACC CCGCCGCTGC TCGAGGCCCA CTTTGCGGTC
CCCGCCGCCG GGGGCGTGCT CGTGGCCATA AACACCCGCC TGAGCCCGGA GGAGGTCGGG
TACATCCTCG AGCACTCGGG GGCCCGGTTC GTCTTCGCCG ACGCCGGGCT CGAGCACCTC
GCCTCCGGCG CGGAGGCGCA GCGGGTGCGC ATAGACGACA CCGGGGCGGA GGGCGACCCC
TACGAGGACT TCCTCGCCGC CGCCCCGCCG GAGCCGCCGG AGAGCCCCCT CAAGGACGAG
GAGGAGACCA TCTCCCTCAA CTACACCTCC GGCACCACCG GCAGGCCCAA GGGCGTGATG
TACAGCCACC GGGGGGCGTA CCTCTGCGCC CTCGGCAACG TTATCGAGGC CGGGATGGGC
TACGAGACCC GCTACCTGTG GACCCTCCCC ATGTTCCACT GCAACGGGTG GACCTACCCC
TGGGCCGTAA CGGCGGTGGC CGGGACCCAC GTCTGCCTGC GGCGGGTGGA GCCCGGGCGC
ATCTGGAGGC TCTTCAAAGA GGAGGGCATA ACCCACTACT GCGCCGCCCC CACCGTGCAG
GTCGGGATCA TAAACGACGA GGCGGCGCAC CGGCTGCCGC GCCCGGTGCG GGCCATGATC
GCGGGGGCCC CGCCCTCCCC CACCCTGATA GCCGGCCTCG GCGACCTCAA CATAGACCCG
GTGCACATCT ACGGCCTCAC CGAGACCTAC GGCCCCATCA CCACGAGCGC CCCCCGCAAG
GAGTGGGAGG AGCTGCCGGC GGAGGAGCGG GCCCGCCTGC TGGCCCGCCA GGGCAACGCC
TACGTCACCG CGGACATAGT GCGCGTGGTG GACGAGAACC TGCAGGACGT GCCCCGCGAC
GGGGAGACGA TGGGCGAGAT CGTGATGCGG GGCAACATGG TGATGAAGGG CTACTTCGAG
AACGAGGAGG CCACCCGCGA GGCCTTCGAG GGCGGCTGGT TCCACTCCGG GGACGTGGCC
GTCTGGCACC CCGACGGCTA CGTGGAGATC CGGGACCGCC GCAAGGACAT CATCATCTCC
GGCGGGGAGA ACATCTCCAC CATCGAGGTG GAGCAGGCGG TCGTGAGCCA CCCGGCGGTG
CTGGAGTGCG CGGTGGTCGC CATCCCCGAC GAGAAGTGGG GCGAGCGCCC GAAGGCGTTC
GTGACGCTCA AGAAGGGGCA TAACGCCACG GAGGAGGAGA TCATCGAGCA CTGCAAGGCC
AAGATAGCCC GCTTCAAGGC GCCCTCGGCG GTGGAGTTCG TGGAGGAGCT GCCGAAGACC
TCCACCGGCA AGGTGCAGAA GTTCGTGCTG CGCGAGAAGG AGTGGGCGGG GCAGGAGAAG
CGGGTGCACT GA
 
Protein sequence
MGEKVYRTEL TPVSFLRRSA FMFPEKTAVV YGDRRYTYRE FERRVDRLAS GLREAGLRGG 
DRVAFICPNT PPLLEAHFAV PAAGGVLVAI NTRLSPEEVG YILEHSGARF VFADAGLEHL
ASGAEAQRVR IDDTGAEGDP YEDFLAAAPP EPPESPLKDE EETISLNYTS GTTGRPKGVM
YSHRGAYLCA LGNVIEAGMG YETRYLWTLP MFHCNGWTYP WAVTAVAGTH VCLRRVEPGR
IWRLFKEEGI THYCAAPTVQ VGIINDEAAH RLPRPVRAMI AGAPPSPTLI AGLGDLNIDP
VHIYGLTETY GPITTSAPRK EWEELPAEER ARLLARQGNA YVTADIVRVV DENLQDVPRD
GETMGEIVMR GNMVMKGYFE NEEATREAFE GGWFHSGDVA VWHPDGYVEI RDRRKDIIIS
GGENISTIEV EQAVVSHPAV LECAVVAIPD EKWGERPKAF VTLKKGHNAT EEEIIEHCKA
KIARFKAPSA VEFVEELPKT STGKVQKFVL REKEWAGQEK RVH