Gene Rxyl_2038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_2038 
Symbol 
ID4115848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2063981 
End bp2065234 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content59% 
IMG OID638036825 
Productlycopene beta and epsilon cyclase 
Protein accessionYP_644795 
Protein GI108804858 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCTGAGG CCAACGACAG CGGAGTAGCG GTGAAACAAC ATGCCGGGCG AGCCTCCGAG 
GGGCTCGCCC GGGTTCTATC TTCGCGCAGA GAACCGCGGA TAGCGCTCGT AGGGGCCGGG
CTCGCCGGAA GCAGCCTGGC TCTCGCGCTT CTCTGGAGAG GTTTCCGCGG CCAGGTCACC
CTATTCGACA GCAGGACAGA TTTTTCCCGA GAGCAGCGCT GGTGCAGCTG GGGTCCCCTG
CCGGAGCCGC TGTCTGACCT AACAGATGCC TCATGGCCTG CGTGGAAGGT CATCTGCGGG
AGTAAGAGGG TCTTGTGCCG GCTCCCCGAG CGTCCTTACC TCCATCTCTA CGCGCCGAGG
TTTTTCAACT ACGCGCACCA GCAGCTGGAG AAAGCGCCGG GCTTTGCACT GAACCTCGGC
GTCGCGGTGC ACATCATAGA GGAGAAAAGA GATCGAGTAA GGTTGCAAAC CGACGCAGGC
GAGCTGGAAG CGGACTTTGT CTTTGACAGC AGACCAACCG GACAGGCAGG TGGTTCCCCA
CCACACCCCT CTGACCAGGC AATCCTCTAC CAGTCATTCC GCGGCTGGGT TTTAGAGCTC
GGTCAGAGAT GTCTTGAAAC AGGCGCCCTG ACCCTTATGG ACTTCAATAC GACTCAGGGA
AACGGCATAT CTTTCATTTA CGTCCTACCC TTCTCGGCCG ACCGGGCGCT CGTGGAGAGC
ACATCGCTCT CGCAGCAACC CGATAGCAAA GAAGAGCACG TTGCGAGGAT CAGGGATTAT
TTGGAGCGGC TCGGAGTCCG CGAATACCTC GTCAGCGCCG AGGAGTGGGG CCTACTTCCG
ATGACGACTA CGAGCTTACC GAACAGGCCG GGAAGGAAGT GGGTCAGGAT AGGGCAAGCC
GGCGGTGCCC TGCGCCCCTC AAGCGGCTAC ACCCTCGTCA ATGCGCTGCG CCAGAGCCAG
GCCATAGCAG ACGCTCTGAT AGAGGGCAGA GCGCCGCGGT CGCGACCCAT ATCTCGCAAG
TACATGATCT TCGACGATAT ATTTCTGGAA GTCCTGCGCA CCTCGCCTGA GTTGGTCAGA
GAGGGCTTGG TGAATATGTT CGAGCGCATT AGAGAGGACG CTGTCGTACG GTTTTTATCG
AGCGAGAGCA GCTTTGCAGA CGATGCAAGG CTCGTTGCAG CGCTCCCAAA GACGCCCTTC
GCTCGCGCTG CGCTGCGAAG GTTTAAAACT TATGTTACCC CGCTCATACG GTAA
 
Protein sequence
MPEANDSGVA VKQHAGRASE GLARVLSSRR EPRIALVGAG LAGSSLALAL LWRGFRGQVT 
LFDSRTDFSR EQRWCSWGPL PEPLSDLTDA SWPAWKVICG SKRVLCRLPE RPYLHLYAPR
FFNYAHQQLE KAPGFALNLG VAVHIIEEKR DRVRLQTDAG ELEADFVFDS RPTGQAGGSP
PHPSDQAILY QSFRGWVLEL GQRCLETGAL TLMDFNTTQG NGISFIYVLP FSADRALVES
TSLSQQPDSK EEHVARIRDY LERLGVREYL VSAEEWGLLP MTTTSLPNRP GRKWVRIGQA
GGALRPSSGY TLVNALRQSQ AIADALIEGR APRSRPISRK YMIFDDIFLE VLRTSPELVR
EGLVNMFERI REDAVVRFLS SESSFADDAR LVAALPKTPF ARAALRRFKT YVTPLIR