Gene Rxyl_1952 details


Gene Information

Locus tag: Rxyl_1952
Symbol:
ID: 4115744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 1973708
End bp: 1975669
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 76%
IMG OID: 638036738
Product: O-antigen polymerase
Protein accession: YP_644711
Protein GI: 108804774
COG category:
COG ID:
TIGRFAM ID:
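
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1973708
END_BP = 1975669
GENE_LENGTH_BP = 1962
PROTEIN_LENGTH_AA = 653


def span_length(start: int, end: int) -> int:
    """Length of an interval given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the reported Gene Length and Protein Length.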


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.953343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCCG CCTCCCCCGG CAAGCCCCTC GCCGAGCACG GCCGGCTCGC CGCGAGCCTC 
GCGCTGTGCG CCCTCGCGCT GCTCGCCCCC TGGATGGGCT GGCGGTACGG GGGCTACTAC
GTGGGCGGCT GGGCCCCTCC GGCCCTCGCC TCGGCGGCGC TGCTGCTCGT CCTCGCCGCC
GCCGGCGCGC TCGGCAGGAT GCGCTCTTCC TGGGGCGCGG CCGCGGCCGC CCTGCTCACC
GGCTACGCCG CGTGGACCTT CCTCTCGCTG CTGTGGTCCC CGGACCGCGG AGAGGCCTGG
CAGGGGGCGG GGCTCACCCT CCTGTACCTG ATGGCCTTCT GGGCCGCCCT CTCCCTGCTC
TCCTCCGGAG CCTCCCGCCG CTGGGCGCTC GCCGCCTGCG CGCTGGGGAC CGGCGCGGTG
GCCGCCCTGA CGCTCCCCCG GCTGGCCCCG GAGGCGGAGG AGCTCTTCGT GAACGGGCGC
CTGCTCGGGA CCGCCGGCTA CTTCAACGCC GAGGCCGCCT TTCTGCTGCT CCCCTTCTGG
GCGGCCCTCT GCCTGGCGGG CTCCCCGCGG GTGAACCCGC TGCTGCGGTG CCTGGCCCTC
GCCGCGGCGG CGCTCTGCTC GCAGCTTGCG GTCCTCACCC AGTCGCGGGG GGCGGCGATG
GCCCTCGCCG CCTCGCTCCC GGTGTTCTTC CTGCTGTCGG GCCGGCGGAT GCGGGGAGCT
CTGGCCCTGC TGCCGGTCGC GGCGGCGCTC ATCCTGAACT TCCCGCAGCT CAACGGGGTC
TACCTGGCCG GCGAAGGGGC CGCCCTGGAG GAGGCGTTGC GCCGCGCGGT GCCGCAGGTG
TGGCTCGCCG CCCTCGCCTG CGGGCTGTAC GGGCTGGGGT GGGCGCTGCT CGACGCCCGC
TGGAGGCCGC CCGCCCGCGC GGCGCGCGCG GCCGGGGCCG CCACCCTCGC CGCGCTGCTG
CTCCTCGCGG CGGCGGGCGG GGCCGCCTTC TACGCCCGGG AGGGGAGCCC CGTGCGGTGG
GTCCAGCAGA AGGCGGAAGC CTTCCAAAAC GGCGACCGCT CGGGGCAGGA GCAGAGCCGC
TACCTGAGCG CCTCGGGCTC GGGCCGGCTC GTGATGTGGC GGGTGGCCTG GGAGGACTTC
GCCCGCCACC CGGTGCTGGG GGTGGGCACC CACAACTACG AGGCCACCTA CTACCGGCTG
CGAGGCGAGA GGGCGGGCTA CGTGCGCCAG CCGCACTCGC TGCCGCTGGA GGCGCTGGCC
GAGAGGGGCG TGGTGGGCGG GGCGCTGCTC GCGGGCTTTC TCGGGGTGTG CTTTGCGGCG
GGATTGCGCC GGTTCGGCGG CCTGAACGCG GAGGGGCGAA CGCAGCTGGC GGCGGCCTGC
GCGGCGGCGG CGTACTGGCT GGCGCACTCC GGCCTGGAGT GGTTCTGGCA GTTCCCGGCG
ATCACGCTCC CGGCCATGCT CTGCCTGGCC GCTCTGGCCG CGCCCTGGAG CCGCGGAGAG
GGGGAGGGCG GGGCCGGAGG CCGGACGGAG CGTTCCCTGC GGCTCGCCGG TGCGGGGCTC
GCCGCCCTGA TGCTCGCCAC CGCGCTCCCG CTGTACGCCG CCGACCGCTA CGCGCAGCAG
AGCGCGGCGG CCGAGAACCC GTGGATGGCC CTCCAGAGGC TGGAGACGGC CCGGCGCCTC
AACCCGGTCA GCGCCGAGCT TCCGCTGCGC GAGGCGGAGC TGCTGGAGCG GGTGGGGGAC
TGGCCGGGGG CAACCGGGGC GTACGCCGAG GCGATACGGC TCTCCCCCGA GCACTACGCC
CCCTACGCGG CGATGGCGGG CTTCTACGAG CGCCACGGCG ACGCGCAAAA CGCCCTCGAG
CTCTACAGAA GGGCGCTGGA GCTTAACCCC CTGGAGCCCG CCCTGCGGGG CAAGGTGCGG
CGGCTCGAGG AGGCTAGTCC CAGTTCGCCC GCTCGCGGAT GA
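
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field; the helper names are hypothetical, and the record does not say how its own GC content value was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full sequence to check the whole gene.
    first_line = "ATGAGCGCCG CCTCCCCCGG CAAGCCCCTC GCCGAGCACG GCCGGCTCGC CGCGAGCCTC"
    print(len(clean_sequence(first_line)))
    print(f"{gc_fraction(first_line):.1%}")
```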
 
Protein sequence
MSAASPGKPL AEHGRLAASL ALCALALLAP WMGWRYGGYY VGGWAPPALA SAALLLVLAA 
AGALGRMRSS WGAAAAALLT GYAAWTFLSL LWSPDRGEAW QGAGLTLLYL MAFWAALSLL
SSGASRRWAL AACALGTGAV AALTLPRLAP EAEELFVNGR LLGTAGYFNA EAAFLLLPFW
AALCLAGSPR VNPLLRCLAL AAAALCSQLA VLTQSRGAAM ALAASLPVFF LLSGRRMRGA
LALLPVAAAL ILNFPQLNGV YLAGEGAALE EALRRAVPQV WLAALACGLY GLGWALLDAR
WRPPARAARA AGAATLAALL LLAAAGGAAF YAREGSPVRW VQQKAEAFQN GDRSGQEQSR
YLSASGSGRL VMWRVAWEDF ARHPVLGVGT HNYEATYYRL RGERAGYVRQ PHSLPLEALA
ERGVVGGALL AGFLGVCFAA GLRRFGGLNA EGRTQLAAAC AAAAYWLAHS GLEWFWQFPA
ITLPAMLCLA ALAAPWSRGE GEGGAGGRTE RSLRLAGAGL AALMLATALP LYAADRYAQQ
SAAAENPWMA LQRLETARRL NPVSAELPLR EAELLERVGD WPGATGAYAE AIRLSPEHYA
PYAAMAGFYE RHGDAQNALE LYRRALELNP LEPALRGKVR RLEEASPSSP ARG
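
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the pipeline that generated this record: it builds the codon table from the standard TCAG ordering, relying on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The `translate` function and its handling of unknown codons as `X` are assumptions made for this example.

```python
from itertools import product

# Codon assignments in TCAG order; shared by NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks codons with unexpected characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGAGCGCCG CCTCCCCCGG CAAGCCCCTC"))
```

Translating the first 30 bases yields the first 10 residues of the listed protein, and the final codons GCT CGC GGA TGA yield the terminal ARG followed by the stop codon.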