Gene Rxyl_2973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_2973 
Symbol 
ID4115695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2977814 
End bp2979064 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content68% 
IMG OID638037743 
Productglycosyl transferase, group 1 
Protein accessionYP_645695 
Protein GI108805758 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGCAGC GGGTCAACCC GGGCCACAAG GCGCTGGCCG ACTACCGCAG CATCATCCGC 
CGCGAGCTCT ACGGGGAGCT GCAGGAGCTC GCCGGGCGCC TGCGGGGGGC GCGGGTGTTG
CACATAAACG CCACCAGCTT CGGGGGCGGG GTGGCGGAGA TCCTCTACAC CCTCGTGCCT
CTGGCCCGCG ACGCCGGGCT CGAGGTGGAG TGGGCCATAA TGTTCGGCGC CGAGCCCTTC
TTCAACGTCA CCAAGAGGTT CCACAACGCC CTGCAGGGCG CCGACTACGA GCTGACAATA
GAGGACCGGG CCATCTACGA GGAGTACAAC CGCAGGACCG CGCAGGCGCT CGCCGAGTCC
GGCGAGGAGT GGGACATAGT CTTCGTCCAC GACCCGCAGC CCGCGCTCGT GCGGGAGTTC
TCCGGGGGGT TGGGGGAGGG GACGCGTTGG ATCTGGCGCT GCCACATCGA CACCTCCACC
CCCAACCGGC AGGTTCTCGA CTACCTGTGG CCGTACATAG CCGACTACGA CGCCCAGGTC
TACACCATGC GCGAGTACAC CCCGCCCGGC GTCGAGATGC CCGGGCTCAC CCTCATCCCC
CCGGCCATAG ACCCGCTCTC GCCCAAGAAC ATGGCCCTCT CGCGGGACGA CGCCAGCTAC
ATCGTCAGCC AGTTCGGGGT CGACGTCGAG CGTCCCTTTC TGCTGCAGGT CTCCCGCTTC
GACCCCTGGA AGGACCCCCT CGGCGTCATC GACGTCTACC GCATGGTCAA GGAGGAGGTG
GGGGAGGTCC AGCTGGTGCT CGTCGGCTCC ATGGCCCACG ACGACCCCGA GGGGTGGGAC
TACTGGTACA AGACCGTCAA CTACGCGGGC GGGGACCCGG ACATCTTCCT CTTCTCCAAC
CTCACCAACG TCGGCGCCAT CGAGGTCAAC GCCTTCCAGT CGCTCGCCGA CGTCGTGATC
CAGAAGTCCA TCCGGGAGGG CTTCGGGCTC GTGGTCTCCG AGGCGCTCTG GAAGGCCCGC
CCGGTGGTGG CCAGCCGCGT CGGGGGCATC CCCATGCAGA TAACCGCCGG CGGCGGCATC
CTGATAGACA CCATCCCGGA GGCGGCCGCG GCCTGCGCCA AGCTCCTCTC CGACCCGGAG
TTCGCCCGCG AGATGGGGCG GCGCGGCAAG GAGCACGTCC GGGCCAACTT CCTCACCCCC
CGCCTGCTGC GCGACGACCT GCGGCTTTTC GCTAAACTTC TCGGCGTGTA G
 
Protein sequence
MLQRVNPGHK ALADYRSIIR RELYGELQEL AGRLRGARVL HINATSFGGG VAEILYTLVP 
LARDAGLEVE WAIMFGAEPF FNVTKRFHNA LQGADYELTI EDRAIYEEYN RRTAQALAES
GEEWDIVFVH DPQPALVREF SGGLGEGTRW IWRCHIDTST PNRQVLDYLW PYIADYDAQV
YTMREYTPPG VEMPGLTLIP PAIDPLSPKN MALSRDDASY IVSQFGVDVE RPFLLQVSRF
DPWKDPLGVI DVYRMVKEEV GEVQLVLVGS MAHDDPEGWD YWYKTVNYAG GDPDIFLFSN
LTNVGAIEVN AFQSLADVVI QKSIREGFGL VVSEALWKAR PVVASRVGGI PMQITAGGGI
LIDTIPEAAA ACAKLLSDPE FAREMGRRGK EHVRANFLTP RLLRDDLRLF AKLLGV