Gene Rxyl_1950 details

Gene Information

Locus tag: Rxyl_1950
Symbol:
ID: 4115742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 1971745
End bp: 1972917
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 76%
IMG OID: 638036736
Product: hypothetical protein
Protein accession: YP_644709
Protein GI: 108804772
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
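
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention. The function names `gene_length` and `protein_length` are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 1971745
END_BP = 1972917


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1173 390
```

Under those assumptions the result agrees with the listed gene length (1173 bp) and protein length (390 aa).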


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0965898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGATTC TCGACGAGAG CGGCGCGCGC GCCCTCACGA CCTCCGTGGC GCGCTCCGGG 
CGGGAGCTTG AGGCGCTGGA GGCGGAATGG GAGGCGCTCT ACGCCGCAAG CCCCGCGGCC
ACCCCCTTCC AGTCCTGGGC CTGGCTGTAC TCCTGGTGGG AGGTGTACGG GGAGCGCTAC
GAGCCGTGCG CCATAACCGT GCGCTCCGGC GGGGAGCTCG CCGGGCTCGC GCCGCTCGCC
CGGGAGCGGG GGACGGGGAG GGTGCTGTTC ATGGGCACCG GCCCGAGCGT CTACCTGGAC
GTGCTGGCCC GGGGGGGCGA GGAGGCGGCG GTCGTCCGGG CGGTGGCGGG GCGGCTGCGG
GAGGAGCTGC GGCCGTGGGA GGTGGCCGAC CTGCAGCACC TGCGGCCGCG GGCGGCCGCC
CGGGGACTTC TCGAGGCGTG GCGGGGACCG GCGGCGAGCC TCTGGCAGAC AAACTGCCCC
GTCCTCGAGG CGAGGCCCTT CGAGGAGCTG CTCGGCGCGC TCACCCAGAA GCAGCGGGGC
AACGTTCGCC GCCTCGTCCG GCGCTCCGAG CGGGAGGGGG TGCGGGCCGT GGCCGCCGGG
CCGGAGGAGG CCGCCGACGC CGCCTCCCGG ATGCTCGGGA TGCACCGGAA GGCCTGGCGG
GAGCGCGGCA TAAACCCCGA GCACCTCAGC CCCCGCTTCG AGGTCCTGCT CAGGGCCGCC
GCCGGGCGCC TCACCGCGAG GGGGCTCGGC TTCGTCTCCG AGTTCCGCCG GGGCGAGGAG
GTGGTGGCCT CGCACCTCCT CCTCGTGGGG CACGACCGGG TGGGGGGCTA CCTCAGCGGG
GCCACCGAGG AGGCCTTCCG GCGCTACGCC GTCTACCCGC TCTACGTCCG CGACGGGGTG
GAGGCGGCCC GCTCGCGGGG CCTGGAGGCC TTCGACCTCA TGTGGGGCAG GGGCGAGCAC
AAGCTGCAGT GGGGTCCCGA GATGGTCCCG AGCCGGCGTC TGGTCCTGGG CCGCAACCGG
CTCCCGCTCT GGGCGCCCTA CGCCGGGCAC CACCTGCTCC GCTCCCGGGT CAAGGCCGCC
GTGGACTCGG GCTCGGCCCC CCGGCCGGTG ATGCTGGCCG CCGAGGGCTA CCGGGCCGCG
CGCCGCCTCA TCCGGCGGAG GGCCGCCGGA TGA
 
Protein sequence
MRILDESGAR ALTTSVARSG RELEALEAEW EALYAASPAA TPFQSWAWLY SWWEVYGERY 
EPCAITVRSG GELAGLAPLA RERGTGRVLF MGTGPSVYLD VLARGGEEAA VVRAVAGRLR
EELRPWEVAD LQHLRPRAAA RGLLEAWRGP AASLWQTNCP VLEARPFEEL LGALTQKQRG
NVRRLVRRSE REGVRAVAAG PEEAADAASR MLGMHRKAWR ERGINPEHLS PRFEVLLRAA
AGRLTARGLG FVSEFRRGEE VVASHLLLVG HDRVGGYLSG ATEEAFRRYA VYPLYVRDGV
EAARSRGLEA FDLMWGRGEH KLQWGPEMVP SRRLVLGRNR LPLWAPYAGH HLLRSRVKAA
VDSGSAPRPV MLAAEGYRAA RRLIRRRAAG
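
The sequences above are printed in space-separated blocks of ten characters, so they need the whitespace stripped before analysis. The following is a minimal sketch, not part of the record, of how GC content and the translation could be recomputed from the gene sequence. It builds the codon table by hand. For internal codons, translation table 11 assigns the same amino acids as the standard code, and the table-11 alternative start codons are not handled here (this gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence listing, copied verbatim.
first_line = "ATGAGGATTC TCGACGAGAG CGGCGCGCGC GCCCTCACGA CCTCCGTGGC GCGCTCCGGG"
print(translate(first_line))  # MRILDESGARALTTSVARSG
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` should allow the stated GC content and protein sequence to be checked in the same way.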