Gene Rxyl_0738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0738 
Symbol 
ID4116564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp768120 
End bp769181 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content72% 
IMG OID638035522 
Productpeptidase M42 
Protein accessionYP_643519 
Protein GI108803582 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.511886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAAAG AGTCCTACGA TTTCCTGAAG AGGCTCCTCT CCGCGCCGGG ACCGAGCGGC 
CGCGAGGAGG CCGCCGCGCG GGTGTGGCGG GAGGAGGCCG GGCGTTTCGC CGACGGGGTG
CGCGGCGACA GGATGGGCAA CTCCTTCGCC ACGCTCAACC CCGGCGGCCG CCCGCGGGTG
ATGCTCAGCG GGCACATAGA CGAGATCGGG CTGATCGTCA CCCACGTGGA CGAGCAGGGG
TTCGTCCGCT TCAAGGGCGT CGGGGGCTGG GACCCGCAGG TGCTGGTGGG CCAGCGGGTG
CGCCTCCGGA CCGGGAGCGG CGAGATCCCC GGCGTCATCG GCAAGAAGGC CATCCACCTC
ATGGAGAGCG AGGAGCGCAA AAAGGCCTCC GAGATAAAGG GCCTGTGGAT AGACATCGGG
GCGAGGGACG CCGAGGAGGC GCGCCGGAGC GTGCGCGTGG GGGATGTGGC TGTCCTCGAC
CAGGAGCCGG TGGAGCTTCC CAACGGGCGC CTCGCCTCCC GCTCGCTGGA CAACCGGATG
GGGGCCTTCG TCGTGCTGGA GGCGCTGCGG CTGCTCTCCG AGGAGGAGGG GCTCTCCGCC
GAGGTGGTGG CGGTCGCCAC CGTGCAGGAG GAGGTCGGCA TCTACGGCGC CCGTGGCGCC
GCCTTCGGGC TGGACCCGGA CGCGGCCATC GCCGTCGACG TCACCCACGC CACCGACACC
CCCGGGGTGC CCAAGAACGA GCACGGGGAC CACCCGCTCG GCAGCGGCCC CGTCATAGCC
CGGGCCTCCG TGCTCAGCCC GCTGGTTACG GACGGCCTCG TCTCCGCCGC CGAGCGCGAG
GGCATCCCCT ACACCCTGGA GGCCGACTCC TCCCGCACCG GCACGGACGC CGACGCCATC
CACCTCTCGC GGGCGGGGAT CGCCACCGGG CTCGTCTCCT GCCCCAACCG CTACATGCAC
TCGCCGAACG AGATGGTGGA GCTGGGAGAT CTGGAGGGGT GCGCCCGGCT CATCGCCTCC
TACGTGCGCT CGCTGGGCCC CGACGCGGAC TTCGTCCGGT AG
 
Protein sequence
MRKESYDFLK RLLSAPGPSG REEAAARVWR EEAGRFADGV RGDRMGNSFA TLNPGGRPRV 
MLSGHIDEIG LIVTHVDEQG FVRFKGVGGW DPQVLVGQRV RLRTGSGEIP GVIGKKAIHL
MESEERKKAS EIKGLWIDIG ARDAEEARRS VRVGDVAVLD QEPVELPNGR LASRSLDNRM
GAFVVLEALR LLSEEEGLSA EVVAVATVQE EVGIYGARGA AFGLDPDAAI AVDVTHATDT
PGVPKNEHGD HPLGSGPVIA RASVLSPLVT DGLVSAAERE GIPYTLEADS SRTGTDADAI
HLSRAGIATG LVSCPNRYMH SPNEMVELGD LEGCARLIAS YVRSLGPDAD FVR