Gene Rxyl_0403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0403 
Symbol 
ID4115224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp432980 
End bp434635 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content71% 
IMG OID638035192 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_643190 
Protein GI108803253 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 



Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000492986 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACCGAT CGTGGATCGC CACCCCTCTT CTGGCCCTCT TTTTGTTCGC GGGGGCGTGC 
GCCGGGTCGC GGGAGGATGC CCCGGACCCC GGCCGGGGCG GCGACCGCGG GGACCCGGGC
GGCCGGGACG CCGCCGTCTC GAGCATAGAG GACGTGCGGA AGGCCACGGT GTACATCGAG
GCGCGGGGCG GTGCCTACGA CGAGGGGCGG GGCTTCGGGG AGGTCAGCTA CGGCAGCGGC
TCGGGGTTTA TCGTGGGCGA CGGCGGCGGG AGCGGCAAGC TCGTCATCAC CAACAACCAC
GTGGTGACCG GGGCGGGCTT TCTGCAGGTG TACCTGGACG GCCAGGACGA GCCGGTCGAC
GCCAGGGTGC TCGGCGCCTC CGAGTGCTCG GACCTCGCGC TGCTGGAGCT CGAGGGCGGC
GGGTACCCCT ACCTCTCCTG GCGGACCGGC GACATAGACG CCGGCCTCGG CGTGCGCGCC
GCCGGCTACC CGGCGGACGA CGTGGAGACC GGCGAGCGGC CAGACTACAC CATAACCAGC
GGGAGCATAA ACTCCACCGA GGCCGACGGC GAGACGCCCT GGGCCTCGGT GGACTCTGTG
CTGGAGCACG ACGTCCTGAT CCGGCCCGGC AACTCCGGCG GGCCGCTCGT CGACGAGAAC
GGGCGGGTGG TGGGGGTCAA CTACGCCTCG CGGGTGGACG ACGAGGGGCG CCCGACCGGC
CCGCAGCTGG CCATCGCCCG GGACGAGGCC CGCACCATCG TGGACAAGCT GCGCCAGGGG
GACGTGGAGT CCATCGGGGT GAACGGCGAG GCGTTCAGCC TCCCGGAGCA GGAGATCTCC
GGCATCCGGG TGACCTCGGT GAAGACCGAC TCCCCGGCGG GCCGGGTGGG GCTGCGCAAC
GCCGTTATCG ACCCGCAGAG CGGCGAGTTC GCGGCCTTCG ACGTGATCAC GGAGATCGAA
GGCACCCGGC TCGGCGAGGG AGGGACGATG GAGGAGTATT GCAACATCCT CCGCCAGCAC
GAGCCGGACG ACAGGCTCAG CATCCAGGCG GTGCGGGTGG AGGAGAACGG CGACGTCTCC
CTGATGGAGG GCGCCCTGAA CGGCGAGGAG CTGGCGGTCG TCGAGACCAT CCCGGCGCAG
ACCGACGCCG GCGGAGAGCC GCAGGGGGGC TTTGTCTCGC TGACCGACGA TACCGGCACG
CTCACCATGG AGGTCCCGGC CGCCTGGAGC GACGTCCGGA CCGGCGGGAG CCTAAAGCTG
GACGGCGAGA GCCTGGGGCC GGCCATGCTG GCCTCCACCG ACGCCCAGCG CTGGATCGAC
ACCTTCGAGG TGCCTGGCGT GTACTTCGCG GCCTCGAGCC GCCTCGCCGA ACGCTTCCCG
GAGAACCCCG TTGAACAGAT CCTGGACCTG CCGGAGTACG ATTTCTCCGG CACCTGCCGG
TACGAGGGGC GGGAGGGCTA CCAGGACAGC AAGTTCACCG GCGCCGTAGA CACCTACACC
GGCTGCGACG GTACGGACAA CGCCTTCCAG ATCTACGCCG CAACGCCCCC GGACGGCTCC
TACGTCGTGG TGCTGCAGGC CGTCATAACC AGCGAGGCCG ACCTCGACGG GCTCCAGAGG
ACCCTCGCCA CCTTCGACGT CCTGCAGCAG CCCTGA
 
Protein sequence
MHRSWIATPL LALFLFAGAC AGSREDAPDP GRGGDRGDPG GRDAAVSSIE DVRKATVYIE 
ARGGAYDEGR GFGEVSYGSG SGFIVGDGGG SGKLVITNNH VVTGAGFLQV YLDGQDEPVD
ARVLGASECS DLALLELEGG GYPYLSWRTG DIDAGLGVRA AGYPADDVET GERPDYTITS
GSINSTEADG ETPWASVDSV LEHDVLIRPG NSGGPLVDEN GRVVGVNYAS RVDDEGRPTG
PQLAIARDEA RTIVDKLRQG DVESIGVNGE AFSLPEQEIS GIRVTSVKTD SPAGRVGLRN
AVIDPQSGEF AAFDVITEIE GTRLGEGGTM EEYCNILRQH EPDDRLSIQA VRVEENGDVS
LMEGALNGEE LAVVETIPAQ TDAGGEPQGG FVSLTDDTGT LTMEVPAAWS DVRTGGSLKL
DGESLGPAML ASTDAQRWID TFEVPGVYFA ASSRLAERFP ENPVEQILDL PEYDFSGTCR
YEGREGYQDS KFTGAVDTYT GCDGTDNAFQ IYAATPPDGS YVVVLQAVIT SEADLDGLQR
TLATFDVLQQ P