Gene Rxyl_2188 details

Gene Information

Locus tag: Rxyl_2188
Symbol:
ID: 4117422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 2204818
End bp: 2206065
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 76%
IMG OID: 638036979
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_644944
Protein GI: 108805007
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.180724
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGGAC GGCCGGCCGT CCGCCGCTCC GCGCTTCAGG CATGCGCGGG GCTTGCGGCC 
TGCCTGCTCG CGCTGCTCCT GCTGCCGGCG GCGGGCCGCG GCGAGCCGGA GGGGGAGGTC
GACGCGCTCC CCGGCGGGGC GCGCTACGTC GCCGGCGAGC TCCTGGTCGT CTACAGGTCG
GCGCCCGGGC TGGAGCGGGC GCTCGAGGCG ACCGGCGGCA GGGTAAAGGA GGAGCTCCCC
GCCTCCGACG CCCGGCTCGT GGTCTTCCCG GCCGTCCGGG AGAAGCCCTC GGAGGCCCTC
AGGGAGCGGC TGCTCAGGGA GAAGAAGAGG GCCCTGGAGC AGAGCCCGGC CGTGGAGGCG
GTGAGCTTCA ACTACCTGCG CGAGCCCCTC GCGAACCCGA ACGACCGCTA CTTCGGCCGC
CAGTGGGGGC TCCGCAAGAT CCGGGCCCCC CTCGCCTGGA GCAGGGCGCG GGGCGGCGGG
GCGCGCGTCG CCGTGCTCGA CAGCGGCGTG GCCGCCGGCC ACCCCGACCT GCGCGGGAAG
ATCGCCGGCC GCTACAACAC CGACACCCGC ACCAGCTCGG CGGGCGACCA GTACGGGCAC
GGGACCCACG TGGCCGGGAT AGCCGCGGCC TCCACGAACA ACCGGATCGG GGTGGCGGGG
ACCTGCCCGG GGTGCCGGCT GCTGGCGGTC AAGCTGGACG GGGACGGCCT GATCACGACG
ACGGACCTGG TGCGCGGGAT CAACTGGGCA ATCGGCCGCC GCGCGGACGT AATAAACCTC
TCCCTGGGGG GCGGCGGCTT CAGCCGCCCC GAGGCCGACG CGATCGCGAA GGCCTGGAAC
CGGGGCGCGG TGGTCGTAGC GGCCGCGGGC AACGAGCGCT CCAGCAGGCG GACCTACCCT
GCGGCCTACC CGCAGGTCAT CGCCGTCTCG GCCACCACCC GGAGCGACGC CCGGGCCCGG
TACTCCAACT ACGGCGGCTG GGTGGACGTC GCGGCCCCGG GCGGCACCTC CGGCACCGGC
GGGATCTACT CGACCCTCCC CGGCGGCCGC TACGGCTACC TGAGCGGCAC CAGCATGGCC
GCGCCGTTCG TCTCCGGCGT CGCCGGGCTG CTCGCCGGGC AGGATCTCGC GAACAGCCAG
ATCCGGCGCC GCATACAGTC CACCGCCGCG GACCTCGGCC CTCGCGGCCG CGACCCCTAC
TACGGCCACG GCCGGTTGGA CGCCGCCGCC GCGGTGGGAG CCGCCTAG
 
Protein sequence
MRGRPAVRRS ALQACAGLAA CLLALLLLPA AGRGEPEGEV DALPGGARYV AGELLVVYRS 
APGLERALEA TGGRVKEELP ASDARLVVFP AVREKPSEAL RERLLREKKR ALEQSPAVEA
VSFNYLREPL ANPNDRYFGR QWGLRKIRAP LAWSRARGGG ARVAVLDSGV AAGHPDLRGK
IAGRYNTDTR TSSAGDQYGH GTHVAGIAAA STNNRIGVAG TCPGCRLLAV KLDGDGLITT
TDLVRGINWA IGRRADVINL SLGGGGFSRP EADAIAKAWN RGAVVVAAAG NERSSRRTYP
AAYPQVIAVS ATTRSDARAR YSNYGGWVDV AAPGGTSGTG GIYSTLPGGR YGYLSGTSMA
APFVSGVAGL LAGQDLANSQ IRRRIQSTAA DLGPRGRDPY YGHGRLDAAA AVGAA
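
The gene and protein fields above fit together: 1248 bp is 416 codons, and dropping the final stop codon (TAG) leaves the 415 aa listed. The Python sketch below shows one way to check a record like this by hand. It is not the database's own pipeline; the helper names (`clean`, `gc_content`, `translate`) are hypothetical. It assumes the amino-acid assignments of translation table 11, which match the standard code, and it ignores the alternative start codons that table 11 permits, since this gene starts with ATG.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11 (same as the
# standard code), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAGAGGAC GGCCGGCCGT CCGCCGCTCC GCGCTTCAGG CATGCGCGGG GCTTGCGGCC"
print(translate(first_line))  # MRGRPAVRRSALQACAGLAA
print(1248 // 3 - 1)          # 415, the listed protein length
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence, and `gc_content` on the same string can be compared against the GC content field.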