Gene Rxyl_3020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_3020 
Symbol 
ID4115956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp3027711 
End bp3028685 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content73% 
IMG OID638037790 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_645742 
Protein GI108805805 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATCT TCGAGAACCT CTCTGCGAGC TCGCGGGTAG AGCCCGGCGA GCTCGACCCC 
TACTCCCGCA CCGTCCAGGA GGTGGCCAGA AAGCTCGAGC CCGCCGTCAT AGCGCTCGGG
GTCCCGGGCG GCCGGGGCGG GGGGAGCGGC GTGATCCTGG GCCTGGACGA GGGGGCGGCC
ACCGCCGTGA CCAACAGCCA CGTCGTGCAG GGCCTCTGGC AGCGGGGCGG GACGGGCACG
ATCGCGGTCA TCCAGTCCGG CGGCGGGACG GCGCGCGCCG AGGTGCTGGG CTTCGACCAG
CTGAGCGACC TGGCCGTGAT CCGCTTCTCC CCCGAGGAGG AGCCCGCGGT TGCCGAGCTG
GGCGAGGCGG GCAACCTGGT GGTGGGCCAG CTCGTGGTGG CCATCGGGAG CCCCTTCGGT
TTCCAGAGCA CCGTAACCGC CGGGGTGGTG AGCGCGCTCG GACGCACCCT CATGGGCCAG
GACAGGCGCC TCGTCGAGAA CGTCATCCAG ACCGACGCCG CGGTGAACCC GGGCAACTCC
GGCGGCCCGC TGGCCGACGC GGACGGGCGG GTGGTGGGGA TCAACACGGC GGTCTTCGGG
GGCGCGCAGG GGCTGGGCTT CGCCATCCCC GTCTCGTCCT CCTTCCGGCG GGTGGTCTTC
TCGCTGGTCA CCGAGGGCCG GGTGCGCCGG GCCTACCTGG GGGTGATGGT CCAGAGCCAG
CCCGGCAGGG AGCCCTCGGG CCCGGGAGGC GGCGCCCGGG TGGAGAGCGT CGCCCCCAAC
AGCCCCGCCG AGCGGGCCGG CCTGAGGCCC GGGGACGTGA TCGTGGGCTT CAAGCAACAG
CCCGTGCGCA GCACGGACGA TCTGCTCAGC CTGCTGGACG GCTCGGTGAT CGGACGCGAC
GTCCAGATCC GGGTGCTGCG CCGCGGGAAG GAGACCCCGC TGAGCATCCG GCCCCAGGAG
TACCCGGAGG AGTAG
 
Protein sequence
MDIFENLSAS SRVEPGELDP YSRTVQEVAR KLEPAVIALG VPGGRGGGSG VILGLDEGAA 
TAVTNSHVVQ GLWQRGGTGT IAVIQSGGGT ARAEVLGFDQ LSDLAVIRFS PEEEPAVAEL
GEAGNLVVGQ LVVAIGSPFG FQSTVTAGVV SALGRTLMGQ DRRLVENVIQ TDAAVNPGNS
GGPLADADGR VVGINTAVFG GAQGLGFAIP VSSSFRRVVF SLVTEGRVRR AYLGVMVQSQ
PGREPSGPGG GARVESVAPN SPAERAGLRP GDVIVGFKQQ PVRSTDDLLS LLDGSVIGRD
VQIRVLRRGK ETPLSIRPQE YPEE