Gene Rxyl_0443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0443 
Symbol 
ID4116605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp474375 
End bp475544 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content74% 
IMG OID638035231 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_643229 
Protein GI108803292 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGGTTA GAATTGCCCG GATGACGGCG CTCGACTGGT GCATAGTTGC GTTCGTGGCG 
CTGGCCGTCT TTCGCGGCGC CCGGACCGGC TTTCTCGCGG GCCTCTTCTC GCTGGTGGGG
GTGCTGCTGG GGGCCTCGGT CGGCTCCCGG GTGGCCGGGC ACCTCATCCC GGAGGGCGAG
AGCCCCTTTC TGGGGGCCGC GATCACGCTG GTGAGCATAG TCTCCTTCGC GATCCTCGGC
GAGATGATCG CCCGCTCGGC CGGGGGCTCG CTCCGCAGCA GGCTCCGGGG CGGCGGCTCC
TCCCTGCTCG ACAGCGCCGG CGGGGCCGCC CTCGGGCTGG CGCTCTCCCT GCTGCTGGTG
TGGGCGGTCG GGATCTTCGC CATCCAGTCC CCCCCGCTCT CCGGGCTGCA CCCGCTGGTG
AAGGACTCGC GCATCATCCG CGCGCTCGAC GAGCGGATGC CCGCCGAGCT CCTCACCCAG
GCCGTCGCCC AGCTCAACCC GCTCCCCCAG ATGCGCGGCC CCGACGCCGG GGTGGGGGCG
CCCGACGGGA GCATCGTCCG CGACCCCGAC GTGCTCGCCG CAAGCTCCCG GATGGTCCGG
ATCACGGGCA TCGCCTGCGG CTACGGCATC GAGGGCTCCG GGTGGGTCGC CGCTCCGGAC
CTGATCGTCA CCAACGCCCA CGTGGTCGCC GGGGAGACCG TCACCAGCGT CCAGCCCGGG
GGGACCGGGC CGCGCCGGAG GGCCGACGTG GTGGTCTTCG ACCCCAAGAA CGACGTGGCC
GTCCTGCGGG TGGAGGACCT GGGGCTCACC CCCCTGCCGC TGGACGAGCC GGTCCCCGGA
GAGCCCGCGG CGGTCCTCGG CTTCCCCGGC AACGGGCCGC TGGACATCCA GCCCGCCCGC
ACCGGGGCCA CGCAGCGCGT GATCTCCAGC GACGCCTACA ACCGCGGCCC GGTGGAGCGC
ACGGTCACCA GCTTCCGGGT CTACGTCCGG CCGGGGAACT CCGGGGGGCC GGTGGTGAAC
GCCGAGGGCG AGGTGACCGC CACCATCTTC GCCAGCCGGG CCAACTCCCG CAACTCCGGC
TACGGGATCC CCTCCCAGAT CATCCGCCGC CACCTGGAGA GGGCCACCCT CCGCGCGGAG
CCGGTGGGCA CGGGTCCCTG CGCGAGCTGA
 
Protein sequence
MWVRIARMTA LDWCIVAFVA LAVFRGARTG FLAGLFSLVG VLLGASVGSR VAGHLIPEGE 
SPFLGAAITL VSIVSFAILG EMIARSAGGS LRSRLRGGGS SLLDSAGGAA LGLALSLLLV
WAVGIFAIQS PPLSGLHPLV KDSRIIRALD ERMPAELLTQ AVAQLNPLPQ MRGPDAGVGA
PDGSIVRDPD VLAASSRMVR ITGIACGYGI EGSGWVAAPD LIVTNAHVVA GETVTSVQPG
GTGPRRRADV VVFDPKNDVA VLRVEDLGLT PLPLDEPVPG EPAAVLGFPG NGPLDIQPAR
TGATQRVISS DAYNRGPVER TVTSFRVYVR PGNSGGPVVN AEGEVTATIF ASRANSRNSG
YGIPSQIIRR HLERATLRAE PVGTGPCAS