Gene Information |
Locus tag | Rxyl_0403 |
Symbol | |
ID | 4115224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 432980 |
End bp | 434635 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638035192 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_643190 |
Protein GI | 108803253 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
| |
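The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 432980
end_bp = 434635

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 1656
print(remainder)       # 0
print(protein_length)  # 551
```

Both results agree with the reported Gene Length (1656 bp) and Protein Length (551 aa).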
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000492986 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCGAT CGTGGATCGC CACCCCTCTT CTGGCCCTCT TTTTGTTCGC GGGGGCGTGC GCCGGGTCGC GGGAGGATGC CCCGGACCCC GGCCGGGGCG GCGACCGCGG GGACCCGGGC GGCCGGGACG CCGCCGTCTC GAGCATAGAG GACGTGCGGA AGGCCACGGT GTACATCGAG GCGCGGGGCG GTGCCTACGA CGAGGGGCGG GGCTTCGGGG AGGTCAGCTA CGGCAGCGGC TCGGGGTTTA TCGTGGGCGA CGGCGGCGGG AGCGGCAAGC TCGTCATCAC CAACAACCAC GTGGTGACCG GGGCGGGCTT TCTGCAGGTG TACCTGGACG GCCAGGACGA GCCGGTCGAC GCCAGGGTGC TCGGCGCCTC CGAGTGCTCG GACCTCGCGC TGCTGGAGCT CGAGGGCGGC GGGTACCCCT ACCTCTCCTG GCGGACCGGC GACATAGACG CCGGCCTCGG CGTGCGCGCC GCCGGCTACC CGGCGGACGA CGTGGAGACC GGCGAGCGGC CAGACTACAC CATAACCAGC GGGAGCATAA ACTCCACCGA GGCCGACGGC GAGACGCCCT GGGCCTCGGT GGACTCTGTG CTGGAGCACG ACGTCCTGAT CCGGCCCGGC AACTCCGGCG GGCCGCTCGT CGACGAGAAC GGGCGGGTGG TGGGGGTCAA CTACGCCTCG CGGGTGGACG ACGAGGGGCG CCCGACCGGC CCGCAGCTGG CCATCGCCCG GGACGAGGCC CGCACCATCG TGGACAAGCT GCGCCAGGGG GACGTGGAGT CCATCGGGGT GAACGGCGAG GCGTTCAGCC TCCCGGAGCA GGAGATCTCC GGCATCCGGG TGACCTCGGT GAAGACCGAC TCCCCGGCGG GCCGGGTGGG GCTGCGCAAC GCCGTTATCG ACCCGCAGAG CGGCGAGTTC GCGGCCTTCG ACGTGATCAC GGAGATCGAA GGCACCCGGC TCGGCGAGGG AGGGACGATG GAGGAGTATT GCAACATCCT CCGCCAGCAC GAGCCGGACG ACAGGCTCAG CATCCAGGCG GTGCGGGTGG AGGAGAACGG CGACGTCTCC CTGATGGAGG GCGCCCTGAA CGGCGAGGAG CTGGCGGTCG TCGAGACCAT CCCGGCGCAG ACCGACGCCG GCGGAGAGCC GCAGGGGGGC TTTGTCTCGC TGACCGACGA TACCGGCACG CTCACCATGG AGGTCCCGGC CGCCTGGAGC GACGTCCGGA CCGGCGGGAG CCTAAAGCTG GACGGCGAGA GCCTGGGGCC GGCCATGCTG GCCTCCACCG ACGCCCAGCG CTGGATCGAC ACCTTCGAGG TGCCTGGCGT GTACTTCGCG GCCTCGAGCC GCCTCGCCGA ACGCTTCCCG GAGAACCCCG TTGAACAGAT CCTGGACCTG CCGGAGTACG ATTTCTCCGG CACCTGCCGG TACGAGGGGC GGGAGGGCTA CCAGGACAGC AAGTTCACCG GCGCCGTAGA CACCTACACC GGCTGCGACG GTACGGACAA CGCCTTCCAG ATCTACGCCG CAACGCCCCC GGACGGCTCC TACGTCGTGG TGCTGCAGGC CGTCATAACC AGCGAGGCCG ACCTCGACGG GCTCCAGAGG ACCCTCGCCA CCTTCGACGT CCTGCAGCAG CCCTGA
|
Protein sequence | MHRSWIATPL LALFLFAGAC AGSREDAPDP GRGGDRGDPG GRDAAVSSIE DVRKATVYIE ARGGAYDEGR GFGEVSYGSG SGFIVGDGGG SGKLVITNNH VVTGAGFLQV YLDGQDEPVD ARVLGASECS DLALLELEGG GYPYLSWRTG DIDAGLGVRA AGYPADDVET GERPDYTITS GSINSTEADG ETPWASVDSV LEHDVLIRPG NSGGPLVDEN GRVVGVNYAS RVDDEGRPTG PQLAIARDEA RTIVDKLRQG DVESIGVNGE AFSLPEQEIS GIRVTSVKTD SPAGRVGLRN AVIDPQSGEF AAFDVITEIE GTRLGEGGTM EEYCNILRQH EPDDRLSIQA VRVEENGDVS LMEGALNGEE LAVVETIPAQ TDAGGEPQGG FVSLTDDTGT LTMEVPAAWS DVRTGGSLKL DGESLGPAML ASTDAQRWID TFEVPGVYFA ASSRLAERFP ENPVEQILDL PEYDFSGTCR YEGREGYQDS KFTGAVDTYT GCDGTDNAFQ IYAATPPDGS YVVVLQAVIT SEADLDGLQR TLATFDVLQQ P
|
| |
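The gene and protein sequences above can be spot-checked against each other by translating the first and last codons with the amino-acid assignments of translation table 11. This is a minimal sketch, not the annotation pipeline's own method: `translate` is a hypothetical helper, and the codon table is built from the NCBI ordering of bases (T, C, A, G). Only the first 30 and last 12 nucleotides of the gene sequence are used.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, listed in the
# conventional codon order TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Hypothetical helper for illustration; spaces between 10-nt blocks are ignored.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 12 nt of the gene sequence, copied from the record.
first_30 = "ATGCACCGAT CGTGGATCGC CACCCCTCTT"
last_12 = "CAGCAG CCCTGA"

print(translate(first_30))  # MHRSWIATPL
print(translate(last_12))   # QQP
```

The first ten residues match the start of the protein sequence (MHRSWIATPL), and the final codons give QQP followed by a TGA stop, matching its end.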