Gene Information |
Locus tag | Rxyl_0443 |
Symbol | |
ID | 4116605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 474375 |
End bp | 475544 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 638035231 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_643229 |
Protein GI | 108803292 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
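The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, assuming that Start bp and End bp are inclusive coordinates and that the gene length includes the stop codon. Both assumptions are consistent with the values in this record, but neither is stated in it. The function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes inclusive coordinates and a gene length that includes
    the terminal stop codon (assumptions, not stated in the record).
    """
    gene_length = end_bp - start_bp + 1  # inclusive coordinate span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # last codon is the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, prot_len = expected_lengths(474375, 475544)
print(gene_len, prot_len)  # 1170 389
```

Under these assumptions the result agrees with the Gene Length (1170 bp) and Protein Length (389 aa) fields.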
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGGTTA GAATTGCCCG GATGACGGCG CTCGACTGGT GCATAGTTGC GTTCGTGGCG CTGGCCGTCT TTCGCGGCGC CCGGACCGGC TTTCTCGCGG GCCTCTTCTC GCTGGTGGGG GTGCTGCTGG GGGCCTCGGT CGGCTCCCGG GTGGCCGGGC ACCTCATCCC GGAGGGCGAG AGCCCCTTTC TGGGGGCCGC GATCACGCTG GTGAGCATAG TCTCCTTCGC GATCCTCGGC GAGATGATCG CCCGCTCGGC CGGGGGCTCG CTCCGCAGCA GGCTCCGGGG CGGCGGCTCC TCCCTGCTCG ACAGCGCCGG CGGGGCCGCC CTCGGGCTGG CGCTCTCCCT GCTGCTGGTG TGGGCGGTCG GGATCTTCGC CATCCAGTCC CCCCCGCTCT CCGGGCTGCA CCCGCTGGTG AAGGACTCGC GCATCATCCG CGCGCTCGAC GAGCGGATGC CCGCCGAGCT CCTCACCCAG GCCGTCGCCC AGCTCAACCC GCTCCCCCAG ATGCGCGGCC CCGACGCCGG GGTGGGGGCG CCCGACGGGA GCATCGTCCG CGACCCCGAC GTGCTCGCCG CAAGCTCCCG GATGGTCCGG ATCACGGGCA TCGCCTGCGG CTACGGCATC GAGGGCTCCG GGTGGGTCGC CGCTCCGGAC CTGATCGTCA CCAACGCCCA CGTGGTCGCC GGGGAGACCG TCACCAGCGT CCAGCCCGGG GGGACCGGGC CGCGCCGGAG GGCCGACGTG GTGGTCTTCG ACCCCAAGAA CGACGTGGCC GTCCTGCGGG TGGAGGACCT GGGGCTCACC CCCCTGCCGC TGGACGAGCC GGTCCCCGGA GAGCCCGCGG CGGTCCTCGG CTTCCCCGGC AACGGGCCGC TGGACATCCA GCCCGCCCGC ACCGGGGCCA CGCAGCGCGT GATCTCCAGC GACGCCTACA ACCGCGGCCC GGTGGAGCGC ACGGTCACCA GCTTCCGGGT CTACGTCCGG CCGGGGAACT CCGGGGGGCC GGTGGTGAAC GCCGAGGGCG AGGTGACCGC CACCATCTTC GCCAGCCGGG CCAACTCCCG CAACTCCGGC TACGGGATCC CCTCCCAGAT CATCCGCCGC CACCTGGAGA GGGCCACCCT CCGCGCGGAG CCGGTGGGCA CGGGTCCCTG CGCGAGCTGA
|
Protein sequence | MWVRIARMTA LDWCIVAFVA LAVFRGARTG FLAGLFSLVG VLLGASVGSR VAGHLIPEGE SPFLGAAITL VSIVSFAILG EMIARSAGGS LRSRLRGGGS SLLDSAGGAA LGLALSLLLV WAVGIFAIQS PPLSGLHPLV KDSRIIRALD ERMPAELLTQ AVAQLNPLPQ MRGPDAGVGA PDGSIVRDPD VLAASSRMVR ITGIACGYGI EGSGWVAAPD LIVTNAHVVA GETVTSVQPG GTGPRRRADV VVFDPKNDVA VLRVEDLGLT PLPLDEPVPG EPAAVLGFPG NGPLDIQPAR TGATQRVISS DAYNRGPVER TVTSFRVYVR PGNSGGPVVN AEGEVTATIF ASRANSRNSG YGIPSQIIRR HLERATLRAE PVGTGPCAS
|
| |
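The gene sequence is displayed in space-separated blocks of 10 bases and, since it begins with ATG and ends with TGA, appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to strip the block spacing, compute a GC fraction, and translate the sequence. It is a minimal illustration only: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
# Codon table in the conventional TCAG ordering; the amino-acid
# assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence in this record (seven codons).
print(translate("ATGTGGGTTA GAATTGCCCG G"))  # MWVRIAR
# GC fraction of the first two blocks only, not of the whole gene.
print(gc_fraction("ATGTGGGTTA GAATTGCCCG"))  # 0.5
```

The translated prefix matches the first seven residues of the protein sequence above. Running `gc_fraction` and `translate` on the full gene sequence would allow the GC content and protein sequence fields to be checked in the same way.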