Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_0812 |
Symbol | |
ID | 4116188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 844809 |
End bp | 845798 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 638035596 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_643592 |
Protein GI | 108803655 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.423303 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCTGG CCATAGAGAC CTCCTGCGAC GACACCTGCG CGGCGGTGGT GGAGCCGGAC GGCCGCCGCG CCCTCTCCAA CGCCGTCCAC ACCCAGACCG AGCACGCCCG CTACGGCGGC GTGGTGCCAG AGGTCGCCTC CCGGGCGCAC CTCGAGCGCA TGGACGGCGT CGTCCGCAAG GCGCTCTCCG ACGCCGGCGT CTCGCTGGAC CAGATAGACC GGGTGGCGGT AACCGTCCGG CCCGGCCTGA TCGGGGCGCT CCTCGTGGGG GTGGCCGCGG CCAAGGGGGT CGCCTACGCC CGGCGGCTGC CGCTCGTCCC GGTGAACCAC CTGGAGGGGC ACGTGGCCGC CGCCTACCTG GAGGCCCCCG ACCTCGAGCC GCCCTTCGTG GCCCTGGTCG CCTCCGGGGG GCACACGGCC CTGTACGCGG TGGGCGAGGA CCGCGGGATG CGCCTGCTCG GGGAGACCCT CGACGACGCC GCGGGCGAGG CCCTGGACAA GGGGGCGAGG ATGCTGGGGC TGGGCTTTCC GGGCGGCCCC GCCATCTCCA GGGCTGCCGC GGGCGGGGAC CCGGAGCGCT ACGGCTTCCC CGTGGCCCTC AAGGGGAGGG ACAACCTGGA CTTCTCCTTC TCCGGCCTGA AGACCAGCCT CCTCTACAGG ATCCGGGAGC TCGGCCCCGA GCGGGTGCGG CGAGAGCTCC CGCACCTCGC GGCGAGCTAC GAGGCCGCCG TGGTGGAGGC GCTCGCCCGC AAGCTGCTGC GCGCCGCCGA GCTCCGGGAG GCCGGGGCCG TCGTGGTGGC CGGCGGGGTG GCGGCCAACG GGCGGCTGCG CGAGAGGCTC CGGCGCGAGT GCGCCGGGCG GGGGCTCAGG CTCGTGATCC CGCACCCCAG CCTCTGCACG GACAACGCCG CCATGATCGG GGCGGCCGCC GCCCACACCC CGAAGATCCC CTGGCCGGAG TACCTCTCGC TCAACGCCCG CAGCGTCTGA
|
Protein sequence | MILAIETSCD DTCAAVVEPD GRRALSNAVH TQTEHARYGG VVPEVASRAH LERMDGVVRK ALSDAGVSLD QIDRVAVTVR PGLIGALLVG VAAAKGVAYA RRLPLVPVNH LEGHVAAAYL EAPDLEPPFV ALVASGGHTA LYAVGEDRGM RLLGETLDDA AGEALDKGAR MLGLGFPGGP AISRAAAGGD PERYGFPVAL KGRDNLDFSF SGLKTSLLYR IRELGPERVR RELPHLAASY EAAVVEALAR KLLRAAELRE AGAVVVAGGV AANGRLRERL RRECAGRGLR LVIPHPSLCT DNAAMIGAAA AHTPKIPWPE YLSLNARSV
|
| |