Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_2773 |
Symbol | |
ID | 4117314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 2783692 |
End bp | 2785941 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 638037545 |
Product | transglutaminase-like protein |
Protein accession | YP_645499 |
Protein GI | 108805562 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.663199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCGG GGCGCGCTCC CGGGCTCTGG GTTCGCCTCT GGCTGTACGC GGCGGGGCTC GCGACCGGCT CGGCGTTCTG CGTGCTGTTC ACCGGGGAGG TCTCCCCCGA CTACGGGGCG CGCTTCGACC CCGGCTCGCT GGCGGCCCTC TCCGGCTACA GCGCCTTCTG GGTGATGCTC GGGGCGGCCG CGTGCGCCCT GCTCGCGGGC TCCGCGGGGA GGGCGCGCTT CCTGGTCGCG CCGCCGGCGG CCGCCCTGTA CGCCCTCGTC GCCGCCTACG GGCCGCCGCC GCTCTCCCCC GGGGGCTGGG AGGCTCTCCT GCAGGAGGTG CGGACGGACC TCTACGCCGC GGCCGGCACC ATGTACACCC AGCCCGTGCC CTACGACCTC TCGCCGGGCG TTCTGTTCGC CGTCGTCCCC ATGGCGATGG TGGTGGTCTC CTTCGCCGCC TCCGCCACCC TCTACGAGCG CTCGCCGGTC GTCTCGGTGG CGACCCTCGG GCTCACCATA GGGGTGATAA GCACCGTGAG CTTCGAGGAC GGGGTGGGGC CCTTCTTCGC CGCCTTTCTC GCCGGGGCGT TGGGGCTGCT GCTCGCCTCC TCCGGCGCGG GGCGGGGGCT GCTGCCTTTG GGGGCGGCGG CGGGGGCCGC GGTGGCGGCC CTCGTGCTCC TCCTCCCGCG CGCGCCGCTC GCCGAGTACA CCGTGAGCTC CGGCGCCATA GACTGGACCA GGATCGGGGC CGGGGACACC TCCCGGCTCG GGGTGCAGGC CGACGTGGGG GACTACCTCA CCACCGGCCG GGAGGCCGAG CTTCTGCGCA TCCGCTCCGA GGAGCCGCTC TTCTGGCGGG GCGGGACCCT CGACCACTTC GACGGGGTGC GGTGGTCGAG CACCGTGCGG CCCGGCGAGG ACGACGGCTA CGAGGTCGCC CCCGGGGTGC AGACCCGGCT GGTGGTGCAG AGCGTGCAGG TCTTGAACGC CCGCACAGAG CTGGTCTTCG GGGGGTACAG GGTAGTGAAG ACCTCGCTGC CCGACGGCTA CGTGGAGCAG CTCCCCGACG GCTCGTGGGT CGCCTCGGAG CCCTTCGAGG AGGGCGACTA CTACAGGGTG CTCTCCGCCG TGCCCCAGCC CACCGCCGCC CAGCTGGCCA CCGCCGGCAC CGACTACCCG CCGTACGTCC GGGAGAAGTT CCTGCAGCTG CCGGAGGACA CCCCCGGGGT GGTGGGCGAG ACCGCCCGCA GGATACAGCG CCGCTACGGC CCCTCCAACC CCTACGAGGC CGCCCGGGCG GTGGAGCGCT ACCTGCTCTA CGACGGCGGC TTCGTCTACA ACCTCGACGT CAGCTACCGG CGGGCCGACA GGGCCATAGA GGAGTTCCTC GGCGACGGGC GGGAGGGGTT CTGCACCCAG TTCGCCACCT CCATGGCGCT CATCCTGCGC GAGATGGGCA TCCCCACCCG GGTGGTGTAC GGGGCCACCA CCGGCGAGCG GGTCGGGGAG GACGAGTACG TGGTCACCGG GAGCAACATG CACACCTGGG TCGAGGTCTT CTTCCCCGGC GTGGGCTGGT ACCCCTTCGA CCCCACCCCG GGCTTCGGGC TCCCGCAGGC CATGGAGGCC AACGCGCCGC GGGCCCCCGC CGGGCCGGCC GGCAGCCCGA TACCGGAGAA CCCCGCGCTG CGGCCCGGTA ATCTCTCCCG GAGCCCCGAG CCCTCCGAGG CCCCCCTCCC CGGGGAGCCC GCCGGCGGCG AGCGGGCGGG CGCCGCGCAG AAGGAGGCGG GCGGGGAGGG GCTTCCGGTC TGGCCGCTGG CGGTTGCGGG GGCCCTCTCG CTCCTGGCCC TCCCGCCGCT CCTCAAGCGG GCGCTGCTCG CCCGGGGCCG GCCCGCCGGC CTCTACAGGG ACCTCTGTGG GCGCCTGCGC GACGCGCTGC CGCCGGGCCG GGGCGCCCTC GCGGACTCCC CGGCGCTCAC CGTGGAGGAG CGGCTGGGCC TGCTCTCGGG GGCGCTGGGG CTCGACGAGC GACCCTTCCT GGAGTTCGCG CGCGCCTACT CCGAGCACCT GTACGCCGCG GGGGCCTCCC GGCGCCGGCT GCACGGGGCG TACCGGCGGG CCCTGCGGGA GTACGGGCGG CTGCCCGGCT GGCGGCGGGC GTTCGCCGCG CTCAACCCGG CCTCGCTCCT GCTCCGCGCC CGGCGGGGGG CCGGGGCGGG GCGGGCCTCG CTGGCCAAGC GGGTCGGGCG CGCGCGTTGA
|
Protein sequence | MTAGRAPGLW VRLWLYAAGL ATGSAFCVLF TGEVSPDYGA RFDPGSLAAL SGYSAFWVML GAAACALLAG SAGRARFLVA PPAAALYALV AAYGPPPLSP GGWEALLQEV RTDLYAAAGT MYTQPVPYDL SPGVLFAVVP MAMVVVSFAA SATLYERSPV VSVATLGLTI GVISTVSFED GVGPFFAAFL AGALGLLLAS SGAGRGLLPL GAAAGAAVAA LVLLLPRAPL AEYTVSSGAI DWTRIGAGDT SRLGVQADVG DYLTTGREAE LLRIRSEEPL FWRGGTLDHF DGVRWSSTVR PGEDDGYEVA PGVQTRLVVQ SVQVLNARTE LVFGGYRVVK TSLPDGYVEQ LPDGSWVASE PFEEGDYYRV LSAVPQPTAA QLATAGTDYP PYVREKFLQL PEDTPGVVGE TARRIQRRYG PSNPYEAARA VERYLLYDGG FVYNLDVSYR RADRAIEEFL GDGREGFCTQ FATSMALILR EMGIPTRVVY GATTGERVGE DEYVVTGSNM HTWVEVFFPG VGWYPFDPTP GFGLPQAMEA NAPRAPAGPA GSPIPENPAL RPGNLSRSPE PSEAPLPGEP AGGERAGAAQ KEAGGEGLPV WPLAVAGALS LLALPPLLKR ALLARGRPAG LYRDLCGRLR DALPPGRGAL ADSPALTVEE RLGLLSGALG LDERPFLEFA RAYSEHLYAA GASRRRLHGA YRRALREYGR LPGWRRAFAA LNPASLLLRA RRGAGAGRAS LAKRVGRAR
|
| |