Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_1649 |
Symbol | |
ID | 8332992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 1870335 |
End bp | 1873079 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644954799 |
Product | transglutaminase domain protein |
Protein accession | YP_003112411 |
Protein GI | 256390847 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.987605 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGCA GACTGCGCAT AGGCATAGCC GCGGCCGTCG CGCTGCTGCT GACCTCCTCG GGGTTGCTGC GACTGCTGGC GCCCGGCTCG TGGGTGTGGC CGATCGTGCT GGCGGTGATC GTGGCGACCG CCTCCGGCGA GCTGTTCCGG CGGCTGGTGC GGCCGCGGCC GCTGGTGGTG CTGGCCCAGG CCGCGGTCGT GTTCTGGTAC GACATGGTGC TGCTGGCGCA CGACGTCGCC TTCGCCGGCG TGCTGCCGAC CTGGGCCGTC TTCGACAAGC TCGGAAACCT GTACTCCACC GGCGCGCACG ACATCCGCGA CACCTTCGTC GGCGGCCAGG CCACCTCCGG CATCGCCGCG ATCCTGGTGC TCTCGCTGAG CGCGCTGGCG GTGCTGGTGG ACGCCCTCGC CGCGACCTAC GCCGCCGCGC CGCTGGCCGG CCTGCCGCTG CTGGCGCTGT ACCTGGTCCC GGCCACCCGC AGCGGCGGCG GCTACGACTG GCTGGCGTTC GCGCTGGCGG CCGGCGGCTA CACCGCGCTG CTCTCGGCGG AAGGCCGCGA CCGCCTCGGC CGCTGGGGAC GGCCGCTGGT CCACGCCGGA CGCCAGCGGA CCGGCGCCGA CGGCAAGCCG GTCGCCGCGC CCAAGGCGCG GGTGGACACC GGGCCGATCG CCGCCACCGG ACACCAGATC ACGATGTACG CGCTGATCGT CGCGGTCCTG GCGCCGGTGC TGCTGCCCAC GGTCGCCGGC GGGCTGTTCG GGATCGGACC GGACAGCGGC AGCGGGACCG GGAACGGCCA CCACAAGGGC AACGGCTCGT TCAGCCAGGT GCTGGACCTC AAGGCGAATC TGGGCTCGCT GACCGACGAC CCGGTCCTCG ACTACAACTC GGCCAAGCCG GAGTACATCC GGCTGCAGAC GCTGGACGTG TTCGACCCGA AGAGCACCAC GCCCTGGACC CCCGACCCGT CGCTGCAGCT GAAGGGGCTG GCAGCCGACG GGACGGTCTC CAGCACCGTG CCCGGGCTCG TGGCACCGAC CCAGCCGCAG ACCGTCCCGA TGCAGATCAC CTACCTGACC GAGGAGAGCA CGAAGGTCAG CCCGACCAGC AATCTCAGCT CGCTGCCGGT GCCCTACCCC GCCAAGCAGG TCACCGGGCT GGCAGCCGGG TTCGAGGTCG ACCCCGACTC GCTGGCGGTC CTGGGCGTCT CCCCGGTGTT CAACACCATG ACCTACCGCG CGACGGTCTA CGACACGACC TCGATCCCGC CGGCCGAGCT GGCCGCCGCG ACCGGGCCGA CCTCCGACGA CAAGACGTTC TTCAACCTGA GCCGGGACCT GGACACCACA GACATCCCGC CGTACATCAA GCAGCTGGCC GAGCAGATCA CCGCGGGCCT GACCAACCCG GTGGACAAGG CCCAGGCGAT CCAGAACTAC TTCCAGAACT CGGCCAACCA CTTCAGGTAC ACGCTGGACG TGCCGAAGAA CCCGACCGGT CTGAGCGCCA TGCAGTGGCT GTTGCAGAAC AAGGCGGGCT ACTGCCAGTT CTACGCCGAG ACCATGGCCG CGATGGCGCG CTCGATCGGC ATCCCGGCGC GGATCGCGGT GGGCTTCACC CCCGGGCAGT CCGTCGGCGG CGACACCTGG GCCGTGAAGA TGCACGACTA CCACTCCTGG CCGGAGCTGT TCATGCACGG CGTCGGCTGG CTGCGGTTCG AGCCGACGGT CGGCATCGAC AACATAGCGA CCACCAGCGG CGGCCACGGC CGGATCCCGA GCTACAGCAC CAACACCACC ACGAGCACCA CCACGGCGCC GTCGCAGAAC CCGACCACGT CCTCCGCGCC CGGCGCCGGG GCGTCCTCGG CCGCCAACTG CCCGGTGAAC ATCCGCAAGG CCGGCGGATG CTCGGCGAAC CTGGAGGACG GCGGCACGGC GAGCGCGCCG AAGTCGAAGT TCAGCTGGCT CGGCTGGTTC GGCTCGGTGC CGCGCTTCCT GCAGTACTGG CTGTTCGGCG GCTCCGGCTA CGCGATCGCG GTGCGCTTCA TCCTGCTGGC GCTGCTGCTC ATCGCCTGCG TCCCGATGGT GGTGCGGATC GTGCGCCGCC GGGCCCGCTG GAAGCTGGCG GCGGGACGGC GCGCGGGCAA GCGCGCCAAG TCCGGCGGCG GGGACGAGGA CGGGCTCGAC TGGGAGTCCG CCGACGTGCC CGCCGCCCGC CGGCCCGAGG AGGACCCGGA GCGGCTGCGG ATCCTCGCGG CCTGGGACGA GGTCCGCGAC AGCGCGACCG ACCTGGGCTA CACCTGGCCG ACCTCGGAGA CCCCGAGGCG CAGCGCCGAG CGGATCATCA AGCAGGCGCA CCTGTCGCGC CCGGCGCAGG ACGCCATGGG CCGGGTCACG GTGCTGGCCG AACGGGCGAA CTACGCGCGG ACGCTGCGGC GCCCGGCGCC CTCGGGCGCG GCACCGACGC CGAACCTGCT GGACGACGTG AAGGAGATCC GCGCCGGGCT GGCCGAGCCG GTCTCGCGGC GTACCCGGAT CCGCGCGACG GTGCTGCCGC CCTCGGCGAT GGCCGCGCTG CGCGAGCGGC GGGAGGACTT CACCGGGCGG GTCTACGAGC GGGTGCAGGG CACCGGCTCG CGGCTGCGGG CTCGGACGCC TGGCGCTGTG GGGCGGTCGC GACCGGAGGG GCGGGACGGG CGAGAAGGGC GGGAAGGTCG GGAGGATCAG CGGCCTCGGC AGTAG
|
Protein sequence | MTGRLRIGIA AAVALLLTSS GLLRLLAPGS WVWPIVLAVI VATASGELFR RLVRPRPLVV LAQAAVVFWY DMVLLAHDVA FAGVLPTWAV FDKLGNLYST GAHDIRDTFV GGQATSGIAA ILVLSLSALA VLVDALAATY AAAPLAGLPL LALYLVPATR SGGGYDWLAF ALAAGGYTAL LSAEGRDRLG RWGRPLVHAG RQRTGADGKP VAAPKARVDT GPIAATGHQI TMYALIVAVL APVLLPTVAG GLFGIGPDSG SGTGNGHHKG NGSFSQVLDL KANLGSLTDD PVLDYNSAKP EYIRLQTLDV FDPKSTTPWT PDPSLQLKGL AADGTVSSTV PGLVAPTQPQ TVPMQITYLT EESTKVSPTS NLSSLPVPYP AKQVTGLAAG FEVDPDSLAV LGVSPVFNTM TYRATVYDTT SIPPAELAAA TGPTSDDKTF FNLSRDLDTT DIPPYIKQLA EQITAGLTNP VDKAQAIQNY FQNSANHFRY TLDVPKNPTG LSAMQWLLQN KAGYCQFYAE TMAAMARSIG IPARIAVGFT PGQSVGGDTW AVKMHDYHSW PELFMHGVGW LRFEPTVGID NIATTSGGHG RIPSYSTNTT TSTTTAPSQN PTTSSAPGAG ASSAANCPVN IRKAGGCSAN LEDGGTASAP KSKFSWLGWF GSVPRFLQYW LFGGSGYAIA VRFILLALLL IACVPMVVRI VRRRARWKLA AGRRAGKRAK SGGGDEDGLD WESADVPAAR RPEEDPERLR ILAAWDEVRD SATDLGYTWP TSETPRRSAE RIIKQAHLSR PAQDAMGRVT VLAERANYAR TLRRPAPSGA APTPNLLDDV KEIRAGLAEP VSRRTRIRAT VLPPSAMAAL RERREDFTGR VYERVQGTGS RLRARTPGAV GRSRPEGRDG REGREGREDQ RPRQ
|
| |