Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shel_04340 |
Symbol | |
ID | 8394326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | + |
Start bp | 518577 |
End bp | 519755 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644985198 |
Product | transglutaminase-like enzyme, predicted cysteine protease |
Protein accession | YP_003142844 |
Protein GI | 257063172 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00550212 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGAAGG CAACGACCTT CGTCGTGGTC CTCTCGCTGG CAGCCATGGG CGCTCTGCTC GCTGCCTGCA GCAGCCAGAC CGAAGAGCCT GCCGCTGAGG AGCCCGCAGC TGAAGAAGCT GCCACCGAAG AGGCTGCTAC TGAGGAACCG GCCGCTGAGG CTACGACCGA GCAGGCCGCA CCTGAGGCTA CCGCCGAGGA AGAAGAGGCC CCTGCTGAGG CCGAGGTTGT CTCCGGCACC GTGACCACCA CCGTTGACCT GACCGCCTAT GACAAGGGCC AGGTCGTCCG CGTGTGGGTC CCCGTCGCCA CCGACAATGA CTACCAGGTC ATCACCGACG ACGAAGTTGA CGGCGGCGCC AACGCCACCA CGGCCGAGAT CGTCGAGTCC GCTGACGGCA ACAAGATGGC TTACATCGAA TGGGATGAAA ACGTTGAGCC TGCTGACCGC ATCGCCACCG TTTCCTTCCA TGCCGAGCGC ACCGAAGCTC TGCGTCCTGA AATCGTCGAA GAGGGCGAAG TCCCCGAAGA CATCGCTGCC AACTACCTGG GCAGCTCCTC CATGGTCAAG GTCGACGATC CTGAGGTCGT CGCTCTGGCC GAGGAAATCA CCGCTGGCAA GGAGACCTAC GTCGACAAGG CCCGCGCGGT CTACGACTGG GTTTATGAGA ACATGAACCG TGACAACAAC GTCACCGGCT GCGGCGACGG CGACGTCTGC CGTCTGGTCT CTGACGGCGT CCGTGCCGGC AAGTTCACCG ACATCAACTC CGTGTTCGTG GCCCTGTGCC GCGCCTCCGG CGTGCCTGCC CGCGAAATGT TCGGCATCCG CATGAACGAT GCTGACATCA CCAAGAACCA GCACTGCTGG TGCGAGTTCT ACGTTCCCGG CACCGGCTGG GTTCCGGCTG ACCCTGCAGA CGTCCTGAAG GCCGTTCTCA CCGACGAGCT GGAGAAGGAT TCCCAGGAAG CCCTGGACAA GAAGGAATAC TACTGGGGTC AGTTCGACGC CAAGCGCGTT GAGTACAGCC ACGGCCGCGA CGTCGTGCTT GAGCCTGCTC AGGCCGGCGA TCCGCTGAAC GACTTCGGCT ATCCTTACGC TGAGGTTGAC GGTGAGGCTC TGGACTTCTA CAGCCCCGAC ACCTTCGTGT ACTCCGTCGC CTTCTCGGCT GACGAATAG
|
Protein sequence | MQKATTFVVV LSLAAMGALL AACSSQTEEP AAEEPAAEEA ATEEAATEEP AAEATTEQAA PEATAEEEEA PAEAEVVSGT VTTTVDLTAY DKGQVVRVWV PVATDNDYQV ITDDEVDGGA NATTAEIVES ADGNKMAYIE WDENVEPADR IATVSFHAER TEALRPEIVE EGEVPEDIAA NYLGSSSMVK VDDPEVVALA EEITAGKETY VDKARAVYDW VYENMNRDNN VTGCGDGDVC RLVSDGVRAG KFTDINSVFV ALCRASGVPA REMFGIRMND ADITKNQHCW CEFYVPGTGW VPADPADVLK AVLTDELEKD SQEALDKKEY YWGQFDAKRV EYSHGRDVVL EPAQAGDPLN DFGYPYAEVD GEALDFYSPD TFVYSVAFSA DE
|
| |