Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmwyl1_0988 |
Symbol | |
ID | 5368098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | - |
Start bp | 1091576 |
End bp | 1094935 |
Gene Length | 3360 bp |
Protein Length | 1119 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640803322 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001339854 |
Protein GI | 152995019 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATTA GAGTCGCAAT ACAGCACAAG ACTACCTACG AGTTTGATCG TTTTATCAAC GTGGCTCCCC ACGTACTAAG ACTACAACCG GCGGCGCACT CACGCACAAA AATTCATGCT TACTCGCTTA AAGTCTTGCC TGAAACACAT TTTATCAATG TGCAACAAGA TCCGTTTGGC AACTTCCAAA CTCGCTTAGT GTTTCCTGAA AAGACTAATA AGTTAGAGTT TTACGTGGAA GTTATCGCCG ATATGACGGT AATCAACCCA TTCGATTTCT TCGTTGAAAG CTACGCAGAA GAATATCCAT TTGCCTACGA CAAAGCGCTT AAAAAAGAGC TCGAGCCTTA CTTACAAGTA ACCGAATCTT GCCCGTTATT AGATAAATGG TTATCAACAG TTGATCGTAA AAATACGCCG ATTAACGATT TTCTCGTGGC GATCAATAGT CGCTTAGCAG CGGATATTGG TTACGGAATT CGTTTAGAGC CAGGCGTTCA GACCTGTGAA GAAACACTGA CCCTGAAAAA AGGTTCTTGT CGTGACACAT CATGGTTACT GGTTCAAATA CTTCGTAGCC TTGGATTAGC TGCACGCTTT GCGTCAGGTT ATTTAGTACA GCTCACTGCG GACGTAAAAG CTCTCGACGG TCCATCTGGC CCAGAAGAAG ATTTCACCGA CTTACACGCA TGGTGTGAAG TGTTTTTACC AGGTGCAGGC TGGGTTGGCT TAGATCCTAC TTCAGGCCTA TTCGCTGGCG AAGGTCACAT TCCTCTGGCT TGTACACCAG AACCAGCTTC GGCAGCCCCA ATCACAGGCT TTGTTGACGA ATGTGAATGT GAGTTTAGCT ACAGCAACAT TGTTACTCGT ATCCATGAAG ATCCGCGTGT GACCAAACCC TATGCAGAAG GCGAATGGGA CACTATCAAA GCTCTTGGCT TTGCGGTCGA TAAGCAACTT GAAGACGGAG ACGTTCGCCT AACCATGGGT GGAGAACCGA CCTTTGTCTC AATCGACGAC ATGGATTCAG AACAATGGAA TACTGGCGCA TTGGGCGCAG AAAAATTAAA ACTAGCCAAA GATTTGCTAA TTAAAATGAA GCAAGAATTT GGTGCTAACG GCCTACTCCA TTATGGGCAA GGCAAATGGT ATCCGGGCGA AGAAGTACCA CGCTGGGCGC TGGGTTGTTT CTGGCGCACT GACGGCGAAG CCTTGTGGAA TGATCCAAAA TATTTAGCTC GTGTAGACAA AGACTACAAA CACACCATCA AAGACGCCCA AGCGTTTGGC GAATTGTTGT GTGAAAAACT CGCCTTGGAT AAGCAATACC TTCAAAGCAC CTACGAAGAC ACGCTGTATT ATTTATGGAT GGAGCAATCG CTTCCTGCGG ATGCAGATCC AACCAAAGCC GATTTAAAAG ATGATCTAGA ACGCCGTCGT CTAGCAAAAC TGCTTAGCCG TGGTTTAAGC ACCACAACAG GTTTCGTTTT GCCGTTAGAA TTCGATACGG TAAACAATCG TTGGAATAGC TCCCTTTGGC CAATGCGCAG TGACGTTATC ACCTTAATTC CGGGCGACAG CCCAATGGGT TATCGTTTAC CACTAAATTC TTTACCTGCT CAAGCTGAAG AAGATCGCAT CCCTGAGCGT GACCCGTTTG ATCCTCGCCA ACCTTTGGCG AACAAAAAAG ACAACGCGGT ATTAGAAAGC GTTGCCAAGC AGCATTTCGC TAAAAAACCA GCAGCACCAG CCTTAGCGCC AGAAAAAACG TTGAAAAACG TAATTCGCAC TACCTTGTGT ATCGAACCAC GCGATGGCCG CTTGCATATA TTTATGCCGC CAATGACACA TCTTGAGCAC TTTGTAGATG TTCTTGAGCA GCTTGAAGCG GTTGCGAAAA CCCTTGATAA ACCAATCGTT GTCGAAGGTT ATGAGCCTCC AAAAGATCCG CGCTTGCAAA AGTTCCTGAT CACGCCAGAT CCAGGTGTTA TCGAAGTAAA TATTCATCCT GCCGCGAGCT GGAAAGAACT GGTTCACAAT ACCGAAACCT TGTATCACCA AGCTTACCTT TCACGCCTTG GCGCAGAAAA ATTCATGCTA GATGGTCGCC ATACTGGCAC AGGTGGGGGT AATCACGTAA CGCTAGGTGG TCGCACACCA GCAGACAGTC CGCTGTTGCG CCGCCCTGAG TTGTTACAGA GTTTAGTGAC CTTCTGGCAG CATCATCCAG GTTTGTCTTA TCTTTTCTCT GGCATGTTCA TTGGCCCAAC TAGCCAAGCA CCTCGCCCAG ACGAAGGACG AGATGAAGCT CTGTATGAAA TGGAAATTGC TTTCCAAAAT ATGCCTGAAG GCTTTGTTGA AGAACCTTGG TTAGTTGACC GCTTAATGCG TAACTTACTG GTCGATATCA CAGGTAACAC CCATCGTTCT GAGTTCTGCA TAGATAAGCT CTACGCAGCT GGCAGCGCCA GCGGTAGACA AGGTTTATTG GAATTCCGTG GTTTTGAAAT GCCGCCACAT CCGCATATGT CGCTCGTACA AATGCTGCTG TTGCGCTGTC TCGTGGCCCG TTTCTGGAAA GAACCGTACA AAAAGCCTTT AGTTCGTTGG GGCACTAGTT TGCATGACAA ATTCATGCTG CCGCATTTTG TTTGGCAAGA CGTGAAAGAA GTGGTCGAAG ATCTTCAGCG CCATGGTTTC CCATTCAAAC TAGAATGGTT AGCACCATTT GAAGAGTTCC GTTTCCCTCA TTATGGTCGT CAAAAAATCG ATGACATGGA AATCGAGCTT CGTTGGGCAA TTGAGCCTTG GCACGTTCTA GGCGAAGAAA TAACCGGATC GGGTACTGCT CGCTATGTAG ATTCTTCCGT AGAACGTTTG CAGGTAAAAT TATCTGGCTT AACCGATGGA CGTTATGTAC TTAGCTGTAA TGGCCGTCGT GTGCCGATTC GTTCTACCGG TCGCAAAGGC GAATACGTGG GTGCTGTGCG TTACAAAGCC TGGGCACCAC CATCTGCTCT GCACCCAACA CTAGGGACAG ACACGCCACT AATATTTGAT TTAATCGATA CCTGGAATGG CTTATCTGTG GGCGGTTGTA CTTATCATGT GTCTCACCCA GGTGGACGTA CGTATGACAA TGTACCGGTC AATAGCAACG AAGCGGAAGC TCGTCGTGTG AACCGTTTTT GGGATCACGG CTTCACTCAA GGCACGCTAT CTCCACCGCC AGCCTTCAGT GCATTGCGCT CTTTTTACCC AAATGGAGAC GAGCCTCGTG CCATGTCGCC ACCAGCAGAA GAGCCAATGA ATGAATATCC GCATACATTA GATCTAAGGA AGCAGTATAA TGTTCTCTAA
|
Protein sequence | MTIRVAIQHK TTYEFDRFIN VAPHVLRLQP AAHSRTKIHA YSLKVLPETH FINVQQDPFG NFQTRLVFPE KTNKLEFYVE VIADMTVINP FDFFVESYAE EYPFAYDKAL KKELEPYLQV TESCPLLDKW LSTVDRKNTP INDFLVAINS RLAADIGYGI RLEPGVQTCE ETLTLKKGSC RDTSWLLVQI LRSLGLAARF ASGYLVQLTA DVKALDGPSG PEEDFTDLHA WCEVFLPGAG WVGLDPTSGL FAGEGHIPLA CTPEPASAAP ITGFVDECEC EFSYSNIVTR IHEDPRVTKP YAEGEWDTIK ALGFAVDKQL EDGDVRLTMG GEPTFVSIDD MDSEQWNTGA LGAEKLKLAK DLLIKMKQEF GANGLLHYGQ GKWYPGEEVP RWALGCFWRT DGEALWNDPK YLARVDKDYK HTIKDAQAFG ELLCEKLALD KQYLQSTYED TLYYLWMEQS LPADADPTKA DLKDDLERRR LAKLLSRGLS TTTGFVLPLE FDTVNNRWNS SLWPMRSDVI TLIPGDSPMG YRLPLNSLPA QAEEDRIPER DPFDPRQPLA NKKDNAVLES VAKQHFAKKP AAPALAPEKT LKNVIRTTLC IEPRDGRLHI FMPPMTHLEH FVDVLEQLEA VAKTLDKPIV VEGYEPPKDP RLQKFLITPD PGVIEVNIHP AASWKELVHN TETLYHQAYL SRLGAEKFML DGRHTGTGGG NHVTLGGRTP ADSPLLRRPE LLQSLVTFWQ HHPGLSYLFS GMFIGPTSQA PRPDEGRDEA LYEMEIAFQN MPEGFVEEPW LVDRLMRNLL VDITGNTHRS EFCIDKLYAA GSASGRQGLL EFRGFEMPPH PHMSLVQMLL LRCLVARFWK EPYKKPLVRW GTSLHDKFML PHFVWQDVKE VVEDLQRHGF PFKLEWLAPF EEFRFPHYGR QKIDDMEIEL RWAIEPWHVL GEEITGSGTA RYVDSSVERL QVKLSGLTDG RYVLSCNGRR VPIRSTGRKG EYVGAVRYKA WAPPSALHPT LGTDTPLIFD LIDTWNGLSV GGCTYHVSHP GGRTYDNVPV NSNEAEARRV NRFWDHGFTQ GTLSPPPAFS ALRSFYPNGD EPRAMSPPAE EPMNEYPHTL DLRKQYNVL
|
| |