Gene Mmwyl1_0988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmwyl1_0988 
Symbol 
ID5368098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinomonas sp. MWYL1 
KingdomBacteria 
Replicon accessionNC_009654 
Strand
Start bp1091576 
End bp1094935 
Gene Length3360 bp 
Protein Length1119 aa 
Translation table11 
GC content47% 
IMG OID640803322 
Producttransglutaminase domain-containing protein 
Protein accessionYP_001339854 
Protein GI152995019 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases
[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTA GAGTCGCAAT ACAGCACAAG ACTACCTACG AGTTTGATCG TTTTATCAAC 
GTGGCTCCCC ACGTACTAAG ACTACAACCG GCGGCGCACT CACGCACAAA AATTCATGCT
TACTCGCTTA AAGTCTTGCC TGAAACACAT TTTATCAATG TGCAACAAGA TCCGTTTGGC
AACTTCCAAA CTCGCTTAGT GTTTCCTGAA AAGACTAATA AGTTAGAGTT TTACGTGGAA
GTTATCGCCG ATATGACGGT AATCAACCCA TTCGATTTCT TCGTTGAAAG CTACGCAGAA
GAATATCCAT TTGCCTACGA CAAAGCGCTT AAAAAAGAGC TCGAGCCTTA CTTACAAGTA
ACCGAATCTT GCCCGTTATT AGATAAATGG TTATCAACAG TTGATCGTAA AAATACGCCG
ATTAACGATT TTCTCGTGGC GATCAATAGT CGCTTAGCAG CGGATATTGG TTACGGAATT
CGTTTAGAGC CAGGCGTTCA GACCTGTGAA GAAACACTGA CCCTGAAAAA AGGTTCTTGT
CGTGACACAT CATGGTTACT GGTTCAAATA CTTCGTAGCC TTGGATTAGC TGCACGCTTT
GCGTCAGGTT ATTTAGTACA GCTCACTGCG GACGTAAAAG CTCTCGACGG TCCATCTGGC
CCAGAAGAAG ATTTCACCGA CTTACACGCA TGGTGTGAAG TGTTTTTACC AGGTGCAGGC
TGGGTTGGCT TAGATCCTAC TTCAGGCCTA TTCGCTGGCG AAGGTCACAT TCCTCTGGCT
TGTACACCAG AACCAGCTTC GGCAGCCCCA ATCACAGGCT TTGTTGACGA ATGTGAATGT
GAGTTTAGCT ACAGCAACAT TGTTACTCGT ATCCATGAAG ATCCGCGTGT GACCAAACCC
TATGCAGAAG GCGAATGGGA CACTATCAAA GCTCTTGGCT TTGCGGTCGA TAAGCAACTT
GAAGACGGAG ACGTTCGCCT AACCATGGGT GGAGAACCGA CCTTTGTCTC AATCGACGAC
ATGGATTCAG AACAATGGAA TACTGGCGCA TTGGGCGCAG AAAAATTAAA ACTAGCCAAA
GATTTGCTAA TTAAAATGAA GCAAGAATTT GGTGCTAACG GCCTACTCCA TTATGGGCAA
GGCAAATGGT ATCCGGGCGA AGAAGTACCA CGCTGGGCGC TGGGTTGTTT CTGGCGCACT
GACGGCGAAG CCTTGTGGAA TGATCCAAAA TATTTAGCTC GTGTAGACAA AGACTACAAA
CACACCATCA AAGACGCCCA AGCGTTTGGC GAATTGTTGT GTGAAAAACT CGCCTTGGAT
AAGCAATACC TTCAAAGCAC CTACGAAGAC ACGCTGTATT ATTTATGGAT GGAGCAATCG
CTTCCTGCGG ATGCAGATCC AACCAAAGCC GATTTAAAAG ATGATCTAGA ACGCCGTCGT
CTAGCAAAAC TGCTTAGCCG TGGTTTAAGC ACCACAACAG GTTTCGTTTT GCCGTTAGAA
TTCGATACGG TAAACAATCG TTGGAATAGC TCCCTTTGGC CAATGCGCAG TGACGTTATC
ACCTTAATTC CGGGCGACAG CCCAATGGGT TATCGTTTAC CACTAAATTC TTTACCTGCT
CAAGCTGAAG AAGATCGCAT CCCTGAGCGT GACCCGTTTG ATCCTCGCCA ACCTTTGGCG
AACAAAAAAG ACAACGCGGT ATTAGAAAGC GTTGCCAAGC AGCATTTCGC TAAAAAACCA
GCAGCACCAG CCTTAGCGCC AGAAAAAACG TTGAAAAACG TAATTCGCAC TACCTTGTGT
ATCGAACCAC GCGATGGCCG CTTGCATATA TTTATGCCGC CAATGACACA TCTTGAGCAC
TTTGTAGATG TTCTTGAGCA GCTTGAAGCG GTTGCGAAAA CCCTTGATAA ACCAATCGTT
GTCGAAGGTT ATGAGCCTCC AAAAGATCCG CGCTTGCAAA AGTTCCTGAT CACGCCAGAT
CCAGGTGTTA TCGAAGTAAA TATTCATCCT GCCGCGAGCT GGAAAGAACT GGTTCACAAT
ACCGAAACCT TGTATCACCA AGCTTACCTT TCACGCCTTG GCGCAGAAAA ATTCATGCTA
GATGGTCGCC ATACTGGCAC AGGTGGGGGT AATCACGTAA CGCTAGGTGG TCGCACACCA
GCAGACAGTC CGCTGTTGCG CCGCCCTGAG TTGTTACAGA GTTTAGTGAC CTTCTGGCAG
CATCATCCAG GTTTGTCTTA TCTTTTCTCT GGCATGTTCA TTGGCCCAAC TAGCCAAGCA
CCTCGCCCAG ACGAAGGACG AGATGAAGCT CTGTATGAAA TGGAAATTGC TTTCCAAAAT
ATGCCTGAAG GCTTTGTTGA AGAACCTTGG TTAGTTGACC GCTTAATGCG TAACTTACTG
GTCGATATCA CAGGTAACAC CCATCGTTCT GAGTTCTGCA TAGATAAGCT CTACGCAGCT
GGCAGCGCCA GCGGTAGACA AGGTTTATTG GAATTCCGTG GTTTTGAAAT GCCGCCACAT
CCGCATATGT CGCTCGTACA AATGCTGCTG TTGCGCTGTC TCGTGGCCCG TTTCTGGAAA
GAACCGTACA AAAAGCCTTT AGTTCGTTGG GGCACTAGTT TGCATGACAA ATTCATGCTG
CCGCATTTTG TTTGGCAAGA CGTGAAAGAA GTGGTCGAAG ATCTTCAGCG CCATGGTTTC
CCATTCAAAC TAGAATGGTT AGCACCATTT GAAGAGTTCC GTTTCCCTCA TTATGGTCGT
CAAAAAATCG ATGACATGGA AATCGAGCTT CGTTGGGCAA TTGAGCCTTG GCACGTTCTA
GGCGAAGAAA TAACCGGATC GGGTACTGCT CGCTATGTAG ATTCTTCCGT AGAACGTTTG
CAGGTAAAAT TATCTGGCTT AACCGATGGA CGTTATGTAC TTAGCTGTAA TGGCCGTCGT
GTGCCGATTC GTTCTACCGG TCGCAAAGGC GAATACGTGG GTGCTGTGCG TTACAAAGCC
TGGGCACCAC CATCTGCTCT GCACCCAACA CTAGGGACAG ACACGCCACT AATATTTGAT
TTAATCGATA CCTGGAATGG CTTATCTGTG GGCGGTTGTA CTTATCATGT GTCTCACCCA
GGTGGACGTA CGTATGACAA TGTACCGGTC AATAGCAACG AAGCGGAAGC TCGTCGTGTG
AACCGTTTTT GGGATCACGG CTTCACTCAA GGCACGCTAT CTCCACCGCC AGCCTTCAGT
GCATTGCGCT CTTTTTACCC AAATGGAGAC GAGCCTCGTG CCATGTCGCC ACCAGCAGAA
GAGCCAATGA ATGAATATCC GCATACATTA GATCTAAGGA AGCAGTATAA TGTTCTCTAA
 
Protein sequence
MTIRVAIQHK TTYEFDRFIN VAPHVLRLQP AAHSRTKIHA YSLKVLPETH FINVQQDPFG 
NFQTRLVFPE KTNKLEFYVE VIADMTVINP FDFFVESYAE EYPFAYDKAL KKELEPYLQV
TESCPLLDKW LSTVDRKNTP INDFLVAINS RLAADIGYGI RLEPGVQTCE ETLTLKKGSC
RDTSWLLVQI LRSLGLAARF ASGYLVQLTA DVKALDGPSG PEEDFTDLHA WCEVFLPGAG
WVGLDPTSGL FAGEGHIPLA CTPEPASAAP ITGFVDECEC EFSYSNIVTR IHEDPRVTKP
YAEGEWDTIK ALGFAVDKQL EDGDVRLTMG GEPTFVSIDD MDSEQWNTGA LGAEKLKLAK
DLLIKMKQEF GANGLLHYGQ GKWYPGEEVP RWALGCFWRT DGEALWNDPK YLARVDKDYK
HTIKDAQAFG ELLCEKLALD KQYLQSTYED TLYYLWMEQS LPADADPTKA DLKDDLERRR
LAKLLSRGLS TTTGFVLPLE FDTVNNRWNS SLWPMRSDVI TLIPGDSPMG YRLPLNSLPA
QAEEDRIPER DPFDPRQPLA NKKDNAVLES VAKQHFAKKP AAPALAPEKT LKNVIRTTLC
IEPRDGRLHI FMPPMTHLEH FVDVLEQLEA VAKTLDKPIV VEGYEPPKDP RLQKFLITPD
PGVIEVNIHP AASWKELVHN TETLYHQAYL SRLGAEKFML DGRHTGTGGG NHVTLGGRTP
ADSPLLRRPE LLQSLVTFWQ HHPGLSYLFS GMFIGPTSQA PRPDEGRDEA LYEMEIAFQN
MPEGFVEEPW LVDRLMRNLL VDITGNTHRS EFCIDKLYAA GSASGRQGLL EFRGFEMPPH
PHMSLVQMLL LRCLVARFWK EPYKKPLVRW GTSLHDKFML PHFVWQDVKE VVEDLQRHGF
PFKLEWLAPF EEFRFPHYGR QKIDDMEIEL RWAIEPWHVL GEEITGSGTA RYVDSSVERL
QVKLSGLTDG RYVLSCNGRR VPIRSTGRKG EYVGAVRYKA WAPPSALHPT LGTDTPLIFD
LIDTWNGLSV GGCTYHVSHP GGRTYDNVPV NSNEAEARRV NRFWDHGFTQ GTLSPPPAFS
ALRSFYPNGD EPRAMSPPAE EPMNEYPHTL DLRKQYNVL