Gene Mmwyl1_0214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmwyl1_0214 
Symbol 
ID5367242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinomonas sp. MWYL1 
KingdomBacteria 
Replicon accessionNC_009654 
Strand
Start bp248472 
End bp250874 
Gene Length2403 bp 
Protein Length800 aa 
Translation table11 
GC content45% 
IMG OID640802556 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001339091 
Protein GI152994256 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACA CAAGAATCAA ACAACTCTTA GCCGAACTCA CCTTAGAAGA AAAAGCCATG 
CTCTGTACTG GCAAGAACTT TTGGAATCTA CATGGTATTG AACGCTTAGA CTTACCTTCC
ATCATGGTGA CTGATGGGCC CCATGGTTTA CGTAAACAAG CAGGCGAAGG CGATCATGTC
GGCCTAAACG CAGCGGTAAA AGCCACTTGT TTCCCAACAG CATCAGGGCT TGCGGCATCT
TGGAATACCA AGCTTATTGA AGAAATGGGG ATAGCGCTTG GCAAAGAATG CCGCGCCGAA
GGTGTGTCTG TCTTGCTTGG ACCTGGCACC AATATTAAAC GTAACCCTTT AGGTGGTCGT
AACTTCGAAT ACTTTTCCGA AGATCCATTA CTTGCAGGCG ATATGTCGGC CGCGTGGATC
AAAGGCGTAC AAAGCCAAGG CGTGGGCACC AGCTTAAAAC ATTTCGCGGT GAACAATCAC
GAACACTGCC GTATGACAGT CGATGCTATC GTCGACCAAA GAACTCTTCG TGAGATTTAC
CTACCCGCTT TTGAAAGAGC GGTTACACAG ACACAACCTT GGACCATCAT GTGCTCCTAC
AACAAGGTAA ATGGCACCTA TGCGGCAGAA CACAAACAAC TGCTTGATGA CATTCTTGTT
AACGAATGGG GATTTGAAGG CATTGTCGTA ACCGATTGGG GCGCAAATAA TGACCGTGTA
GAAGGCGTTA AAAACGGTCA GCATCTTGAG ATGCCAAGCT CTGGTGAGAT GAATACTAAG
AAGATCATTA CTGCTGTGGA GAACGGCCAG CTAACGATAG AAGCTTTAGA TAAATCCGTC
GCTCGAGTAT TGGAACTGAT TCTGAAATCA CAAGCTTCTC TAGAAGCCAA TACAGTTCAA
GCAGACTTAC AAGCTCACCA TGAACTCGCA GCAAGAATCG CAGAAGAAAC CTGTGTACTA
TTGAAAAATG ATGGCTTATT ACCAGTTCCC GTCGGAAAGA AAATCGCCGT TATTGGCGCT
TTAGCCAACA ACACTCGTTA CCAAGGCGCG GGCAGCTCTA AAATTAATCC ATTCAAATTG
GAACAGCCTC TCGACGAAAT TAAAAAACAA TTTGGTAATG ATAACGTCAG TTATACTGCG
GGTTATCACC TTAATGATGC GGAAGATAGT ACCGAGATAG CCAAGGCGAT TGAGTTGGCC
AAACAAGCAG ATGTAGTCTT TTTGTTTGCC GGGCTAACGC CAAAATATGA ATCCGAAGGC
TTTGACCGCC AGCACCTGAA TCTACCACAA GTCCAGCTGG ACCTTATAGC GGCTCTTGGC
GAACAGCTCA ATAAAACCGT GGTTGTTTTA CAGAATGGCG CCCCCGTATT ATTGCCTTTC
GTAGATAAAA CCGCCGCCAT ACTTGAAGCC TATTTAGGCG GCCAAGCAGG CGCGTCAGCG
ATTGCAAAAA TCTTGTCTGG TGAGGTCAAC CCAAGTGGTA AATTGGCCGA AACCTTCCCT
GCCAATTTAG AAGATGTGCC AAGCCAACCC TATTTCCCAG GTACCACTAA GCAAGTTCAA
TACCGCGAGG CGATTTGGGT AGGTTATCGT TATTTCAATA CCACTAAAAC GAAAGCGCTT
TTCCCATTCG GTCATGGTTT GTCCTATACC AATTTCACTT ACTATCATTT GAAACTGCTC
AGCGGTGATA AAACGCAATT TAATGAAAGC GATTCCATCA AGCTGCAAAT TACCATTAGC
AACATGGGTG ATAAGGCTGG TGCAGAAGTC GTGCAGTGTT ATGTGGGACA AAAATCTCCG
TCACAACCTC GCCCAGCAAA AGAGCTCAAA GCCTTTGAGA AAGTCTTTTT AGAGCCGGGT
GAGAGCAAAG AAGTTATGTT CGAACTGGAT TATCGAGCCT TCGCTTATTG GCACAAAGAA
AAAGCGGCTT GGATTGCCGA AAGTGGTGAC TATCAAATTC ATATTGGCTC TTCTGTTAAT
GACATTCGAG ATACCGCGAC AATCGCCTTA GAGACAGGCA TCGTAGCCGA GCAACCAAAT
TGCGCTTTGG CTACGTATTT TGATCCCGCT AAACACGACT TCAACGACGC AGCTTTTAGT
GCTTTATTGG GTTACGACAT TCCAACACCA ACACCCATTA CGCCTTACAC AGTGAATTCG
ACCTTAGGCG ACATTGCCAA CGAGCCATTA GGAAAGCCTC TTTTCGATGA CATGCTATCT
GTGTTTACAC AAATGATGGG GGGAGATCAA AACAACTCAG CAGCCGAAGC AGATCGTCTG
ATGGCGGAAT CTATGGTGGC CGATATGCCA TTACGTAATT TACCTGTATT CAACGGTCAG
CAATATAGCG AAGAAGACAT TTTGATATTA ATCCATTCAA TGAATAGTCG CTCAATGATG
TAA
 
Protein sequence
MSDTRIKQLL AELTLEEKAM LCTGKNFWNL HGIERLDLPS IMVTDGPHGL RKQAGEGDHV 
GLNAAVKATC FPTASGLAAS WNTKLIEEMG IALGKECRAE GVSVLLGPGT NIKRNPLGGR
NFEYFSEDPL LAGDMSAAWI KGVQSQGVGT SLKHFAVNNH EHCRMTVDAI VDQRTLREIY
LPAFERAVTQ TQPWTIMCSY NKVNGTYAAE HKQLLDDILV NEWGFEGIVV TDWGANNDRV
EGVKNGQHLE MPSSGEMNTK KIITAVENGQ LTIEALDKSV ARVLELILKS QASLEANTVQ
ADLQAHHELA ARIAEETCVL LKNDGLLPVP VGKKIAVIGA LANNTRYQGA GSSKINPFKL
EQPLDEIKKQ FGNDNVSYTA GYHLNDAEDS TEIAKAIELA KQADVVFLFA GLTPKYESEG
FDRQHLNLPQ VQLDLIAALG EQLNKTVVVL QNGAPVLLPF VDKTAAILEA YLGGQAGASA
IAKILSGEVN PSGKLAETFP ANLEDVPSQP YFPGTTKQVQ YREAIWVGYR YFNTTKTKAL
FPFGHGLSYT NFTYYHLKLL SGDKTQFNES DSIKLQITIS NMGDKAGAEV VQCYVGQKSP
SQPRPAKELK AFEKVFLEPG ESKEVMFELD YRAFAYWHKE KAAWIAESGD YQIHIGSSVN
DIRDTATIAL ETGIVAEQPN CALATYFDPA KHDFNDAAFS ALLGYDIPTP TPITPYTVNS
TLGDIANEPL GKPLFDDMLS VFTQMMGGDQ NNSAAEADRL MAESMVADMP LRNLPVFNGQ
QYSEEDILIL IHSMNSRSMM