Gene Mmwyl1_4404 details


Gene Information

Locus tag: Mmwyl1_4404
Symbol:
ID: 5365750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Marinomonas sp. MWYL1
Kingdom: Bacteria
Replicon accession: NC_009654
Strand:
Start bp: 5003685
End bp: 5005295
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 43%
IMG OID: 640806810
Product: extracellular solute-binding protein
Protein accession: YP_001343234
Protein GI: 152998399
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
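
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 5003685
END_BP = 5005295


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1611
print(protein_length(length_bp))  # 536
```

Both results agree with the Gene Length and Protein Length fields listed in the record.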


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGA CATTACTCGC CAGTGCTATC ATGGCTACAT CTATCTTCGC TCTACAAGCT 
CAAGCTGCAG ACGTACCAGC AGGCGTAGAA TTAGCGGCTA AACAAGAACT TGTTCGTGGT
GGTGGTGCAG AACCAGCAAC CCTTGATCCA CAAAAAATGG AAGGCACACC AGCTTCTATC
CGTTCTAAAG ATCTTTTTGA AGGTTTATAC AACCAAGATG GTGACGGTAA CCAAGTACCA
GGTGTAGCAG AAAGCTATGA CGTAAACGCC GACAACACTC AATACACCTT CCACCTTCGC
AAAGATGCCA AATGGTCTAA TGGTGACCCA GTTACCGCTG AAGACTTTGT TTACGCTTTC
ACTCGCGCAG TTGATCCTAA ATTGGCATCT CCATACGCTT GGTTTATGGA AATTCCTGCG
ATTACTAACG CCTCCAAAAT CATTGCCGGA GAAGCCGACC CTTCTACTCT TGGCGTTAAA
GCACTAGACG ACCATACTTT CCAAGTGACG CTTGAGCGTC CAGTCCCTTA CTTTGTAAAA
ATGACCTCTC ATCAAACCAT GTTCCCTGTA CCGAAAAAAG TGGTCGAGAA ATGGGGGGAT
AATTGGACAA AACCAGAGCA TATGGTATCT AACGGTGCCT ATAAAATGGA CGAATGGGTC
GTTAACGAAA AAATGGTGTT TACGCGTAAC AAAAACTATT GGAATGATGC AAAAACCATC
ATCAACAAGG TTACCTATTT ACCAATTGAA TCCCCTAATG CTGAGCTAAA ACGCTTCCAA
GCGGGTCAAA TGGATCTTAG CTACGAGATT CCAAATGATC ACTTCAAACA ACTCATGCGA
GACATTCCTG ATGAAGTTGT CGTAACACCA AAACTAGGCA CCTATTACTA CCAATTCAAC
ATAACAAAAG CACCATACAA CGATGTCCGA GTACGTAAAG CGTTGTCCTA TTCCATTGAT
CGTAACGTGA TCACAAAGTT TGTAACGGGC ACAGGTGAAC TACCTGCGTA TTCTTTCACA
CCAGAAGTAG TAAACGACTT TTCACCGGCA ACACCTGAAT ACGCAACTTG GTCACAAAAA
GAACGTGACG ATAAAGCCAG AGCGTTACTA GAAGAAGCGG GCTATGGCAA AAGCAATCCA
CTGTCTTTCA GCTTGCTTTA CAACACCAAC GAAAACCATA AAAAAATCGC CATTGCGATC
GCTTCTATGT GGAAAAAAAC CTTAGGCGTG AACGTTACTC TGGAAAACCA AGAATGGAAA
ACTTACCTAG AATCGAAAAA GCATCAGCAA TTTGATATTG CACGTGCCGG TTGGATTGGT
GACTACAACG AAGCCTCTAC TATGTTAGAT CTTCTAACCA CTACTCACGG TAATAACGAC
GGTAAATACA GCAACGCAGA ATACGACAAG CTATTGCACG ATGCGCGTAC CATGCAGCAC
CCTACAGAGA ACTACAACAA AGCAGAAGAA ATCGCTATTG AACAAGATAT GGCGGTTGCC
CCTATTTATC AATATACCGA AAAACGTTTA GTAAAAAGCT ACTTAGGCGG TTATATGCCT
AACCCAGAAG ACAACGTTTA CGTGCGCGAT ATGTACATCA TCAAGCACTA A
 
Protein sequence
MKMTLLASAI MATSIFALQA QAADVPAGVE LAAKQELVRG GGAEPATLDP QKMEGTPASI 
RSKDLFEGLY NQDGDGNQVP GVAESYDVNA DNTQYTFHLR KDAKWSNGDP VTAEDFVYAF
TRAVDPKLAS PYAWFMEIPA ITNASKIIAG EADPSTLGVK ALDDHTFQVT LERPVPYFVK
MTSHQTMFPV PKKVVEKWGD NWTKPEHMVS NGAYKMDEWV VNEKMVFTRN KNYWNDAKTI
INKVTYLPIE SPNAELKRFQ AGQMDLSYEI PNDHFKQLMR DIPDEVVVTP KLGTYYYQFN
ITKAPYNDVR VRKALSYSID RNVITKFVTG TGELPAYSFT PEVVNDFSPA TPEYATWSQK
ERDDKARALL EEAGYGKSNP LSFSLLYNTN ENHKKIAIAI ASMWKKTLGV NVTLENQEWK
TYLESKKHQQ FDIARAGWIG DYNEASTMLD LLTTTHGNND GKYSNAEYDK LLHDARTMQH
PTENYNKAEE IAIEQDMAVA PIYQYTEKRL VKSYLGGYMP NPEDNVYVRD MYIIKH
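
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC percentage calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding used for the GC content field in this record is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# The first two blocks of the gene sequence, as they appear in the record.
first_blocks = clean_sequence("ATGAAAATGA CATTACTCGC")
print(len(first_blocks))  # 20
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field.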