Gene Mmwyl1_4349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmwyl1_4349 
Symbol 
ID5367514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinomonas sp. MWYL1 
KingdomBacteria 
Replicon accessionNC_009654 
Strand
Start bp4938635 
End bp4939882 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content46% 
IMG OID640806755 
Productextracellular solute-binding protein 
Protein accessionYP_001343179 
Protein GI152998344 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000381076 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.000997313 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAA TTGCTTTCGG TTTAACCACA GTTGCACTTT CTCTAGCAGC GGCTACGGCT 
CACGCTGGTG AAGTAGAAGT TCTACATTGG TGGACATCAG GTGGCGAAGC CGCTGCGATC
AATGTACTTA AAGAAGAAAT GGTAGACGCT GGCCACACTT GGAAAGACTT CGCAGTCGCT
GGTGGCGGTG GTGAATCTGC TATGACCGTT CTAAAATCTC GTGCCATTTC TGGCAACCCA
CCATCTGCTG CTCAAATCAA AGGCCCAACC ATTCAAGAAT GGGGTGACCT TGGCTTCCTA
ACAAATCTTG ATGATGTTGC TAAAGCGGGC GAATGGGACG TTATCCTTCC TCAAGTTGTT
AGTAACGTAA TGAAGTACGA CGGCCACTAC GTAGCGGCTC CAGTAAACGT TCACCGTGTA
AACTGGATGT GGGCAAACCC TGAAGTATTC CGTAAATCAG GCGCGTCTAT CCCAACCACT
TGGGAACAAT TCATTGTTGA AGCGAAAAAA ATTAAAGCTG CTGGTTTCAC ACCACTTGCT
CACGGTGGTC AAAACTGGCA AGACGCTACT CTATTCGAAG CAATCGCTTT GGCTAAAGGC
GCTAAATTCT ACAACAGCGC GTTCATCGGC TTGTCTGATA AGACACTACG TAGCCAAGAC
ATGATCGACG TATTTGATAC TTTCAAACAA ATGCGCGAAT TCGTTGATCC AGGTTTCTCT
GGCCGCGATT GGAACGTAGC AACCTCTATG GTTATCAATG GTGATGCTGC GATGCAAATC
ATGGGTGACT GGGCGAAAGG CGAGTTCACA GCAGCAGGAA AAAAAGCCGG TATCGATTAC
GTTTGTTACC CAGCGCCAGG CACAAGCGGT GCTTTCACTT TCAACATCGA TTCTCTAGCG
ATGTTTAAAG TAGATGGCAA AGACAAACAA GCAGCTCAAA AAGACCTTGC TCGTTTGATC
CTAGAACCTA AGTTCCAAGA AACCTTCAAC TTGAACAAAG GCTCTATTCC AGTTCGTTTG
AATATGCCTC GCGAGAAATT TGATACTTGT GCACATTCTT CAATGGATGC GTTTTTAGCA
AGTTCAACGA CTGGCAACCT AGTACCAAGT ATGGCTCACG GCATGGCTGT GAACTCTATG
GTTCAAGGCG CTATCTTTGA CGTGGTGACT AACTTCTTCA ATGACGAATC TATGACGTCT
AAAGAAGCAG TCGACAAGCT AGCTCGCGCA GTTAAAGCCA GCATGTAA
 
Protein sequence
MKKIAFGLTT VALSLAAATA HAGEVEVLHW WTSGGEAAAI NVLKEEMVDA GHTWKDFAVA 
GGGGESAMTV LKSRAISGNP PSAAQIKGPT IQEWGDLGFL TNLDDVAKAG EWDVILPQVV
SNVMKYDGHY VAAPVNVHRV NWMWANPEVF RKSGASIPTT WEQFIVEAKK IKAAGFTPLA
HGGQNWQDAT LFEAIALAKG AKFYNSAFIG LSDKTLRSQD MIDVFDTFKQ MREFVDPGFS
GRDWNVATSM VINGDAAMQI MGDWAKGEFT AAGKKAGIDY VCYPAPGTSG AFTFNIDSLA
MFKVDGKDKQ AAQKDLARLI LEPKFQETFN LNKGSIPVRL NMPREKFDTC AHSSMDAFLA
SSTTGNLVPS MAHGMAVNSM VQGAIFDVVT NFFNDESMTS KEAVDKLARA VKASM