Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmwyl1_4349 |
Symbol | |
ID | 5367514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | + |
Start bp | 4938635 |
End bp | 4939882 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640806755 |
Product | extracellular solute-binding protein |
Protein accession | YP_001343179 |
Protein GI | 152998344 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000381076 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.000997313 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTGCTTTCGG TTTAACCACA GTTGCACTTT CTCTAGCAGC GGCTACGGCT CACGCTGGTG AAGTAGAAGT TCTACATTGG TGGACATCAG GTGGCGAAGC CGCTGCGATC AATGTACTTA AAGAAGAAAT GGTAGACGCT GGCCACACTT GGAAAGACTT CGCAGTCGCT GGTGGCGGTG GTGAATCTGC TATGACCGTT CTAAAATCTC GTGCCATTTC TGGCAACCCA CCATCTGCTG CTCAAATCAA AGGCCCAACC ATTCAAGAAT GGGGTGACCT TGGCTTCCTA ACAAATCTTG ATGATGTTGC TAAAGCGGGC GAATGGGACG TTATCCTTCC TCAAGTTGTT AGTAACGTAA TGAAGTACGA CGGCCACTAC GTAGCGGCTC CAGTAAACGT TCACCGTGTA AACTGGATGT GGGCAAACCC TGAAGTATTC CGTAAATCAG GCGCGTCTAT CCCAACCACT TGGGAACAAT TCATTGTTGA AGCGAAAAAA ATTAAAGCTG CTGGTTTCAC ACCACTTGCT CACGGTGGTC AAAACTGGCA AGACGCTACT CTATTCGAAG CAATCGCTTT GGCTAAAGGC GCTAAATTCT ACAACAGCGC GTTCATCGGC TTGTCTGATA AGACACTACG TAGCCAAGAC ATGATCGACG TATTTGATAC TTTCAAACAA ATGCGCGAAT TCGTTGATCC AGGTTTCTCT GGCCGCGATT GGAACGTAGC AACCTCTATG GTTATCAATG GTGATGCTGC GATGCAAATC ATGGGTGACT GGGCGAAAGG CGAGTTCACA GCAGCAGGAA AAAAAGCCGG TATCGATTAC GTTTGTTACC CAGCGCCAGG CACAAGCGGT GCTTTCACTT TCAACATCGA TTCTCTAGCG ATGTTTAAAG TAGATGGCAA AGACAAACAA GCAGCTCAAA AAGACCTTGC TCGTTTGATC CTAGAACCTA AGTTCCAAGA AACCTTCAAC TTGAACAAAG GCTCTATTCC AGTTCGTTTG AATATGCCTC GCGAGAAATT TGATACTTGT GCACATTCTT CAATGGATGC GTTTTTAGCA AGTTCAACGA CTGGCAACCT AGTACCAAGT ATGGCTCACG GCATGGCTGT GAACTCTATG GTTCAAGGCG CTATCTTTGA CGTGGTGACT AACTTCTTCA ATGACGAATC TATGACGTCT AAAGAAGCAG TCGACAAGCT AGCTCGCGCA GTTAAAGCCA GCATGTAA
|
Protein sequence | MKKIAFGLTT VALSLAAATA HAGEVEVLHW WTSGGEAAAI NVLKEEMVDA GHTWKDFAVA GGGGESAMTV LKSRAISGNP PSAAQIKGPT IQEWGDLGFL TNLDDVAKAG EWDVILPQVV SNVMKYDGHY VAAPVNVHRV NWMWANPEVF RKSGASIPTT WEQFIVEAKK IKAAGFTPLA HGGQNWQDAT LFEAIALAKG AKFYNSAFIG LSDKTLRSQD MIDVFDTFKQ MREFVDPGFS GRDWNVATSM VINGDAAMQI MGDWAKGEFT AAGKKAGIDY VCYPAPGTSG AFTFNIDSLA MFKVDGKDKQ AAQKDLARLI LEPKFQETFN LNKGSIPVRL NMPREKFDTC AHSSMDAFLA SSTTGNLVPS MAHGMAVNSM VQGAIFDVVT NFFNDESMTS KEAVDKLARA VKASM
|
| |