Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmwyl1_2758 |
Symbol | |
ID | 5367068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | - |
Start bp | 3120809 |
End bp | 3123841 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640805133 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001341606 |
Protein GI | 152996771 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAA AGAGTATGAC TCAAAAGAAT CGTCTTCAAG ATGGCGGCCG TATCGATCGC TCCAAAGTAC TGGCTTTTAC CTACAATGGT AAAAAATACA CAGGCTTCGA GGGTGACACC CTAGCGTCTG CTTTGTTAGC TAATGGTGTC GATATTATCG GCCGAAGCTT TAAATATAGC CGTCCTCGCG GCATCGTTGC TGCGGGTGCG GAAGAGCCAA ATGCGATCAT GCAAATAGGA TCGACTGAGG CAACGCAAAT TCCTAACGTG CGCGCGACTC AACAATCGCT TTATAGCGGT TTAGTGGCAG GGTCCACTAA TGGTTGGCCG AGTGTGGATA ATGACCTGAT GGGGTTGGTT GGTAAAGTTG GTGGGAAAAT GATGCCTCCG GGCTTCTACT ACAAAACTTT CATGTACCCA AAATCCATGT GGGAAACGTA CGAGTCTTAT ATTCGTAAAG CTGCAGGTCT TGGTCGCGCA CCAAAAGAAA ATGATCCTGA TATTTATGAC AAGATGAATC AACACTGTGA TGTCATCATT GTCGGTGCTG GCCCTGCTGG TTTGGCTGCA GCGTTAACCG CTGCGCGTGC GGGCGCTCGA GTGATTATCG CGGATGAGCA AAATGAATTT GGTGGTAGCT TACTAAGCAG CAAAGAATTG TTAGGTGGTA AGCCTGCTGC AGAGTGGGTT GCAGATGTTG TCAAAGAATT GTCTACATAC GACGATGTGT TGATGTTACC GAATTCTCAG GTGAACGGTT ATCACGATCA TAACTTCTTA ACCATTCAAC AGCATTGTAC TGACCAGTTT GCAGACCGTG CTCCTAATGG CCAAGTGGCT CAGCGCTACC ACCGTGTACG TGCTAAGTGG GTTGTTTTGG CTACGGGAGC CCATGAACGT CCTTTAGTAT ATGGTAACAA CGATGTGCCT GGTTGCATGG TGGCAAATGC TGTATCGACT TACATCAACC GTTACGGCGT TGTTCCTGGT AAAAACTTAG TACTAATGAC GTCTAACGAC AACGCTTACC GTACTGCGCT AGATTGGCAT GATGCAGGCC GCAACGTGGT TGCGATTGTC GATTCTCGTA AAAACCCACA CGGAACCTTT GTGGATGCAG CGCGTGACCG TGGTATCAGT ATCATTGTTG GTTCTGGGGT CATTGAGGCA CATGGCAGCA AGCGCGTATC AGGTGTGTCT ATTGCACCAT TGAACGACGC GGGTGATGCG GTTGTGGGTC CTATCATTAA GATGGCGGCT GACACAGTTG CAACTTCTGG TGGTTGGAGT CCGGTTATTC ACTTGTCTTG CCACACTGGA GCACGTCCTG TTTGGAATGA AGAGATTCTA GGTTTCACTC CGGGTGCTAC CTATCAGAAG CAATTAACTG CAGGTGCGAT TAATGGCGCG TTTTCTACCG CACAAGCTCT GTCTGAAGGT GTAAAAGCTG CATCTGATGC TTTGTCTCAG TTGGGACTCG CTAAGTCAGA TGTGGTATTG CCAGAAACCG AAGTGATTGA AGAGGGCAAA GCGATGGCTT TGTTCCATAT TCCACACACA AAACCAACAT CAAAAGCACC TAAGCAGTTC GTTGATTATC AAAACGACGT AACCGCATCT GGCATTGAGT TGGCTTGTCG TGAAGGCTTC GAGTCTATCG AGCACGTTAA ACGCTACACG GCAATGGGCT TTGGTACTGA TCAAGGTAAA CTAGGTAATA TCAACGGTAT GGCGATTGCG GCAAAAATGC TCAATCAAAC GATACCGCAA ACAGGTACGA CTATTTTCCG TCCTAACTAT ACGCCAGTAA CGTTTGGTGC TATCGCTGGA CGTGACTGTG GTGCTTTGTT TGATCCTGAG CGTTATACAG CAATGCATGC TTGGCATGTT GAGCACGGCG CGAAGTTTGA AGATGTGGGT CAATGGAAGC GTGCTTGGTA TTACCCTAAA GGCAACGAGA CGATGCAGCA AGCATTGGAT CGTGAATGTT TAGCGACTCG TAAGTCTGTA GGGATATTGG ATGCTTCTAC CTTGGGTAAA ATTGATATCC AAGGTAAAGA TGCTCGTGAA TTCCTTGGTC GTGTTTACAC CAATGCGTGG GCGAAATTAC CAGTAGGTAA ATGTCGTTAT GGCTTGATGT GTGGCGAAGA CGGCATGGTA TTCGATGACG GTGTGACATC TTGCTTGGCG GAGAATCATT TCTTGATGAC GACCACGTCT GGCGGTGCTG CACGTGTGTT GTCTTGGTTA GAGATTTATC ATCAAACCGA ATGGCCTGAA CTAGAAGTGT ATTTCAACAG TGTGACAGAT CATTGGTCAA CTATGACAAT TTCTGGCCCA AATAGCCGTA AGTTGTTAGA AAAACTGACT GATTCTGATG TATCAAAAGA AAATATGGCC TTTATGGATT GGAAACCTAT GACGGTTGCT GGCGTGCCAG CCCGAGTGTT CCGTATTTCC TTTACTGGTG AATTATCTTT TGAAATCAAC GTACAAGCTA ACTATGGTTT GCATGTATGG AAAGCATTGT TTGAAAAAGG TGCAGAGTTT AATCTGACGC CATATGGCAC GGAAACCATG CACATACTGC GAGCTGAAAA AGGTTTCATT ATTGCAGGTC AAGATACTGA TGGTTCTGTA CATCCGTTTG ACTTAGGTAT GTCTTGGGCT GTGTCTATGC AGAAACCGTT TAGCTTTATT GGTAAGCGTG GTATGCAACG TGAAGATTGC GTTCGTGAGA ATCGCAAACA GCTTGTTGGT TTGAAAACGC TAGACCCTAA AGCGGTGTTG CCAGAAGGTG CGCAAGGTGT ACTTGATCCG AAAGCACCCA TTCCAATGCC AATGGTTGGC CATGTGACGT CAAGTTACTG GAGTGCGAAC TTGAATCGTT CTGTGGCAAT GGGGTTTGTT AAAGGCGGCC TAGATAAAAT GGGTGAGAGA GTCTTCTACC CATTAGCTGA TGGCCGTGTT ATTGAAGCTG AAATTTGCAG CCCAGTATTC TTAGATCCAA AAGGGGAGCG TCAAAATGTC TGA
|
Protein sequence | MTEKSMTQKN RLQDGGRIDR SKVLAFTYNG KKYTGFEGDT LASALLANGV DIIGRSFKYS RPRGIVAAGA EEPNAIMQIG STEATQIPNV RATQQSLYSG LVAGSTNGWP SVDNDLMGLV GKVGGKMMPP GFYYKTFMYP KSMWETYESY IRKAAGLGRA PKENDPDIYD KMNQHCDVII VGAGPAGLAA ALTAARAGAR VIIADEQNEF GGSLLSSKEL LGGKPAAEWV ADVVKELSTY DDVLMLPNSQ VNGYHDHNFL TIQQHCTDQF ADRAPNGQVA QRYHRVRAKW VVLATGAHER PLVYGNNDVP GCMVANAVST YINRYGVVPG KNLVLMTSND NAYRTALDWH DAGRNVVAIV DSRKNPHGTF VDAARDRGIS IIVGSGVIEA HGSKRVSGVS IAPLNDAGDA VVGPIIKMAA DTVATSGGWS PVIHLSCHTG ARPVWNEEIL GFTPGATYQK QLTAGAINGA FSTAQALSEG VKAASDALSQ LGLAKSDVVL PETEVIEEGK AMALFHIPHT KPTSKAPKQF VDYQNDVTAS GIELACREGF ESIEHVKRYT AMGFGTDQGK LGNINGMAIA AKMLNQTIPQ TGTTIFRPNY TPVTFGAIAG RDCGALFDPE RYTAMHAWHV EHGAKFEDVG QWKRAWYYPK GNETMQQALD RECLATRKSV GILDASTLGK IDIQGKDARE FLGRVYTNAW AKLPVGKCRY GLMCGEDGMV FDDGVTSCLA ENHFLMTTTS GGAARVLSWL EIYHQTEWPE LEVYFNSVTD HWSTMTISGP NSRKLLEKLT DSDVSKENMA FMDWKPMTVA GVPARVFRIS FTGELSFEIN VQANYGLHVW KALFEKGAEF NLTPYGTETM HILRAEKGFI IAGQDTDGSV HPFDLGMSWA VSMQKPFSFI GKRGMQREDC VRENRKQLVG LKTLDPKAVL PEGAQGVLDP KAPIPMPMVG HVTSSYWSAN LNRSVAMGFV KGGLDKMGER VFYPLADGRV IEAEICSPVF LDPKGERQNV
|
| |