Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmwyl1_2388 |
Symbol | |
ID | 5366327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | - |
Start bp | 2699321 |
End bp | 2700493 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640804753 |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_001341245 |
Protein GI | 152996410 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGACG GAAGCAGTAA TAAAATTCGT TTGTTAGAGC AAAGCCCAGC AGTAGAAGGT CAGTTTAGCT ATTTTGTCGA AAATACAGAT TGGCAAGCAG TGACGCAATC CCATCAAGAA AAAAACGAGC AAGATGTGTT GCGAGCCTTG AGTAAGAGCA AACTAGACGT GAACGATTTT GCTGCTTTGA TTGCGCCTGC GGCAGAGCCT TATTTGTTGG AGATGGTAGC AAAAAGCGAA CAACTTACTT TACAGCGCTT TGGCAATACT TTGAGCTTAT TTGCACCCTT GTATTTATCT AATACCTGTG CGAACGAATG CACCTACTGT GGTTTTTCTA TGAGTAATGC AATCAGGCGT CTTACTCTGA ATGAAACTCA GGTGGGAAAA GAAGTCGCGG CGATTAAAGG TAAGGGCTTT GATCATATCC TCTTGGTAAC AGGGGAAACC AATAAGGTTT CCATGCCGTA CTTTGAACGC ATGATTCCCT TGATCAAGCC CCATTTCAGT CAGCTCTCTA TGGAAGTACA GCCATTAGAT GCAGATGAAT ACAAACAGCT GCAAGGTATC GGTTTGGACG GCGTGTTGGT TTATCAAGAA ACCTATCGTC GCAAAACCTA TCTTGAGCAT CATTTGCGGG GTAACAAATC AAACTTTAAC TATCGCCTCG ATGCGCCGGA CCGTATTGGG CAGGCTGGTA TTCATAAGAT TGGTTTGGGT GTGTTACTTG GCTTGGAAGA TTGGCGCACG GATTCGGTGA TGATGGCGCA TCATCTACGG CACCTGCAAA AGCGCTATTG GCGAAGCCGC TTTAGTGTGG CGTTTCCACG GATTCGCCCT TGTGAAGGAG GGATCGTTCC GAAATCTGTT ATCTCTGATC GGCAATTGGT TCAGCTTATT GCTGCTTGGC GCTTGTTTGA TCGGGATTTA GAAATGAGCC TGTCTACTCG CGAATCACCT GAATTTCGCC ATCATGCAGT GCGAATGGGG TTTACCACTA TGAGTGCAGA ATCCAAAACC CAGCCGGGTG GATACGCGGA TGACTCTCAA GAGGCCCTTG AGCAATTCGA GATCAGCGAC GAGCGACCTG TTGCTGAAAT TATGGCGATG ATTCGTGAAC AGGGGCGCGA AGTGGTGTGG AAAGACTGGG ACCCTGCACT TTCACATGTT TAA
|
Protein sequence | MIDGSSNKIR LLEQSPAVEG QFSYFVENTD WQAVTQSHQE KNEQDVLRAL SKSKLDVNDF AALIAPAAEP YLLEMVAKSE QLTLQRFGNT LSLFAPLYLS NTCANECTYC GFSMSNAIRR LTLNETQVGK EVAAIKGKGF DHILLVTGET NKVSMPYFER MIPLIKPHFS QLSMEVQPLD ADEYKQLQGI GLDGVLVYQE TYRRKTYLEH HLRGNKSNFN YRLDAPDRIG QAGIHKIGLG VLLGLEDWRT DSVMMAHHLR HLQKRYWRSR FSVAFPRIRP CEGGIVPKSV ISDRQLVQLI AAWRLFDRDL EMSLSTRESP EFRHHAVRMG FTTMSAESKT QPGGYADDSQ EALEQFEISD ERPVAEIMAM IREQGREVVW KDWDPALSHV
|
| |