Gene Mmwyl1_2388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmwyl1_2388 
Symbol 
ID5366327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinomonas sp. MWYL1 
KingdomBacteria 
Replicon accessionNC_009654 
Strand
Start bp2699321 
End bp2700493 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content47% 
IMG OID640804753 
Productthiazole biosynthesis protein ThiH 
Protein accessionYP_001341245 
Protein GI152996410 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGACG GAAGCAGTAA TAAAATTCGT TTGTTAGAGC AAAGCCCAGC AGTAGAAGGT 
CAGTTTAGCT ATTTTGTCGA AAATACAGAT TGGCAAGCAG TGACGCAATC CCATCAAGAA
AAAAACGAGC AAGATGTGTT GCGAGCCTTG AGTAAGAGCA AACTAGACGT GAACGATTTT
GCTGCTTTGA TTGCGCCTGC GGCAGAGCCT TATTTGTTGG AGATGGTAGC AAAAAGCGAA
CAACTTACTT TACAGCGCTT TGGCAATACT TTGAGCTTAT TTGCACCCTT GTATTTATCT
AATACCTGTG CGAACGAATG CACCTACTGT GGTTTTTCTA TGAGTAATGC AATCAGGCGT
CTTACTCTGA ATGAAACTCA GGTGGGAAAA GAAGTCGCGG CGATTAAAGG TAAGGGCTTT
GATCATATCC TCTTGGTAAC AGGGGAAACC AATAAGGTTT CCATGCCGTA CTTTGAACGC
ATGATTCCCT TGATCAAGCC CCATTTCAGT CAGCTCTCTA TGGAAGTACA GCCATTAGAT
GCAGATGAAT ACAAACAGCT GCAAGGTATC GGTTTGGACG GCGTGTTGGT TTATCAAGAA
ACCTATCGTC GCAAAACCTA TCTTGAGCAT CATTTGCGGG GTAACAAATC AAACTTTAAC
TATCGCCTCG ATGCGCCGGA CCGTATTGGG CAGGCTGGTA TTCATAAGAT TGGTTTGGGT
GTGTTACTTG GCTTGGAAGA TTGGCGCACG GATTCGGTGA TGATGGCGCA TCATCTACGG
CACCTGCAAA AGCGCTATTG GCGAAGCCGC TTTAGTGTGG CGTTTCCACG GATTCGCCCT
TGTGAAGGAG GGATCGTTCC GAAATCTGTT ATCTCTGATC GGCAATTGGT TCAGCTTATT
GCTGCTTGGC GCTTGTTTGA TCGGGATTTA GAAATGAGCC TGTCTACTCG CGAATCACCT
GAATTTCGCC ATCATGCAGT GCGAATGGGG TTTACCACTA TGAGTGCAGA ATCCAAAACC
CAGCCGGGTG GATACGCGGA TGACTCTCAA GAGGCCCTTG AGCAATTCGA GATCAGCGAC
GAGCGACCTG TTGCTGAAAT TATGGCGATG ATTCGTGAAC AGGGGCGCGA AGTGGTGTGG
AAAGACTGGG ACCCTGCACT TTCACATGTT TAA
 
Protein sequence
MIDGSSNKIR LLEQSPAVEG QFSYFVENTD WQAVTQSHQE KNEQDVLRAL SKSKLDVNDF 
AALIAPAAEP YLLEMVAKSE QLTLQRFGNT LSLFAPLYLS NTCANECTYC GFSMSNAIRR
LTLNETQVGK EVAAIKGKGF DHILLVTGET NKVSMPYFER MIPLIKPHFS QLSMEVQPLD
ADEYKQLQGI GLDGVLVYQE TYRRKTYLEH HLRGNKSNFN YRLDAPDRIG QAGIHKIGLG
VLLGLEDWRT DSVMMAHHLR HLQKRYWRSR FSVAFPRIRP CEGGIVPKSV ISDRQLVQLI
AAWRLFDRDL EMSLSTRESP EFRHHAVRMG FTTMSAESKT QPGGYADDSQ EALEQFEISD
ERPVAEIMAM IREQGREVVW KDWDPALSHV