Gene Mmwyl1_4105 details

Gene Information

Locus tag: Mmwyl1_4105
Symbol:
ID: 5368425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Marinomonas sp. MWYL1
Kingdom: Bacteria
Replicon accession: NC_009654
Strand:
Start bp: 4639530
End bp: 4641290
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 49%
IMG OID: 640806498
Product: dihydroxy-acid dehydratase
Protein accession: YP_001342936
Protein GI: 152998101
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
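
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 4639530
end_bp = 4641290
gene_length_bp = 1761
protein_length_aa = 586

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields are mutually consistent: 1761 bp corresponds to 587 codons, one of which is the stop codon.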


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000339797
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGTGATT CTGAGAAAGG CACGTTCGTA CCGAAAACAT GGCCCCGCAA ATTACGTTCA 
ACCGAATGGT TTGGCGGTAC TTCACGTGAT CATATTTACC ACCGCAGCTG GATGAAAAAC
CAAGGCTTGC CGGCGGACTT ATTTGATGGT CGTCCAGTGA TTGGCATTTG TAATACTTGG
TCGCAGCTTA CACCTTGCAA CGCCCATCTT CGTGACCTCG CTGATCGTGT TAAGCATGGT
ATTTATGAAG CGGGCGGTTT GCCACTGGAA TTCCCTGTGT TTTCTGTGGG TGAGAGCTCA
CTGCGTCCTA CTGCGATGAT GTATCGCAAC ATGGCAGCAA TGGATGTAGA AGAAGCGTTG
CGAGCCAATC CGCTCGACGG TGTTGTCTTG TTGGCGGGTT GTGACAAAAC CACACCAGCG
CTGCTTATGG GCGCGTGTAG TGTCGATATT CCAGCGATTG TTGTGTCTGG TGGCCCGATG
TTGAATGGCT ACTTCAGAGG CGAGCGTGTG GGTTCTGGTA CAGCCTTGTG GCAAATGTCG
GAAGACATTA AAGCGGGCAA GATGACGCAA GAAGACTTCC TTGAGGCAGA GCAATCTATG
TCTCGCTCTG CGGGCTCTTG TAACACTATG GGCACTGCCA GCACGATGGC GTCTATGGCG
GAAGCCTTGG GTATGGCGTT ATCGGGTAAC GCAGCGATTC CTGCAGTAGA CAGCCGTCGT
CGCGTTATGG CGCATCTTAC AGGCCGTCGC ATCGTCGATA TGGTGAAAGA CGATTTGAAA
CCGTCTGATA TTTTGACTCG CCAAGCCTTC GAAAACGCGA TTCGTGTCAA TGGCGCGATT
GGTGGTTCTA CCAATGCTGT GATTCACTTA TTGGCTATTG CAGGTCGTGT TGGTGTGGAT
TTAACCCTAG ATGATTGGGA TAAATTGGGC CAAGAAATCG CCACTATTGT GAACCTGATG
CCGTCGGGCA AATATCTGAT GGAAGAATTC TTCTACGCTG GTGGTTTGCC AGTAGTGATC
AAGCATTTGG CCGAAGCGGG CAAGCTGCAC AAAGATGCAA TTACCGTGTC TGGTGAATCG
ATTTGGGAAG AAGTGAAAGA GGTCCGCAAT TGGAATCCAG ACGTGATTTT GCCAGTTGAA
AAAGCACTGA CACAAAAAGG CGGTATTGTT GTATTGAAAG GCAACCTTGC GCCACAAGGC
GCGGTGCTCA AACCTTCCGC GGCGTCGGCG CACTTATTGC AGCATCGCGG TCGAGCTGTC
GTGTTTGAAG ATATTGACGA CTACAAAGCT AAGATCAATG ACGAAGCTTT GGATATCGAT
GAAAACTGTG TCATGGTGCT AAAAAACTGT GGGCCAAAAG GTTACCCAGG CATGGCGGAA
GTGGGCAACA TGGGTTTGCC TCCAAAGGTG CTGCGTAAAG GCATTAAAGA TATGGTGCGG
ATTTCTGATG CTCGTATGTC GGGAACGGCT TATGGAACTG TAGTTTTACA TACCACACCA
GAAGCTGCAG CAGGCGGGCC ATTAGCCGTT ATTCAAAATG GCGACATAAT TGAGTTAGAC
GTAGAAAATC GCCGCCTTCA TGTGGATATT TCCGATGAAG AAATGGCAAC ACGTTTAGCA
AACTGGAAAA GTCATTTGGA ACCACCAAAA AGTGGTTATA TCAAACTATT TCATGATCAC
GTACAGGGCG CTGATACCGG TGCCGATTTT GACTTCTTAA AAGGTTGTCG AGGCGCTGCG
GTGCCAAAAG ATAGTCATTA A
 
Protein sequence
MSDSEKGTFV PKTWPRKLRS TEWFGGTSRD HIYHRSWMKN QGLPADLFDG RPVIGICNTW 
SQLTPCNAHL RDLADRVKHG IYEAGGLPLE FPVFSVGESS LRPTAMMYRN MAAMDVEEAL
RANPLDGVVL LAGCDKTTPA LLMGACSVDI PAIVVSGGPM LNGYFRGERV GSGTALWQMS
EDIKAGKMTQ EDFLEAEQSM SRSAGSCNTM GTASTMASMA EALGMALSGN AAIPAVDSRR
RVMAHLTGRR IVDMVKDDLK PSDILTRQAF ENAIRVNGAI GGSTNAVIHL LAIAGRVGVD
LTLDDWDKLG QEIATIVNLM PSGKYLMEEF FYAGGLPVVI KHLAEAGKLH KDAITVSGES
IWEEVKEVRN WNPDVILPVE KALTQKGGIV VLKGNLAPQG AVLKPSAASA HLLQHRGRAV
VFEDIDDYKA KINDEALDID ENCVMVLKNC GPKGYPGMAE VGNMGLPPKV LRKGIKDMVR
ISDARMSGTA YGTVVLHTTP EAAAGGPLAV IQNGDIIELD VENRRLHVDI SDEEMATRLA
NWKSHLEPPK SGYIKLFHDH VQGADTGADF DFLKGCRGAA VPKDSH