Gene Mmwyl1_1080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmwyl1_1080 
Symbol 
ID5366236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinomonas sp. MWYL1 
KingdomBacteria 
Replicon accessionNC_009654 
Strand
Start bp1204666 
End bp1206480 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content45% 
IMG OID640803421 
Productphosphogluconate dehydratase 
Protein accessionYP_001339946 
Protein GI152995111 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00217258 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000267621 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCCGA TTGTTGCTAA GGTAACGAAC GACATCATTG AACGAAGCAA GGTGTTGCGC 
GAGCAATACC TTAAAGATAT GAAAAAAGCG CAAGAGCAAG GGCCACACAG GGGGAAATTG
TCCTGTGGGA ATTTGGCTCA TGGCTTCGCA GCATGCCAAC CTCAAGATAA ACAAAAGCTG
ACTCTAATGG AAGAAGCGAA CATAGGTATC ATATCTTCTT ATAACGATAT GTTGTCTGCT
CATCAGCCTT ATGAAAATTA CCCCGATCAA ATTCGTGCAG CGGTTAAAGA AATGGGCTCT
GTGGCTCAGT TCGCTGGCGG TGTTCCCGCT ATGTGTGATG GTGTTACGCA AGGGCAAGAC
GGTATGGAAC TTAGTCTATT TAGCCGTGAC AATATCGCAC AAGGTGCGGC AATAGCGCTT
TCTCATAATA TGTTTGATGC GGCTATCTAT TTAGGTATTT GTGACAAGAT TGTTCCTGGC
TTGTTGATTG CTGCACTGCG TTTTGGTCAT TTGCCAGCTT TGTTTATTCC AGCCGGTCCA
ATGCGCTCAG GTATTACTAA CGCAGCTAAA GCGGCGGTTC GTCAGCGTTA CGCTCAGGGT
CAAGCGACTC GTGAGGAGTT ACTTGAAGCT GAATCAGCTT CTTATCACAG CGCGGGTACT
TGTACTTTTT ATGGTACGGC TAACTCAAAT CAGTTATTAG TCGAAATTAT GGGGCTTCAG
CTTCCTGGTT CTTCTTTTGT TAATCCAGAT GATCCACTAC GAGGTGCATT AACTAACTAT
GCTTCTCAGT TATCTACCAA GATTACGGCA TTGGGAAGAG ACTACCGACC ACTCTACGAA
ATCGTAGATG AGCGTAGCAT TGTCAACGCT ATTGTTGGTT TGCTGGCAAC GGGTGGTTCG
ACTAACCACA CTATGCATAT TGTCGCTTAT GCTCGTGCTG CTGGCATTAT TATCACTTGG
GATGACTTCT CCGCTCTTTC CAAAGTTGTG CCATCACTGA CTAAAATCTA TCCGAATGGC
CAAGCAGATA TTAACCATTT CCACGCGGCT GGCGGCATGG CGTTTCTGGT TAAGCAATTA
CTGAAAGGTG GTCTATTGCA TGAAGATGTA AACACTATCG TGGGTAAAGG TTTAACTCAT
TACACCAAAG AGCCTTTTTT AGAAGGTGAT AATCTTGTGT GGCGTGATGG TACAGATGAA
AGTTTGGACC TAAATGTTGT ACGTCCTATC GAAGACCCTT TCAGTAAAGA AGGTGGGTTG
GCGCTTCTTA AAGGTAACCT TGGTCGTTCA GTGATTAAAG TTTCAGCCGT GAAAGATGAA
AATCGAATTA TCGAAGCACC CGCTGCTGTG TTTCATAGTC AAAATGCATT GGCTAATGCT
ATTGCTAGTG GTGAATTGAA ACGTGACTGT GTGGCTGTTG TTCGTTTCCA AGGGCCAAAA
GCTTTAGGTA TGCCAGAGTT GCATAAACTG ACTCCTTACC TTGGTAATTT GCAAGATCAA
GGCTATCGAG TGGCGTTAGT GACTGACGGT CGTATGTCTG GTGCATCTGG TAAAGTACCT
GCCGCAATTC ACTTGACGCC TGAAGCATTG GCCGGTGGTT TAATTGCCAA GATTCAAGAT
GGCGACATGA TCCGCTTAGA TGCAATTGAA GGTACTTTGT CTGTGTTGAT CTCAGACGAA
GAGCTAAATA AACGCGAAGC AGCACAACAA GATTTAACCG ATTCCCATCA AGGTATGGGA
CGTGAACTAT TTAGCGCACA ACGTATTTTA GTAAGTGGCG CAGAACAAGG TGCTTGCAGC
TTGTTTAATG ATTAA
 
Protein sequence
MNPIVAKVTN DIIERSKVLR EQYLKDMKKA QEQGPHRGKL SCGNLAHGFA ACQPQDKQKL 
TLMEEANIGI ISSYNDMLSA HQPYENYPDQ IRAAVKEMGS VAQFAGGVPA MCDGVTQGQD
GMELSLFSRD NIAQGAAIAL SHNMFDAAIY LGICDKIVPG LLIAALRFGH LPALFIPAGP
MRSGITNAAK AAVRQRYAQG QATREELLEA ESASYHSAGT CTFYGTANSN QLLVEIMGLQ
LPGSSFVNPD DPLRGALTNY ASQLSTKITA LGRDYRPLYE IVDERSIVNA IVGLLATGGS
TNHTMHIVAY ARAAGIIITW DDFSALSKVV PSLTKIYPNG QADINHFHAA GGMAFLVKQL
LKGGLLHEDV NTIVGKGLTH YTKEPFLEGD NLVWRDGTDE SLDLNVVRPI EDPFSKEGGL
ALLKGNLGRS VIKVSAVKDE NRIIEAPAAV FHSQNALANA IASGELKRDC VAVVRFQGPK
ALGMPELHKL TPYLGNLQDQ GYRVALVTDG RMSGASGKVP AAIHLTPEAL AGGLIAKIQD
GDMIRLDAIE GTLSVLISDE ELNKREAAQQ DLTDSHQGMG RELFSAQRIL VSGAEQGACS
LFND