Gene Mmwyl1_2600 details


Gene Information

Locus tag: Mmwyl1_2600
Symbol:
ID: 5365986
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Marinomonas sp. MWYL1
Kingdom: Bacteria
Replicon accession: NC_009654
Strand:
Start bp: 2941082
End bp: 2942428
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 45%
IMG OID: 640804973
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001341448
Protein GI: 152996613
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
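
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_record` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes inclusive coordinates and one terminal stop codon that is
    not part of the protein (both are assumptions, not stated on the page).
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_length_bp // 3      # whole codons, including the stop
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_matches_codons": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
result = check_record(2941082, 2942428, 1347, 448)
for name, ok in result.items():
    print(name, ok)
```

With the values of this record, 2942428 - 2941082 + 1 = 1347 bp, and 1347 / 3 = 449 codons, one of which is the stop codon, leaving 448 residues.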


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.452677
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000258304
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCACAGT GGAACCTAAC AAGTTGGAGA GAAAAGACGG CGCTACAACA GCCTGTTTAT 
CCTAATGCAG AGCATTTGGC TCAAGTTGAA AATACATTAG GAAAAATGCC TCCTCTTGTT
TTTGCGGGTG AGGCGCGTCA GCTTAAAAAA GCTTTAGCAC AGGTGGCCAA TAGGCAGTCA
TTTTTGCTGC AAGGTGGTGA TTGTGCTGAA AGCTTTGCTG AGTTCCATGC TAACAATATT
CGCGATACGT TTAAAGTTAT GCTGCAAATG GCGGTTGTGT TGACTTATGC TGGTAAATGC
CCAGTAGTAA AGGTTGGGCG CATGGCTGGG CAATTTGCTA AGCCGAGATC GTCAGGTAGT
GAAGTAATTG GTGGTATTGA ATTGCCTAGT TACCGTGGTG ATATCATCAA TGGTATCGAT
TTTACTGAGC AAGCAAGGGT TCCTGATCCT GAGCGTTTGG TGCAAGTTTA CAATCAGAGT
GCATCGACAA TGAACTTGCT TCGAGCTTTT GCTCAGGGTG GATTTGCAGA TTTACATCAA
GTGCATCAAT GGAATTTGGA CTTTTTGAAT GCGAGTCCAG CGGGTAGTCG TTTCCAGGGC
GTGGCTGACA AGATTGATGA CGCGCTTCAG TTTATGGAGG CGTGTGGTAT TGGTCCTGGT
TTAGCTCAGT TAAAAGAGAC AGATTTTTAT ACTTCCCATG AGGCGTTGTT GTTGCCTTAT
GAGCAGGCTT TGACTCGTAA AGATAGTCTG ACTGGTGACT GGTATGATTG TTCTGCGCAC
ATGTTATGGA TTGGAGATCG TACTCGTCAA TTAGATGGCG CCCATGTAGA GTTTTTGCGT
GGTGTACAAA ACCCAATTGG CGTAAAGGCT GGCCCTACTA TGGATCCAGA AGATTTACTA
AGATTGTGTG ATGTGCTGAA TCCAAATAAC GAAGCGGGTC GTTTGAATAT TATTGTGCGT
ATGGGGGCAG ATAAAGTCGA AGACGGCATG CCTAAGCTCA TTCAAGCAAT TCAGCGCGAA
GGTAAGCAGG TTGTGTGGAG TAGTGACCCG ATGCACGGCA ACACTGTAAA AGCGTCTACG
GGGTATAAGA CTCGACGTGT CGATGATGTG TTAAAAGAGG TGCAGCAGTT CTTCCAAGTT
CATAACGCTG AAGGTAGCTA TGCAGGCGGC GTTCATTTTG AAATGACGGG GCAGAATGTG
ACTGAGTGTG TGGGTGGTGC TTTTGAGGTG ACTGAAGCAG ATTTGGCTGA TCGTTACCAT
ACTCATTGTG ACCCTCGCTT GAATGCTGAT CAATCTTTAG AGTTGGCATT TATGATTTCG
GAGACACTTA AGAAAGCAAG GTCTTAA
 
Protein sequence
MSQWNLTSWR EKTALQQPVY PNAEHLAQVE NTLGKMPPLV FAGEARQLKK ALAQVANRQS 
FLLQGGDCAE SFAEFHANNI RDTFKVMLQM AVVLTYAGKC PVVKVGRMAG QFAKPRSSGS
EVIGGIELPS YRGDIINGID FTEQARVPDP ERLVQVYNQS ASTMNLLRAF AQGGFADLHQ
VHQWNLDFLN ASPAGSRFQG VADKIDDALQ FMEACGIGPG LAQLKETDFY TSHEALLLPY
EQALTRKDSL TGDWYDCSAH MLWIGDRTRQ LDGAHVEFLR GVQNPIGVKA GPTMDPEDLL
RLCDVLNPNN EAGRLNIIVR MGADKVEDGM PKLIQAIQRE GKQVVWSSDP MHGNTVKAST
GYKTRRVDDV LKEVQQFFQV HNAEGSYAGG VHFEMTGQNV TECVGGAFEV TEADLADRYH
THCDPRLNAD QSLELAFMIS ETLKKARS
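
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the tool used to produce this record. The names `translate`, `gc_percent`, and `clean` are hypothetical. The codon assignments are the standard ones, which translation table 11 shares for all non-start codons; the sketch does not handle alternative start codons (for example GTG or TTG read as M), which is not an issue here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; it differs only in
# which codons may act as start codons, which this sketch ignores.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 27 bases of the gene sequence shown above.
print(translate("ATGTCACAGT GGAACCTAAC AAGTTGGAGA"))  # MSQWNLTSWR
print(translate("GAGACACTTA AGAAAGCAAG GTCTTAA"))     # ETLKKARS
```

The two printed fragments match the first ten and last eight residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.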