Gene Information |
Locus tag | Mmwyl1_2600 |
Symbol | |
ID | 5365986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | - |
Start bp | 2941082 |
End bp | 2942428 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640804973 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001341448 |
Protein GI | 152996613 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
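The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 2941082
END_BP = 2942428


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length_bp(START_BP, END_BP)
print(length_bp)                     # 1347
print(protein_length_aa(length_bp))  # 448
```

Under those assumptions the results match the reported Gene Length (1347 bp) and Protein Length (448 aa).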
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.452677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000258304 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCACAGT GGAACCTAAC AAGTTGGAGA GAAAAGACGG CGCTACAACA GCCTGTTTAT CCTAATGCAG AGCATTTGGC TCAAGTTGAA AATACATTAG GAAAAATGCC TCCTCTTGTT TTTGCGGGTG AGGCGCGTCA GCTTAAAAAA GCTTTAGCAC AGGTGGCCAA TAGGCAGTCA TTTTTGCTGC AAGGTGGTGA TTGTGCTGAA AGCTTTGCTG AGTTCCATGC TAACAATATT CGCGATACGT TTAAAGTTAT GCTGCAAATG GCGGTTGTGT TGACTTATGC TGGTAAATGC CCAGTAGTAA AGGTTGGGCG CATGGCTGGG CAATTTGCTA AGCCGAGATC GTCAGGTAGT GAAGTAATTG GTGGTATTGA ATTGCCTAGT TACCGTGGTG ATATCATCAA TGGTATCGAT TTTACTGAGC AAGCAAGGGT TCCTGATCCT GAGCGTTTGG TGCAAGTTTA CAATCAGAGT GCATCGACAA TGAACTTGCT TCGAGCTTTT GCTCAGGGTG GATTTGCAGA TTTACATCAA GTGCATCAAT GGAATTTGGA CTTTTTGAAT GCGAGTCCAG CGGGTAGTCG TTTCCAGGGC GTGGCTGACA AGATTGATGA CGCGCTTCAG TTTATGGAGG CGTGTGGTAT TGGTCCTGGT TTAGCTCAGT TAAAAGAGAC AGATTTTTAT ACTTCCCATG AGGCGTTGTT GTTGCCTTAT GAGCAGGCTT TGACTCGTAA AGATAGTCTG ACTGGTGACT GGTATGATTG TTCTGCGCAC ATGTTATGGA TTGGAGATCG TACTCGTCAA TTAGATGGCG CCCATGTAGA GTTTTTGCGT GGTGTACAAA ACCCAATTGG CGTAAAGGCT GGCCCTACTA TGGATCCAGA AGATTTACTA AGATTGTGTG ATGTGCTGAA TCCAAATAAC GAAGCGGGTC GTTTGAATAT TATTGTGCGT ATGGGGGCAG ATAAAGTCGA AGACGGCATG CCTAAGCTCA TTCAAGCAAT TCAGCGCGAA GGTAAGCAGG TTGTGTGGAG TAGTGACCCG ATGCACGGCA ACACTGTAAA AGCGTCTACG GGGTATAAGA CTCGACGTGT CGATGATGTG TTAAAAGAGG TGCAGCAGTT CTTCCAAGTT CATAACGCTG AAGGTAGCTA TGCAGGCGGC GTTCATTTTG AAATGACGGG GCAGAATGTG ACTGAGTGTG TGGGTGGTGC TTTTGAGGTG ACTGAAGCAG ATTTGGCTGA TCGTTACCAT ACTCATTGTG ACCCTCGCTT GAATGCTGAT CAATCTTTAG AGTTGGCATT TATGATTTCG GAGACACTTA AGAAAGCAAG GTCTTAA
|
Protein sequence | MSQWNLTSWR EKTALQQPVY PNAEHLAQVE NTLGKMPPLV FAGEARQLKK ALAQVANRQS FLLQGGDCAE SFAEFHANNI RDTFKVMLQM AVVLTYAGKC PVVKVGRMAG QFAKPRSSGS EVIGGIELPS YRGDIINGID FTEQARVPDP ERLVQVYNQS ASTMNLLRAF AQGGFADLHQ VHQWNLDFLN ASPAGSRFQG VADKIDDALQ FMEACGIGPG LAQLKETDFY TSHEALLLPY EQALTRKDSL TGDWYDCSAH MLWIGDRTRQ LDGAHVEFLR GVQNPIGVKA GPTMDPEDLL RLCDVLNPNN EAGRLNIIVR MGADKVEDGM PKLIQAIQRE GKQVVWSSDP MHGNTVKAST GYKTRRVDDV LKEVQQFFQV HNAEGSYAGG VHFEMTGQNV TECVGGAFEV TEADLADRYH THCDPRLNAD QSLELAFMIS ETLKKARS
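The relationship between the two sequence fields can be illustrated by translating the start of the gene sequence. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string, whose sense-codon assignments translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). Only the first 30 bases are used; `translate` and the variable names are illustrative.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
prefix = "ATGTCACAGT GGAACCTAAC AAGTTGGAGA"
print(translate(prefix))  # MSQWNLTSWR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should give the complete 448-residue protein, with translation stopping at the final TAA codon.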