Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0974 |
Symbol | |
ID | 4284952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1071803 |
End bp | 1073173 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638140443 |
Product | 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase |
Protein accession | YP_756205 |
Protein GI | 114569525 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTGGT CCCCCCAATC CTGGCGCTCC AAACCCGTCA GTCAGATGCC GAACTACAAG GATGCCGCCA AGCTGGATGC CGCGATTGCC GAGCTTTCGG CGCGTCCGGC TCTGGTCTTT GCCGGCGAGG CACGACGCCT GCGGCGCCAG CTGGCCGATG TGACGGCGGG CAAGGCCTTC CTGTTGCAGG GCGGTGATTG TGCCGAGAGC TTCAAGGAGT TCTCGACCGA GGGGGTGCGC GACACCTTCC GTGTGCTGCT GCAGATGGCG GTGGTGATGA CCTTTGCCGC CTCCAAGCCG ATCGTGAAGG TCGGCCGGAT CGCGGGCCAG TTCGCCAAGC CCCGCTCGGC GGACATGGAG ACCATTGATG GCGTGTCATT GCCCAGCTAT CGCGGTGACA GCGTCAATGG ACCCGAATTC ACGCCGGAAG CGCGTGAGCC CGATCCGCAG CGCTTGATCC GCGCTTATGA CCAGTCAGCC TCGACACTGA ACCTGCTGCG CGCCTTTGCG TCAGGCGGTT ATGCCGACCT GCACAATGTC CACCAGTGGA CCCAGGACTT CGTCAGCGAC AGCCCGGCGG CGGAACGCTA TGCCGAGACC GCGGCGCGCA TCTCCGAAGC CCTGGCCTTC ATGAAGGCCT GCGGCATCGG TCGCGACAGT GCGCCGTCGC TGGAAGCGGT CGACTTCTTC ACCAGTCACG AAGCGCTGCA TCTGCCCTTC GAGGAAGCGT TGACCCGCCG CGACCCCAAT ACCGGCCAGT GGTATGCCAC CTCGGCGCAC ATGATCTGGA CCGGCGAGCG GACCCGTCAG CTCGACGGGG CCCATGTCGA GTATGCGCGG GGCATCGCCA ATCCCGTCGG CGTCAAATGC GGCCCGACCA TGCAGCCGGA CGACCTGCTG CCGCTGATCG ACGCGCTGAA CCCGGACAAT GAGGCCGGGC GCCTCGTCCT GATCGTGCGC ATGGGCGCCG ATAATGTGGT CAAGAACTTG CCCAAGCTCG CCGCCGCCGT GACCAAGGCC GGCCGCAAGG TGGTCTGGTC GTCCGACCCG ATGCACGGCA ACACCCACAA AACCTCAAAT GGCTACAAGA CCCGCGACTT TGACCGCATC CTGTCCGAAC TCGAGGGCTT CATGGATGTA CTCTATGCCG AGGGGGCCTA TCCCGGCGGT GTGCATTTCG AGATGACCGG TCGCGATGTG ACCGAGTGCG TCGGCGGCGC CAAGACGGTC ACCGAGGCTG ATCTGGCGGC GCGCTATCAC ACCCATTGCG ATCCGCGCCT GAATGCCGAC CAGGCGCTCG ACATGGCCTT CCGCATTGCC GAGAGCCTGA AGCGGGTCCG CAACAACAAC TCGGCTGCCA ACGCGGCCTG A
|
Protein sequence | MTWSPQSWRS KPVSQMPNYK DAAKLDAAIA ELSARPALVF AGEARRLRRQ LADVTAGKAF LLQGGDCAES FKEFSTEGVR DTFRVLLQMA VVMTFAASKP IVKVGRIAGQ FAKPRSADME TIDGVSLPSY RGDSVNGPEF TPEAREPDPQ RLIRAYDQSA STLNLLRAFA SGGYADLHNV HQWTQDFVSD SPAAERYAET AARISEALAF MKACGIGRDS APSLEAVDFF TSHEALHLPF EEALTRRDPN TGQWYATSAH MIWTGERTRQ LDGAHVEYAR GIANPVGVKC GPTMQPDDLL PLIDALNPDN EAGRLVLIVR MGADNVVKNL PKLAAAVTKA GRKVVWSSDP MHGNTHKTSN GYKTRDFDRI LSELEGFMDV LYAEGAYPGG VHFEMTGRDV TECVGGAKTV TEADLAARYH THCDPRLNAD QALDMAFRIA ESLKRVRNNN SAANAA
|
| |