Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_3358 |
Symbol | |
ID | 4253924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 4005597 |
End bp | 4006655 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638119996 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_735481 |
Protein GI | 113971688 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0607485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATTACC AAAATGATGA CGTTCGCATT AATGAAGTCA AAGAGTTACT TCCACCAATT GCGATTCTAG AACGATTTCC TGCCTCCGAA AACGCTGCTG CAACTGTGTT TAACGCCCGT CAAAGCATTC ATAACATTCT TGCCCGCCAA GATGATCGTT TGTTAGTGGT CATTGGTCCT TGCTCAATTC ACGATCCTAA GGCCGCATTG GAATATGGCC AGCGATTGGT TGCGCTCAGA GAACGTTATC AAGATCAGCT CGAAGTGGTG ATGCGTGTTT ATTTTGAAAA GCCGCGTACT ACAGTGGGTT GGAAGGGACT CATCAACGAT CCTTATATGG ATAACAGCTT CAAACTGAAC GATGGTTTGC GTACCGCGCG TAAACTTTTA GTCGATTTGA ATGACTCTGG CATGCCGACC GCGGGTGAGT TTCTCGATAT GATCACTCCG CAATATGTGG CGGATATGAT GTGCTGGGGC GCGATTGGTG CGCGCACCAC TGAATCGCAG GTGCACCGTG AATTAGCGTC TGGGTTGTCT TGCCCTGTAG GCTTTAAAAA TGGCACCGAC GGCACGATCA AGGTGGCGAT TGATGCCATT GGCGCCGCGA ATGCGCCGCA CCACTTCTTA TCGGTAACCA AATTTGGCCA TTCGGCGATT GTGTCGACCA AAGGTAATCC TGATTGCCAT ATCATTCTGC GTGGTGGCCG TGAGCCTAAT TACAGTGCCA GCCATGTTGC GCAGATCAGC GAACAACTGA AAAAGGCCAA GCTTGTCGAT AACATTATGA TCGACTTTAG CCATGCCAAT AGCAGCAAAC AATATCAACG CCAGATGGAT GTCGCGAACG ATGTGGCCGA GCAAGTGGCC GCGGGTAATA AGGCAATTTT CGGTGTGATG GTTGAAAGCC ACTTAGTGGA AGGTCGCCAA GATCTGATTG AAGGCCAGGC GTTATGTTAT GGCCAGAGTA TTACGGATGC CTGTATTGGC TGGGATGATA CCGAGCGTCT GTTAGATGTG TTAAATCAGA GCGTTATTGC ACGCCGCCAG CGGGTCTAA
|
Protein sequence | MYYQNDDVRI NEVKELLPPI AILERFPASE NAAATVFNAR QSIHNILARQ DDRLLVVIGP CSIHDPKAAL EYGQRLVALR ERYQDQLEVV MRVYFEKPRT TVGWKGLIND PYMDNSFKLN DGLRTARKLL VDLNDSGMPT AGEFLDMITP QYVADMMCWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GTIKVAIDAI GAANAPHHFL SVTKFGHSAI VSTKGNPDCH IILRGGREPN YSASHVAQIS EQLKKAKLVD NIMIDFSHAN SSKQYQRQMD VANDVAEQVA AGNKAIFGVM VESHLVEGRQ DLIEGQALCY GQSITDACIG WDDTERLLDV LNQSVIARRQ RV
|
| |