Gene Information |
Locus tag | Sbal223_3646 |
Symbol | |
ID | 7089580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 4324797 |
End bp | 4325852 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643462526 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002359547 |
Protein GI | 217974796 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
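The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4324797, 4325852)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the Start bp and End bp listed in the record, this reproduces the reported Gene Length (1056 bp) and Protein Length (351 aa).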
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00003774 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00948062 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTATTACC AAAATGATGA CGTTCGCATT AAAGAAGTAA AAGAGTTACT TCCTCCTATC GCGATTCTAG AACGATTTCC TGCTTCTGAA AAAGCCTCTG CGACTGTGTT TAATGCGCGA AATAGTATCC ACAATATTCT GGCTAAGTCT GATGATCGCC TGTTAGTGGT GATTGGGCCT TGTTCTATCC ACGATCCCAA AGCAGCGTTG GAATATGGTC AACGTCTGGT TGCGCTGCGT GAGCGTTATA AGGATCAACT CGAAATCGTG ATGCGAGTGT ATTTTGAAAA GCCAAGAACC ACAGTGGGTT GGAAGGGGCT TATCAACGAT CCTTACATGG ATAACAGCTT TAAACTCAAC GATGGTTTAC GCACTGCGCG TAAGTTATTG GTGGATTTGA ACGATAGCGG CATGCCAACC GCGGGTGAGT TTCTTGATAT GATCACCCCA CAATATATGG CAGATTTAAT GTGCTGGGGA GCCATTGGTG CTCGTACTAC TGAATCACAA GTGCACAGAG AGTTAGCCTC GGGTCTTTCT TGTCCGGTCG GTTTTAAAAA TGGAACTGAT GGCACCATTA AAGTCGCTAT CGATGCGATA GGTGCTGCCA ATGCACCGCA CCATTTTTTA TCTGTGACTA AGTTGGGTCA TTCGGCGATC GTTTCGACGA AAGGCAATCC TGATTGCCAC ATTATTTTAC GTGGCGGCCG CGAGCCTAAT TATAGTGCGC CGCATGTCGC TGAAATTAGC CAACAGTTAT TAAAAGCTAA GCTTACCGAC AACATCATGA TCGACTTTAG CCACGCCAAT AGTAGTAAAC AGTATCAACG TCAGTTAGTG GTTGCCGAAG ATGTGGCTGG CCAAGTGGCG ACGGGCAATA CTGCTATTTT TGGTGTTATG GTAGAAAGCC ATTTAGTGGA AGGTCGTCAG GATTTAATTG AAGGTCAAGA GTTGTGTTAT GGCCAGAGTA TTACCGATGC GTGTATCGGT TGGGATGATA CTGAGAGCCT GTTGGCCATT CTGAATCAGA GTATTATCGA ACGCCGTCAG GTTTAA
|
Protein sequence | MYYQNDDVRI KEVKELLPPI AILERFPASE KASATVFNAR NSIHNILAKS DDRLLVVIGP CSIHDPKAAL EYGQRLVALR ERYKDQLEIV MRVYFEKPRT TVGWKGLIND PYMDNSFKLN DGLRTARKLL VDLNDSGMPT AGEFLDMITP QYMADLMCWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GTIKVAIDAI GAANAPHHFL SVTKLGHSAI VSTKGNPDCH IILRGGREPN YSAPHVAEIS QQLLKAKLTD NIMIDFSHAN SSKQYQRQLV VAEDVAGQVA TGNTAIFGVM VESHLVEGRQ DLIEGQELCY GQSITDACIG WDDTESLLAI LNQSIIERRQ V
|
| |
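The gene sequence is shown in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly even though the record lists the minus strand. The sketch below is one possible way to check the listed fields against the sequence: it strips the spaces between the 10-base blocks, computes the GC fraction, and translates codons until the first stop. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in the permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGTATTACC AAAATGATGA CGTTCGCATT"))
```

Applied to the full Gene sequence field, `translate` can be compared with the Protein sequence field (after removing its spaces), and `gc_fraction` with the reported GC content of 45%.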