Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sputcn32_3238 |
Symbol | |
ID | 5080079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | - |
Start bp | 3765112 |
End bp | 3766167 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640500435 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001184749 |
Protein GI | 146294325 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000276363 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTACC AAAACGATGA CGTTCGCATT AAAGAAATCA AAGAATTACT TCCTCCCATT GCGATTCTTG AACGATTTCC TGCTTCCGAA AAAGCCTCTG CCACCGTATT TAATGCCCGT CAAAGTATTC ACCATATTTT AGCGAAAAAT GATGATCGCC TTTTAGTGGT AATCGGCCCA TGCTCTATCC ATGATCCTAA AGCCGCATTG GAATATGGTC AGCGTCTTGT TTCTCTGCGC GAGCATTATA AAGATCAACT TGAGATTGTG ATGCGAGTGT ATTTTGAAAA GCCAAGAACG ACTGTGGGTT GGAAGGGATT GATCAACGAT CCTTATATGG ATAACAGTTT TAAACTAAAT GATGGACTGC GTACCGCCCG AAAATTATTA GTGGATTTAA ATGATAGTGG TATGCCAACG GCGGGAGAGT TCCTTGATAT GATCACGCCA CAATATATGG CCGACTTAAT GTGTTGGGGA GCCATTGGTG CGCGTACAAC TGAGTCACAG GTACATCGAG AGTTGGCATC GGGGCTTTCG TGTCCTGTCG GGTTTAAAAA TGGCACTGAT GGTACGATTA AAGTCGCCAT TGATGCGATT GGTGCTGCAA ATGCACCACA CCATTTTCTT TCTGTCACCA AGTTTGGTCA TTCTGCGATT GTGTCAACTA AGGGGAATCC CGATTGCCAT ATTATTTTGC GTGGTGGACG CGAACCTAAC TACAGTGCGC CCCATGTCGC ACAGATTAGT GAGCAGCTAC AAAAAGCCAA ATTACCCGAT AATGTGATGA TCGACTTTAG TCATGCTAAT AGTAGTAAAC AGTATCAGCG CCAAATGGTA GTCGCTCAAG ATGTGGCTGA CCAAGTTGCC GCAGGCAATA AAGCTATTTT TGGCGTTATG GTCGAGAGTC ATTTAGTGGA AGGGCGTCAG GATCTCGTTG AGGGGCAAGA TTTATGCTAC GGCCAAAGCA TTACCGATGC CTGTATTGGT TGGGATGACA CTGAGCGTTT GTTGGCTGTA TTAGCCCAAA GTGTGATTGA GCGCCGCAAG GCCTAA
|
Protein sequence | MHYQNDDVRI KEIKELLPPI AILERFPASE KASATVFNAR QSIHHILAKN DDRLLVVIGP CSIHDPKAAL EYGQRLVSLR EHYKDQLEIV MRVYFEKPRT TVGWKGLIND PYMDNSFKLN DGLRTARKLL VDLNDSGMPT AGEFLDMITP QYMADLMCWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GTIKVAIDAI GAANAPHHFL SVTKFGHSAI VSTKGNPDCH IILRGGREPN YSAPHVAQIS EQLQKAKLPD NVMIDFSHAN SSKQYQRQMV VAQDVADQVA AGNKAIFGVM VESHLVEGRQ DLVEGQDLCY GQSITDACIG WDDTERLLAV LAQSVIERRK A
|
| |