Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_2834 |
Symbol | |
ID | 4253405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 3389216 |
End bp | 3390307 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 638119469 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_734962 |
Protein GI | 113971169 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000633865 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAGG ATACGATCAA TAACGTACAC ATCAGTTCAG AGAAAGTTCT TATTACACCG CAGGAGCTAA AAGCGGCCCT GCCACTCTCT GAGCATGCGT ATCGTTATAT CCTCAATGCA CGCAAAACAG TGGCGGATAT TGTCCATAAG CGCGATAACC GCGTGCTTAT CGTCACTGGA CCTTGTTCTA TCCATGATAT CGATGCCGCC AAGGAATACG CCTTAAAGCT TAAAAAACTA CACGATGAAC TCAGCGATGA GTTTTACATC CTGATGCGAG TCTACTTTGA GAAGCCACGT ACCACAGTGG GTTGGAAGGG GATGATTAAC GATCCTGATA TGGACGAGTC CTTCGATGTT GAGAAGGGTT TAAAGATGGC CCGTGAGCTG ATGATTTGGT TGGCCGAACT CGAACTGCCT GTGGCAACCG AAGCGCTCGA TCCGATCAGC CCACAGTACA TCTCGGAATT AGTCACTTGG TCGGCCATTG GTGCTCGCAC GACTGAGTCG CAAACCCACA GGGAAATGGC GTCGGGTTTA TCTATGCCCG TCGGCTTTAA AAATGGCACA GATGGAAAGC TCGATGTAGC GATTAACGCG CTGAAATCGG CGGCCAGCAG CCACAGATTT ATGGGGATTA ACCAGCAGGG CCAAGTGGCG TTATTGCAAA CCGCGGGAAA TCCCGATGGA CATGTGATCC TGCGTGGCGG CGCCACGCCA AACTACGATG CAAAGAGCGT TGCCGAGTGT GAAGCGCAGT TACATAAAGC CAAACTCAAT GCTCGCCTTA TTATCGATTG TAGCCACGGT AATTCGTCTA AGGACTACAC AAGGCAAGTC CCTGTGTGTG AAGATGTATT TGCCCAGATT TACAACGGCA ATAAATCCAT CATTGGCGTG ATGTTAGAAA GCCATTTAAA TGAAGGTAAT CAGAGTTGTG ATAAACCATT GAGCGAATTG GCCTATGGGG TTTCTGTGAC AGACTCCTGT ATTAATTGGG AAAAAACCGA AAGCGTTTTA CGTGCGGGCG CATCGAAGTT ATCTTCGGTA TTAGCAACCC GTTTCGATAT GCTCAAAGTG GCCAATGCTT AA
|
Protein sequence | MQQDTINNVH ISSEKVLITP QELKAALPLS EHAYRYILNA RKTVADIVHK RDNRVLIVTG PCSIHDIDAA KEYALKLKKL HDELSDEFYI LMRVYFEKPR TTVGWKGMIN DPDMDESFDV EKGLKMAREL MIWLAELELP VATEALDPIS PQYISELVTW SAIGARTTES QTHREMASGL SMPVGFKNGT DGKLDVAINA LKSAASSHRF MGINQQGQVA LLQTAGNPDG HVILRGGATP NYDAKSVAEC EAQLHKAKLN ARLIIDCSHG NSSKDYTRQV PVCEDVFAQI YNGNKSIIGV MLESHLNEGN QSCDKPLSEL AYGVSVTDSC INWEKTESVL RAGASKLSSV LATRFDMLKV ANA
|
| |