Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2475 |
Symbol | |
ID | 6144252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2522501 |
End bp | 2523514 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617347 |
Product | putative semialdehyde dehydrogenase |
Protein accession | YP_001744519 |
Protein GI | 170680975 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0136] Aspartate-semialdehyde dehydrogenase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAG GCTGGAACAT TGCCGTCCTG GGCGCAACTG GCGCTGTGGG CGAAGCCCTG CTTGAAACGC TGGCTGAACG TCAGTTCCCG GTTGGGGAAA TTTATGCACT GGCACGTAAC GAAAGCGCAG GCGAACAACT GCGCTTTGGT GGTAAGACAA TCACCGTGCA GGATGCCGCT GAATTCGACT GGACGCAGGC GCAGCTGGCG TTTTTTGTTG CAGGCAAAGA AGCTACCGCT GCCTGGGTTG AAGAAGCGAC CAACTCAGGT TGTCTGGTGA TCGACAGCAG CGGATTGTTT GCTCTCGAAC CCGACGTACC GCTGGTGGTG CCGGAAGTAA ACCCGTTTGT ACTGACCGAT TACCGCAACC GGAATGTCAT CGCCGTACCA GACAGTCTGA CCAGCCAGCT GCTGGCGGCA CTGAAACCGC TAATTGATCA GGGCGGTTTA TCACGTATCA GCGTTACCAG CCTGATTTCA GCCTCCGCCC AGGGTAAAAA AGCGGTCGAT GCGTTAGCGG GGCAGAGTGC TAAATTACTC AATGGCATTC CGATTGACGA AGAAGATTTC TTCGGGCGTC AACTGGCATT CAATATGCTG CCGTTACTGC CGGATAGCGA AGGTAGCGTG CGTGAAGAAC GTCGTATCGT TGACGAAGTA CGCAAAATCC TGCAGGACGA AGGGCTGATG ATTTCGGCAA GCGTCGTCCA GGCACCGGTA TTCTATGGTC ATGCCCAGAT GGTCAACTTT GAAGCACTGC GTCCGCTGGC GGCAGAAGAA GCGCGTGATG CGTTTGCTCA GGGCGAAGAT ATTGTGCTCT CTGAAGAGAA CGAATTCCCG ACTCAGGTGG GGGATGCTTC TGGTACGCCG CATCTTTCCG TTGGCTGCGT ACGTAATGAT TACGGTATGC CGGAGCAAGT TCAGTTCTGG TCGGTGGCCG ATAACGTTCG CTTTGGCGGC GCGCTGATGG CAGTAAAAAT CGCCGAGAAA CTGGTGCAGG AGTATCTGTA CTAA
|
Protein sequence | MSEGWNIAVL GATGAVGEAL LETLAERQFP VGEIYALARN ESAGEQLRFG GKTITVQDAA EFDWTQAQLA FFVAGKEATA AWVEEATNSG CLVIDSSGLF ALEPDVPLVV PEVNPFVLTD YRNRNVIAVP DSLTSQLLAA LKPLIDQGGL SRISVTSLIS ASAQGKKAVD ALAGQSAKLL NGIPIDEEDF FGRQLAFNML PLLPDSEGSV REERRIVDEV RKILQDEGLM ISASVVQAPV FYGHAQMVNF EALRPLAAEE ARDAFAQGED IVLSEENEFP TQVGDASGTP HLSVGCVRND YGMPEQVQFW SVADNVRFGG ALMAVKIAEK LVQEYLY
|
| |