Gene Information |
Locus tag | EcSMS35_1647 |
Symbol | |
ID | 6146287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1634430 |
End bp | 1635818 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616523 |
Product | putative succinate semialdehyde dehydrogenase |
Protein accession | YP_001743701 |
Protein GI | 170684086 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATTA CTCCGGCAAC TCATGCAATT TCGATAAATC CTGCCACCGG TGAACAACTT TCTGTGCTGC CGTGGGCTGG CGCTGATGAT ATCGAAAACG CACTTCAGCT GGCGGCAGCA GGCTTTCGCG ACTGGCGCGA GACAAATATA GATTATCGTG CTGAAAAACT GCGTGATATC GGTAAGGCTC TGCGTGTCCG TAGCGAAGAA ATGGCGCAAA TGATCACCCG TGAAATGGGC AAACCAATCA ACCAGGCGCG CGCTGAAGTG GCGAAATCGG CGAATTTGTG TGACTGGTAT GCAGAACATG GTCCGGCAAT GCTGAAGGCG GAACCTACGC TGGTGGAAAA TCAGCAGGCA GTTATTGAGT ATCGACCGTT GGGGACAATT CTGGCGATTA TGCCGTGGAA TTTTCCGTTA TGGCAGGTGA TGCGTGGCGC GGTTCCCATC ATTCTTGCAG GTAACGGCTA CTTACTTAAA CATGCGCCGA ATGTGATGGG CTGTGCACAG CTCATTGCCC AGGTGTTTAA AGATGCGGGT ATCCCACAAG GCGTATATGG CTGGCTGAAT GCCGACAACG ACGGCGTCAG TCAGATGATT AAAGACTCGC GCATTGCTGC TGTCACGGTG ACCGGAAGTG TTCGTGCGGG AGCGGCTATT GGCGCACAGG CTGGAGCGGC ACTGAAAAAA TGCGTACTGG AACTGGGCGG TTCGGATCCA TTTATTGTGC TTAACGATGC CGATCTGGAA CTGGCGGTTA AAGCGGCGGT AGCCGGACGT TATCAGAATA CCGGACAGGT TTGTGCAGCG GCAAAACGCT TTATCATCGA AGAGGGAATT GCTTCGGCAT TTACCGAACG TTTTGTGGCA GCTGCGGCAG CCTTGAAAAT GGGCGATCCC CGTGACGAAG AGAACACTCT CGGACCAATG GCTCGTTTTG ATTTACGTGA TGAGCTGCAT CATCAGGTGG AGAAAACCCT GGCGCAGGGT GCGCGTTTGT TACTGGGCGG GGAAAAGATG GCTGGGGCAG GTAACTACTA TCCGCCAACG GTTCTGGCGA ATGTTACCCC AGAAATGACC GCGTTTCGGG AAGAAATGTT TGGCCCCGTT GCGGCAATCA CCATTGCGAA AGATGCAGAA CATGCACTGG AACTGGCTAA TAATAGCGAG TTCGGCCTTT CAGCGACCAT TTTCACCACC GACGAAACCC GGGCCAGACA GATGGCGGCA CGTCTGGAAT GCGGTGGGGT GTTTATCAAT GGTTACTGTG CCAGCGACGC GCGAGTCGCC TTTGGTGGTG TGAAGAAGAG TGGCTTTGGT CGTGAGCTTT CCCATTTTGG CTTACACGAA TTCTGTAATA TCCAGACGGT GTGGAAAGAC CGGGTCTGA
Protein sequence | MTITPATHAI SINPATGEQL SVLPWAGADD IENALQLAAA GFRDWRETNI DYRAEKLRDI GKALRVRSEE MAQMITREMG KPINQARAEV AKSANLCDWY AEHGPAMLKA EPTLVENQQA VIEYRPLGTI LAIMPWNFPL WQVMRGAVPI ILAGNGYLLK HAPNVMGCAQ LIAQVFKDAG IPQGVYGWLN ADNDGVSQMI KDSRIAAVTV TGSVRAGAAI GAQAGAALKK CVLELGGSDP FIVLNDADLE LAVKAAVAGR YQNTGQVCAA AKRFIIEEGI ASAFTERFVA AAAALKMGDP RDEENTLGPM ARFDLRDELH HQVEKTLAQG ARLLLGGEKM AGAGNYYPPT VLANVTPEMT AFREEMFGPV AAITIAKDAE HALELANNSE FGLSATIFTT DETRARQMAA RLECGGVFIN GYCASDARVA FGGVKKSGFG RELSHFGLHE FCNIQTVWKD RV
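The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names and the deliberately partial codon table are illustrative and not part of the record or of any database API.

```python
# Values copied from the record above.
START_BP = 1634430
END_BP = 1635818
GENE_LENGTH_BP = 1389
PROTEIN_LENGTH_AA = 462

# First 21 bases of the gene sequence and first 7 residues of the protein.
GENE_PREFIX = "ATGACCATTACTCCGGCAACT"
PROTEIN_PREFIX = "MTITPAT"

# Hypothetical helper table: only the codons needed for the prefix above.
# These assignments are the same in translation table 11 and the standard code.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "ACC": "T", "ATT": "I",
    "ACT": "T", "CCG": "P", "GCA": "A",
}


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def translate_prefix(dna: str, table: dict) -> str:
    """Translate complete codons of `dna`; raises KeyError on unknown codons."""
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    print(translate_prefix(GENE_PREFIX, PARTIAL_CODON_TABLE) == PROTEIN_PREFIX)
```

A full check of the protein sequence would need the complete translation table 11, which is left out here to keep the sketch short.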