Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1729 |
Symbol | ydcW |
ID | 6146236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1734234 |
End bp | 1735658 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616606 |
Product | gamma-aminobutyraldehyde dehydrogenase |
Protein accession | YP_001743784 |
Protein GI | 170681515 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03374] 1-pyrroline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000000210519 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAACATA AGTTACTGAT TAACGGAGAA CTGGTTAGCG GCGAAGGGGA AAAACAGCCT GTCTATAATC CGGCAACGGG GGACGTTTTA CTGGAGATTG CCGAGGCATC CGCAGAACAG GTGGATGCTG CTGTTCGCGC GGCAGATGCA GCATTTGCCG AATGGGGGCA AACCACGCCA AAAGCGCGTG CGGAATGCCT GCTGAAACTG GCGGATGTTA TCGAAGAAAA TGGTCAGGTT TTTGCCGAAC TGGAATCCCG TAATTGTGGC AAACCGCTGC ATAGTGCGTT CAATGATGAA ATTCCGGCGA TTGTCGATGT TTTTCGCTTT TTCGCGGGTG CGGCGCGCTG TCTGAATGGT CTGGCGGCAG GTGAATATCT TGAAGGTCAT ACTTCGATGA TCCGTCGCGA TCCGTTGGGG GTTGTGGCTT CTATCGCACC GTGGAATTAT CCGCTGATGA TGGCAGCGTG GAAACTGGCT CCGGCGCTGG CGGCAGGGAA CTGCGTAGTG CTTAAACCAT CAGAAATTAC CCCGCTGACC GCGTTGAAGT TGGCTGAGCT GGCGAAAGAT ATCTTCCCGG CAGGCGTGAT TAACGTACTG TTTGGCAGAG GCAAAACGGT GGGCGATCCG CTGACCGGAC ATCCCAAAGT GCGGATGGTG TCGTTGACGG GCTCTATCGC CACCGGCGAG CACATCATCA GCCATACCGC GCCGTCCATT AAGCGTACTC ATATGGAACT TGGTGGTAAA GCGCCAGTGA TTGTTTTTGA TGATGCGGAT ATTGAAGCAG TGGTCGAAGG TGTACGTACT TTTGGCTATT ACAATGCTGG ACAGGATTGT ACTGCGGCTT GTCGGATCTA CGCGCAAAAA GGCATTTACG ATACGCTGGT GGAAAAACTC GGTGCTGCGG TGGCAACGTT AAAATCTGGT GCACCGGATG ACGACTCTAC GGAGCTTGGA CCTTTAAGCT CGCTGGCGCA TCTCGAACGC GTCAGCAAGG CGGTAGAAGA GGCGAAAGCG ACAGGGCACA TCAAAGTGAT CACTGGCGGT GAAAAGCGCA AGGGTAATGG CTATTACTAT GCGCCGACGC TGCTGGCTGG CGCATTACAG GACGATGCCA TCGTGCAAAA AGAGGTATTT GGTCCGGTAG TGAGTGTTAC GCTCTTCGAC AACGAAGAAC AGGTGGTGAA CTGGGCAAAT GACAGCCAGT ACGGACTTGC ATCCTCGGTA TGGACGAAAG ATGTGGGCAG GGCGCATCGC GTCAGCGCGC GGCTGCAATA TGGTTGTACC TGGGTCAATA CCCATTTCAT GCTGGTAAGT GAAATGCCGC ACGGTGGGCA GAAACTTTCT GGTTACGGCA AGGATATGTC ACTTTATGGG CTGGAGGATT ACACCGTCGT CCGCCACGTC ATGGTTAAAC ATTAA
|
Protein sequence | MQHKLLINGE LVSGEGEKQP VYNPATGDVL LEIAEASAEQ VDAAVRAADA AFAEWGQTTP KARAECLLKL ADVIEENGQV FAELESRNCG KPLHSAFNDE IPAIVDVFRF FAGAARCLNG LAAGEYLEGH TSMIRRDPLG VVASIAPWNY PLMMAAWKLA PALAAGNCVV LKPSEITPLT ALKLAELAKD IFPAGVINVL FGRGKTVGDP LTGHPKVRMV SLTGSIATGE HIISHTAPSI KRTHMELGGK APVIVFDDAD IEAVVEGVRT FGYYNAGQDC TAACRIYAQK GIYDTLVEKL GAAVATLKSG APDDDSTELG PLSSLAHLER VSKAVEEAKA TGHIKVITGG EKRKGNGYYY APTLLAGALQ DDAIVQKEVF GPVVSVTLFD NEEQVVNWAN DSQYGLASSV WTKDVGRAHR VSARLQYGCT WVNTHFMLVS EMPHGGQKLS GYGKDMSLYG LEDYTVVRHV MVKH
|
| |