Gene Information |
Locus tag | EcSMS35_1822 |
Symbol | puuC |
ID | 6143841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1841630 |
End bp | 1843117 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641616698 |
Product | gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase |
Protein accession | YP_001743876 |
Protein GI | 170680173 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
| |
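The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1841630
END_BP = 1843117


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1488
print(protein_length(length_bp))  # 495
```

Both values agree with the Gene Length (1488 bp) and Protein Length (495 aa) fields of the record.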
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0935717 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTTC ATCATCTGGC TTACTGGCAG GATAAAGCGT TAAGTCTCGC CATTGAAACC CGCTTATTTA TTAACGGTGA ATATACTGCT GCGGCGGAAA ATGAAACTTT TGAAACCGTT GATCCGGTCA CCCAGGCACC GCTGGCGAAA ATTGCCCGCG GCAAGAGCGT CGATATCGAC CGTGCGGTGA GCGCAGCACG CGGCGTATTT GAACGCGGCG ACTGGTCACT CTCTTCTCCG GCTAAACGTA AAGCGGTACT GAATAAACTC GCCGATTTAA TGGAAGCTAA CGCTGAAGAA CTGGCGCTGC TGGAAACGCT CGACACCGGC AAACCGATTC GTCACAGCCT GCGTGATGAT ATTCCCGGCG CGGCGCGCGC CATTCGCTGG TACGCCGAAG CGATCGACAA AGTGTATGGC GAAGTGGCGA CCACCAGTAG CCATGAGCTG GCGATGATCG TGCGTGAACC GGTCGGCGTG ATTGCCGCCA TCGTGCCGTG GAACTTCCCG CTGCTGCTGA CTTGCTGGAA ACTCGGCCCG GCGCTGGCGG CGGGAAACAG CGTGATTCTA AAACCGTCTG AAAAATCACC GCTCAGTGCG ATTCGTCTCG CGGGGCTGGC GAAAGAAGCG GGGCTGCCGG ATGGTGTGTT GAACGTGGTG ACGGGTTTTG GTCATGAAGC CGGGCAGGCG CTTTCGCGCC ATAACGATAT CGATGCCATT GCCTTTACTG GCTCAACCCG CACCGGGAAA CAGCTGCTGA AGGACGCGGG CGACAGCAAC ATGAAACGCG TCTGGCTGGA GGCGGGCGGC AAAAGCGCCA ACATCGTTTT CGCCGACTGC CCGGATTTAC AACAGGCAGC AAGCGCCACC GCCGCAGGCA TTTTCTACAA CCAGGGCCAG GTGTGCATCG CCGGAACGCG TTTGTTGCTG GAAGAGAGCA TCGCCGATGA ATTCTTAGCC CTGTTAAAAC AGCAGGCGCA AAACTGGCAG CCGGGCCATC CACTTGATCC CGCAACCACC ATGGGCACCT TAATCGACTG CGCCCACGCC GACTCGGTCC ATAGCTTTAT TCGGGAAGGC GAAAGCAAAG GGCAACTGCT GCTGGATGGC CGTAACGCCG AGCTGGCTGC CGCCATCGGC CCGACCATCT TTGTGGAGGT AGACCCGAAT GCGTCCTTAA GCCGCGAAGA GATTTTCGGT CCGGTGCTGG TGGTCACGCG TTTCACATCA GAAGAACAGG CGCTACAGCT TGCCAACGAC TGCCAGTACG GACTTGGCGC GGCGGTATGG ACGCGCGACC TCTCCCGCGC GCACCGCATG AGTCGCCGCC TGAAAGCCGG CTCCGTCTTC GTCAATAACT ACAACGACGG CGATATGACC GTGCCGTTTG GCGGCTATAA GCAGAGCGGC AACGGGCGCG ACAAATCCCT GCATGCCCTT GACAAATTCA CCGAACTGAA AACCATCTGG ATAAGCCTGG AGGCCTGA
|
Protein sequence | MNFHHLAYWQ DKALSLAIET RLFINGEYTA AAENETFETV DPVTQAPLAK IARGKSVDID RAVSAARGVF ERGDWSLSSP AKRKAVLNKL ADLMEANAEE LALLETLDTG KPIRHSLRDD IPGAARAIRW YAEAIDKVYG EVATTSSHEL AMIVREPVGV IAAIVPWNFP LLLTCWKLGP ALAAGNSVIL KPSEKSPLSA IRLAGLAKEA GLPDGVLNVV TGFGHEAGQA LSRHNDIDAI AFTGSTRTGK QLLKDAGDSN MKRVWLEAGG KSANIVFADC PDLQQAASAT AAGIFYNQGQ VCIAGTRLLL EESIADEFLA LLKQQAQNWQ PGHPLDPATT MGTLIDCAHA DSVHSFIREG ESKGQLLLDG RNAELAAAIG PTIFVEVDPN ASLSREEIFG PVLVVTRFTS EEQALQLAND CQYGLGAAVW TRDLSRAHRM SRRLKAGSVF VNNYNDGDMT VPFGGYKQSG NGRDKSLHAL DKFTELKTIW ISLEA
|
| |
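The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the gene sequence is given in coding orientation (it begins with ATG although the gene lies on the minus strand) and that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons, which does not matter here. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It also shows how a GC percentage such as the one in the GC content field could be computed.

```python
from itertools import product

# Codon assignments in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGAATTTTC ATCATCTGGC TTACTGGCAG"))  # MNFHHLAYWQ
print(translate("ATAAGCCTGG AGGCCTGA"))               # ISLEA
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole record to be checked the same way.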