Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1942 |
Symbol | puuC |
ID | 6971505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1837566 |
End bp | 1839053 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643385872 |
Product | gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase |
Protein accession | YP_002270361 |
Protein GI | 209396804 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTTC ATCATCTGGC TTACTGGCAG GATAAAGCGT TAAGTCTCGC CATTGAAAAC CGCTTATTTA TTAACGGTGA ATATACTGCT GCGGCGGAAA ATGAAACCTT TGAAACCGTT GATCCGGTTA CCCAGGCACC GCTGGCGAAT ATTGCGCGCG GCAAGAGCGT CGATATCGAC CGTGCGGTGA GCGCAGCACG CGGCGTATTT GAACGCGGCG ACTGGTCACT CTCTTCTCCG GCAAAACGTA AAGCGGTGCT GAATAAACTC GCCGATTTAA TAGAAGCCAA CGCCGAAGAG CTGGCACTGC TGGAAACTCT CGACACCGGC AAACCGATTC GTCACAGTCT GCGTGATGAT ATACCCGGCG CGGCGCGCGC CATTCGCTGG TACGCCGAAG CGATCGACAA AGTGTATGGC GAAGTGGCGA CCACCAGTAG CCATGAGCTG GCGATGATCG TGCGTGAACC GGTCGGCGTG ATTGCCGCCA TCGTGCCGTG GAACTTCCCG CTGCTGCTGA CTTGCTGGAA GCTTGGCCCG GCGCTGGCAG CAGGGAACAG CGTTGTCCTG AAACCGTCTG AAAAATCACC GCTCAGTGCG ATTCGTCTCG CGGGGCTGGC GAAAGAAGCA GGCTTGCCGG ATGGTGTGTT GAACGTGGTG ACGGGTTTTG GTCATGAAGC GGGGCAGGCG CTGTCGCGTC ATAACGATAT CGACGCCATT GCCTTTACCG GTTCGACCCG TACCGGGAAA CAGCTGCTGA AAGATGCAGG CGACAGCAAC ATGAAACGCG TCTGGCTGGA AGCGGGCGGC AAAAGCGCCA ACATCGTTTT CGCTGACTGC CCGGATTTGC AACAGGCGGC AAGCGCCACC GCAGCAGGCA TCTTCTACAA CCAGGGGCAG GTGTGCATCG CCGGAACGCG CCTGTTGCTG GAAGAGAGCA TAGCCGATGA ATTCTTAGCC CTGTTAAAAC AGCAGGCGCA AAACTGGCAG CCGGGCCATC CACTTGATCC CGCAACCACC ATGGGCACCT TAATCGACTG CGCCCACGCC GACTCGGTCC ATAGCTTTAT TCAGGAAGGC GAAAGCAAAG GGCAACTGTT GTTGGATGGC CGTAACGCCG GGCTGGCTGT CGCCATCGGC CCGACCATCA TTGTGGATGT AGACCCGAAT GCGTCCTTAA GCCGCGAAGA GATTTTCGGT CCGGTGCTGG TGGTCACGCG TTTCACATCA GAAGAACAGG CGCTACAGCT TGCCAACGAC AGCCAGTACG GCCTTGGCGC GGCGGTATGG ACGCGCGACC TCTCCCGCGC GCACCGCATG AGCCGACGCC TGAAAGCCGG TTCCGTCTTC GTCAATAACT ACAACGACGG CGATATGACC GTGCCGTTTG GCGGCTATAA GCAGAGCGGC AACGGTCGCG ACAAATCCCT GCATGCCCTT GAAAAATTCA CCGAACTGAA AACCATCTGG ATAAGCCTGG AGGCCTGA
|
Protein sequence | MNFHHLAYWQ DKALSLAIEN RLFINGEYTA AAENETFETV DPVTQAPLAN IARGKSVDID RAVSAARGVF ERGDWSLSSP AKRKAVLNKL ADLIEANAEE LALLETLDTG KPIRHSLRDD IPGAARAIRW YAEAIDKVYG EVATTSSHEL AMIVREPVGV IAAIVPWNFP LLLTCWKLGP ALAAGNSVVL KPSEKSPLSA IRLAGLAKEA GLPDGVLNVV TGFGHEAGQA LSRHNDIDAI AFTGSTRTGK QLLKDAGDSN MKRVWLEAGG KSANIVFADC PDLQQAASAT AAGIFYNQGQ VCIAGTRLLL EESIADEFLA LLKQQAQNWQ PGHPLDPATT MGTLIDCAHA DSVHSFIQEG ESKGQLLLDG RNAGLAVAIG PTIIVDVDPN ASLSREEIFG PVLVVTRFTS EEQALQLAND SQYGLGAAVW TRDLSRAHRM SRRLKAGSVF VNNYNDGDMT VPFGGYKQSG NGRDKSLHAL EKFTELKTIW ISLEA
|
| |