Gene ECH74115_1942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1942 
SymbolpuuC 
ID6971505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1837566 
End bp1839053 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content57% 
IMG OID643385872 
Productgamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase 
Protein accessionYP_002270361 
Protein GI209396804 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTTC ATCATCTGGC TTACTGGCAG GATAAAGCGT TAAGTCTCGC CATTGAAAAC 
CGCTTATTTA TTAACGGTGA ATATACTGCT GCGGCGGAAA ATGAAACCTT TGAAACCGTT
GATCCGGTTA CCCAGGCACC GCTGGCGAAT ATTGCGCGCG GCAAGAGCGT CGATATCGAC
CGTGCGGTGA GCGCAGCACG CGGCGTATTT GAACGCGGCG ACTGGTCACT CTCTTCTCCG
GCAAAACGTA AAGCGGTGCT GAATAAACTC GCCGATTTAA TAGAAGCCAA CGCCGAAGAG
CTGGCACTGC TGGAAACTCT CGACACCGGC AAACCGATTC GTCACAGTCT GCGTGATGAT
ATACCCGGCG CGGCGCGCGC CATTCGCTGG TACGCCGAAG CGATCGACAA AGTGTATGGC
GAAGTGGCGA CCACCAGTAG CCATGAGCTG GCGATGATCG TGCGTGAACC GGTCGGCGTG
ATTGCCGCCA TCGTGCCGTG GAACTTCCCG CTGCTGCTGA CTTGCTGGAA GCTTGGCCCG
GCGCTGGCAG CAGGGAACAG CGTTGTCCTG AAACCGTCTG AAAAATCACC GCTCAGTGCG
ATTCGTCTCG CGGGGCTGGC GAAAGAAGCA GGCTTGCCGG ATGGTGTGTT GAACGTGGTG
ACGGGTTTTG GTCATGAAGC GGGGCAGGCG CTGTCGCGTC ATAACGATAT CGACGCCATT
GCCTTTACCG GTTCGACCCG TACCGGGAAA CAGCTGCTGA AAGATGCAGG CGACAGCAAC
ATGAAACGCG TCTGGCTGGA AGCGGGCGGC AAAAGCGCCA ACATCGTTTT CGCTGACTGC
CCGGATTTGC AACAGGCGGC AAGCGCCACC GCAGCAGGCA TCTTCTACAA CCAGGGGCAG
GTGTGCATCG CCGGAACGCG CCTGTTGCTG GAAGAGAGCA TAGCCGATGA ATTCTTAGCC
CTGTTAAAAC AGCAGGCGCA AAACTGGCAG CCGGGCCATC CACTTGATCC CGCAACCACC
ATGGGCACCT TAATCGACTG CGCCCACGCC GACTCGGTCC ATAGCTTTAT TCAGGAAGGC
GAAAGCAAAG GGCAACTGTT GTTGGATGGC CGTAACGCCG GGCTGGCTGT CGCCATCGGC
CCGACCATCA TTGTGGATGT AGACCCGAAT GCGTCCTTAA GCCGCGAAGA GATTTTCGGT
CCGGTGCTGG TGGTCACGCG TTTCACATCA GAAGAACAGG CGCTACAGCT TGCCAACGAC
AGCCAGTACG GCCTTGGCGC GGCGGTATGG ACGCGCGACC TCTCCCGCGC GCACCGCATG
AGCCGACGCC TGAAAGCCGG TTCCGTCTTC GTCAATAACT ACAACGACGG CGATATGACC
GTGCCGTTTG GCGGCTATAA GCAGAGCGGC AACGGTCGCG ACAAATCCCT GCATGCCCTT
GAAAAATTCA CCGAACTGAA AACCATCTGG ATAAGCCTGG AGGCCTGA
 
Protein sequence
MNFHHLAYWQ DKALSLAIEN RLFINGEYTA AAENETFETV DPVTQAPLAN IARGKSVDID 
RAVSAARGVF ERGDWSLSSP AKRKAVLNKL ADLIEANAEE LALLETLDTG KPIRHSLRDD
IPGAARAIRW YAEAIDKVYG EVATTSSHEL AMIVREPVGV IAAIVPWNFP LLLTCWKLGP
ALAAGNSVVL KPSEKSPLSA IRLAGLAKEA GLPDGVLNVV TGFGHEAGQA LSRHNDIDAI
AFTGSTRTGK QLLKDAGDSN MKRVWLEAGG KSANIVFADC PDLQQAASAT AAGIFYNQGQ
VCIAGTRLLL EESIADEFLA LLKQQAQNWQ PGHPLDPATT MGTLIDCAHA DSVHSFIQEG
ESKGQLLLDG RNAGLAVAIG PTIIVDVDPN ASLSREEIFG PVLVVTRFTS EEQALQLAND
SQYGLGAAVW TRDLSRAHRM SRRLKAGSVF VNNYNDGDMT VPFGGYKQSG NGRDKSLHAL
EKFTELKTIW ISLEA