Gene EcSMS35_1822 details


Gene Information

Locus tag: EcSMS35_1822
Symbol: puuC
ID: 6143841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1841630
End bp: 1843117
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 58%
IMG OID: 641616698
Product: gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
Protein accession: YP_001743876
Protein GI: 170680173
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
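
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 1841630
END_BP = 1843117


def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1488
print(protein_length(length_bp))  # 495
```

Both results match the Gene Length and Protein Length fields listed in the record.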


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.0935717
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTTTC ATCATCTGGC TTACTGGCAG GATAAAGCGT TAAGTCTCGC CATTGAAACC 
CGCTTATTTA TTAACGGTGA ATATACTGCT GCGGCGGAAA ATGAAACTTT TGAAACCGTT
GATCCGGTCA CCCAGGCACC GCTGGCGAAA ATTGCCCGCG GCAAGAGCGT CGATATCGAC
CGTGCGGTGA GCGCAGCACG CGGCGTATTT GAACGCGGCG ACTGGTCACT CTCTTCTCCG
GCTAAACGTA AAGCGGTACT GAATAAACTC GCCGATTTAA TGGAAGCTAA CGCTGAAGAA
CTGGCGCTGC TGGAAACGCT CGACACCGGC AAACCGATTC GTCACAGCCT GCGTGATGAT
ATTCCCGGCG CGGCGCGCGC CATTCGCTGG TACGCCGAAG CGATCGACAA AGTGTATGGC
GAAGTGGCGA CCACCAGTAG CCATGAGCTG GCGATGATCG TGCGTGAACC GGTCGGCGTG
ATTGCCGCCA TCGTGCCGTG GAACTTCCCG CTGCTGCTGA CTTGCTGGAA ACTCGGCCCG
GCGCTGGCGG CGGGAAACAG CGTGATTCTA AAACCGTCTG AAAAATCACC GCTCAGTGCG
ATTCGTCTCG CGGGGCTGGC GAAAGAAGCG GGGCTGCCGG ATGGTGTGTT GAACGTGGTG
ACGGGTTTTG GTCATGAAGC CGGGCAGGCG CTTTCGCGCC ATAACGATAT CGATGCCATT
GCCTTTACTG GCTCAACCCG CACCGGGAAA CAGCTGCTGA AGGACGCGGG CGACAGCAAC
ATGAAACGCG TCTGGCTGGA GGCGGGCGGC AAAAGCGCCA ACATCGTTTT CGCCGACTGC
CCGGATTTAC AACAGGCAGC AAGCGCCACC GCCGCAGGCA TTTTCTACAA CCAGGGCCAG
GTGTGCATCG CCGGAACGCG TTTGTTGCTG GAAGAGAGCA TCGCCGATGA ATTCTTAGCC
CTGTTAAAAC AGCAGGCGCA AAACTGGCAG CCGGGCCATC CACTTGATCC CGCAACCACC
ATGGGCACCT TAATCGACTG CGCCCACGCC GACTCGGTCC ATAGCTTTAT TCGGGAAGGC
GAAAGCAAAG GGCAACTGCT GCTGGATGGC CGTAACGCCG AGCTGGCTGC CGCCATCGGC
CCGACCATCT TTGTGGAGGT AGACCCGAAT GCGTCCTTAA GCCGCGAAGA GATTTTCGGT
CCGGTGCTGG TGGTCACGCG TTTCACATCA GAAGAACAGG CGCTACAGCT TGCCAACGAC
TGCCAGTACG GACTTGGCGC GGCGGTATGG ACGCGCGACC TCTCCCGCGC GCACCGCATG
AGTCGCCGCC TGAAAGCCGG CTCCGTCTTC GTCAATAACT ACAACGACGG CGATATGACC
GTGCCGTTTG GCGGCTATAA GCAGAGCGGC AACGGGCGCG ACAAATCCCT GCATGCCCTT
GACAAATTCA CCGAACTGAA AACCATCTGG ATAAGCCTGG AGGCCTGA
 
Protein sequence
MNFHHLAYWQ DKALSLAIET RLFINGEYTA AAENETFETV DPVTQAPLAK IARGKSVDID 
RAVSAARGVF ERGDWSLSSP AKRKAVLNKL ADLMEANAEE LALLETLDTG KPIRHSLRDD
IPGAARAIRW YAEAIDKVYG EVATTSSHEL AMIVREPVGV IAAIVPWNFP LLLTCWKLGP
ALAAGNSVIL KPSEKSPLSA IRLAGLAKEA GLPDGVLNVV TGFGHEAGQA LSRHNDIDAI
AFTGSTRTGK QLLKDAGDSN MKRVWLEAGG KSANIVFADC PDLQQAASAT AAGIFYNQGQ
VCIAGTRLLL EESIADEFLA LLKQQAQNWQ PGHPLDPATT MGTLIDCAHA DSVHSFIREG
ESKGQLLLDG RNAELAAAIG PTIFVEVDPN ASLSREEIFG PVLVVTRFTS EEQALQLAND
CQYGLGAAVW TRDLSRAHRM SRRLKAGSVF VNNYNDGDMT VPFGGYKQSG NGRDKSLHAL
DKFTELKTIW ISLEA
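
The two sequences above are related by translation under table 11. A minimal sketch of that step in plain Python follows, with no external libraries. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate` and `gc_percent` are illustrative, and the example input is the first 30 bases of the gene sequence above.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence display."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_30 = "ATGAATTTTC ATCATCTGGC TTACTGGCAG"
print(translate(first_30))  # MNFHHLAYWQ
```

Pasting the full gene sequence into `translate` and `gc_percent` gives a way to compare against the Protein sequence and GC content fields of the record.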