Gene ECH74115_4745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4745 
Symbolasd 
ID6968050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4392655 
End bp4393758 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content54% 
IMG OID643388446 
Productaspartate-semialdehyde dehydrogenase 
Protein accessionYP_002272874 
Protein GI209395764 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0136] Aspartate-semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01745] aspartate-semialdehyde dehydrogenase, gamma-proteobacterial 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATG TTGGTTTTAT CGGCTGGCGC GGTATGGTCG GCTCCGTTCT CATGCAACGC 
ATGGTTGAAG AGCGCGACTT CGACGCCATT CGCCCTGTCT TCTTTTCTAC TTCTCAGCTT
GGTCAGGCTG CGCCGTCTTT TGGCGGAACC ACTGGCACAC TTCAGGATGC CTTTGATCTG
GAGGCGCTAA AGGCCCTCGA TATCATTGTG ACCTGTCAGG GCGGCGATTA TACCAACGAA
ATCTATCCAA AGCTTCGTGA AAGCGGATGG CAGGGTTACT GGATTGACGC AGCATCGTCT
CTGCGCATGA AAGATGACGC CATCATCATT CTTGACCCCG TCAATCAGGA CGTCATTACC
GACGGATTAA ATAATGGCAT CAGGACTTTT GTTGGCGGTA ACTGTACCGT AAGCCTGATG
TTGATGTCGT TGGGTGGTTT ATTCGCCAAT GATCTTGTTG ATTGGGTGTC CGTTGCAACC
TACCAGGCCG CTTCCGGCGG TGGTGCGCGG CATATGCGTG AGTTATTAAC CCAGATGGGC
CATCTGTATG GCCATGTAGC AGATGAACTC GCGACCCCGT CCTCTGCTAT TCTCGATATC
GAACGCAAAG TCACAACCTT AACCCGTAGC GGTGAGCTGC CGGTGGATAA CTTTGGCGTG
CCGCTGGCGG GTAGCCTGAT TCCGTGGATC GACAAACAGC TCGATAACGG TCAGAGCCGC
GAAGAGTGGA AAGGGCAGGC GGAAACCAAC AAGATCCTCA ACACATCTTC CGTAATTCCG
GTAGATGGTT TATGTGTGCG TGTCGGGGCA TTGCGCTGCC ACAGCCAGGC ATTCACTATT
AAATTGAAAA AAGATGTGTC GATTCCGACC GTGGAAGAAC TGCTGGCTGC GCACAATCCG
TGGGCGAAAG TCGTTCCGAA CGATCGGGAA ATCACTATGC GTGAGCTAAC CCCAGCTGCC
GTTACCGGCA CGCTGACCAC GCCGGTAGGC CGCCTGCGTA AGCTGAATAT GGGACCAGAG
TTCCTGTCAG CCTTTACCGT GGGCGACCAG CTGCTGTGGG GGGCCGCGGA GCCGCTGCGT
CGGATGCTTC GTCAACTGGC GTAA
 
Protein sequence
MKNVGFIGWR GMVGSVLMQR MVEERDFDAI RPVFFSTSQL GQAAPSFGGT TGTLQDAFDL 
EALKALDIIV TCQGGDYTNE IYPKLRESGW QGYWIDAASS LRMKDDAIII LDPVNQDVIT
DGLNNGIRTF VGGNCTVSLM LMSLGGLFAN DLVDWVSVAT YQAASGGGAR HMRELLTQMG
HLYGHVADEL ATPSSAILDI ERKVTTLTRS GELPVDNFGV PLAGSLIPWI DKQLDNGQSR
EEWKGQAETN KILNTSSVIP VDGLCVRVGA LRCHSQAFTI KLKKDVSIPT VEELLAAHNP
WAKVVPNDRE ITMRELTPAA VTGTLTTPVG RLRKLNMGPE FLSAFTVGDQ LLWGAAEPLR
RMLRQLA