Gene ECH74115_3700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3700 
SymboldapA 
ID6967689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3419094 
End bp3419972 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content52% 
IMG OID643387494 
Productdihydrodipicolinate synthase 
Protein accessionYP_002271947 
Protein GI209396670 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID[TIGR00674] dihydrodipicolinate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones100 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCACGG GAAGTATTGT CGCGATTGTT ACTCCGATGG ATGAAAAAGG TAATGTCTGT 
CGGGCTAGCT TGAAAAAACT GATTGATTAT CATGTCGCCA GCGGTACTTC GGCGATCGTT
TCTGTTGGCA CCACTGGCGA GTCCGCTACC TTAAATCATG ACGAACATGC TGATGTGGTG
ATGATGACGC TGGAGCTGGC TGACGGGCGC ATTCCGGTGA TTGCCGGGAC CGGTGCTAAC
GCTACTGCGG AAGCCATTAG CCTGACGCAG CGCTTCAATG ACAGTGGTAT CGTCGGCTGC
CTGACGGTAA CCCCTTACTA CAATCGTCCG TCGCAAGAAG GTTTGTATCA GCATTTCAAA
GCCATCGCTG AGCATACTGA CCTGCCGCAA ATTCTGTATA ATGTGCCGTC CCGTACTGGC
TGCGATCTGC TCCCGGAAAC GGTGGGCCGT CTGGCGAAAG TAAAAAATAT TATCGGAATC
AAAGAGGCAA CAGGGAACTT AACGCGTGTA AACCAGATCA AAGAGCTGGT TTCAGATGAT
TTTGTTCTGC TGAGCGGCGA TGATGCGAGC GCGCTGGACT TCATGCAATT AGGCGGTCAT
GGGGTTATTT CCGTTACGGC TAACGTCGCA GCGCGTGATA TGGCCCAGAT GTGCAAACTG
GCAGCAGAAG GGCATTTTGC CGAGGCACGC GTTATTAATC AGCGTCTGAT GCCATTACAC
AACAAACTAT TTGTCGAACC CAATCCAATC CCGGTGAAAT GGGCATGTAA GGAACTGGGT
CTTGTGGCGA CCGATACGCT GCGCCTGCCA ATGACACCAA TCACCGACAG TGGTCGTGAG
ACGGTCAGAG CGGCGCTTAA GCATGCCGGT TTGCTGTAA
 
Protein sequence
MFTGSIVAIV TPMDEKGNVC RASLKKLIDY HVASGTSAIV SVGTTGESAT LNHDEHADVV 
MMTLELADGR IPVIAGTGAN ATAEAISLTQ RFNDSGIVGC LTVTPYYNRP SQEGLYQHFK
AIAEHTDLPQ ILYNVPSRTG CDLLPETVGR LAKVKNIIGI KEATGNLTRV NQIKELVSDD
FVLLSGDDAS ALDFMQLGGH GVISVTANVA ARDMAQMCKL AAEGHFAEAR VINQRLMPLH
NKLFVEPNPI PVKWACKELG LVATDTLRLP MTPITDSGRE TVRAALKHAG LL