Gene EcSMS35_4896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4896 
Symbol 
ID6144432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp5015513 
End bp5016955 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content55% 
IMG OID641619699 
ProductN-acyl-D-amino-acid deacylase family protein 
Protein accessionYP_001746806 
Protein GI170683584 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3653] N-acyl-D-aspartate/D-glutamate deacylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.591808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTTG ACTGGTTAAT CAAAAATGTG ACCGTTATCG ACGGTAGCGG CGGCCCTGAA 
TTTCGCGGTG ACGTGGCAAT AACAGGTGAT CGGATTGTCG ATATTGCCCC TGCGCTTAAC
GTTACGGCGC AGCAGGTCAT TGACGGGGAA GGGCGGGTGC TGGCACCAGG GTTTATCGAC
GTTCATACCC ATGACGATAT CAACGTTATT CGCATCCCGG AATATCTGCC AAAGATCAGC
CAGGGGATCA CTACGGTGAT TGTCGGTAAC TGCGGGATTA GTGCGGCGTC GGCGAAAATG
AAAGGTGAAG TTCCTGACCC GATGAATCTG TTGGGGGAAG CAGAGCACTT TATTTATCCT
ACCGTTGAAA GCTATGCCCA GGCAGTGGAA GCAGCCAGGC CGTCACTGAA CGTTGGCACG
CTGATTGGTC ATACGGCGCT GCGTAATAAC CATATGGACG ACCTGTTTCG CCCGGCGACG
GTGGATGAAA TTGCCGCTAT GCGTGCCGAC TTACGTCTGG CGTTGAGTCA GGGGGCGCTG
GGGTTAAGTT CCGGGCTGGC CTATGCAACG GCATTTCAGG CGACTACCGA AGAAGTCATG
GCGCTGGCGG AAGAATTAGC CGGTGAGAAG GGCGTTTATA CCACCCATCT GCGTTCTGAA
TTTGAACCGA TTCTCGATGC CCTGGATGAG GCGTTTCGTA TTGGGCGTCA TGGCAAAGTG
CCTGTCGTGG TTTCCCACCA TAAATGCGCC GGGGCGAAAA ACTGGGGGCG GACAAAAGAA
ACGTTGGCCT TTTTTGACCA AATGCGTCAA CACCAGGAAA TTGGCTGCGA TGTCTACCCT
TACTCAGCCA GCTCTTCAAC GCTGGATCTC AAACAAGTCA CCGACGAATT CGATATTGTG
ATCACCTGGT CACAAACACA TCCGCAGCAG GCCGGGAAAA CACTGGCGCA AATCGCCATC
GACTGGCAGA TGAGTATGCT GGAGGCCGCG AAGCTGCTAA TGCCTGCCGG GGCTATCTAT
TACAACATGG ACGAGCGCGA CGTCCGCCGA GTGTTGAGTT ATCCGGTCAG TATGATTGGC
TCTGACGGCC TGCCCAATGA TCCCATGCCG CATCCACGTT TATGGGGCGC CTTCCCTCGC
GTGCTGGGCC ACTATTGCCG TGATGAAGGC TTATTCCCAC TGACCACGGC TATCCACAAA
ATGACCGGGC TTTCTGCCAG CCGTTTTCGC CTGCCCCAGC GTGGGCTGGT GAAAGTTGGC
TATTTTGCGG ACCTGGTGTT GTTCGATCCA CAAACCATTC GTGATGTTGC CAGCTTTTCT
GATCCCAAAC GCCCGGCAGA TGGCATTGAG GCGGTGATGG TGAACGGCGT TATGAGCTAT
GGTCCCGATA AACATATTAC AGGCCGTGCG GGACGCTTCC TGCGCCGGCA GACCTCACAT
TAA
 
Protein sequence
MQVDWLIKNV TVIDGSGGPE FRGDVAITGD RIVDIAPALN VTAQQVIDGE GRVLAPGFID 
VHTHDDINVI RIPEYLPKIS QGITTVIVGN CGISAASAKM KGEVPDPMNL LGEAEHFIYP
TVESYAQAVE AARPSLNVGT LIGHTALRNN HMDDLFRPAT VDEIAAMRAD LRLALSQGAL
GLSSGLAYAT AFQATTEEVM ALAEELAGEK GVYTTHLRSE FEPILDALDE AFRIGRHGKV
PVVVSHHKCA GAKNWGRTKE TLAFFDQMRQ HQEIGCDVYP YSASSSTLDL KQVTDEFDIV
ITWSQTHPQQ AGKTLAQIAI DWQMSMLEAA KLLMPAGAIY YNMDERDVRR VLSYPVSMIG
SDGLPNDPMP HPRLWGAFPR VLGHYCRDEG LFPLTTAIHK MTGLSASRFR LPQRGLVKVG
YFADLVLFDP QTIRDVASFS DPKRPADGIE AVMVNGVMSY GPDKHITGRA GRFLRRQTSH