Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4896 |
Symbol | |
ID | 6144432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 5015513 |
End bp | 5016955 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619699 |
Product | N-acyl-D-amino-acid deacylase family protein |
Protein accession | YP_001746806 |
Protein GI | 170683584 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3653] N-acyl-D-aspartate/D-glutamate deacylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.591808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTTG ACTGGTTAAT CAAAAATGTG ACCGTTATCG ACGGTAGCGG CGGCCCTGAA TTTCGCGGTG ACGTGGCAAT AACAGGTGAT CGGATTGTCG ATATTGCCCC TGCGCTTAAC GTTACGGCGC AGCAGGTCAT TGACGGGGAA GGGCGGGTGC TGGCACCAGG GTTTATCGAC GTTCATACCC ATGACGATAT CAACGTTATT CGCATCCCGG AATATCTGCC AAAGATCAGC CAGGGGATCA CTACGGTGAT TGTCGGTAAC TGCGGGATTA GTGCGGCGTC GGCGAAAATG AAAGGTGAAG TTCCTGACCC GATGAATCTG TTGGGGGAAG CAGAGCACTT TATTTATCCT ACCGTTGAAA GCTATGCCCA GGCAGTGGAA GCAGCCAGGC CGTCACTGAA CGTTGGCACG CTGATTGGTC ATACGGCGCT GCGTAATAAC CATATGGACG ACCTGTTTCG CCCGGCGACG GTGGATGAAA TTGCCGCTAT GCGTGCCGAC TTACGTCTGG CGTTGAGTCA GGGGGCGCTG GGGTTAAGTT CCGGGCTGGC CTATGCAACG GCATTTCAGG CGACTACCGA AGAAGTCATG GCGCTGGCGG AAGAATTAGC CGGTGAGAAG GGCGTTTATA CCACCCATCT GCGTTCTGAA TTTGAACCGA TTCTCGATGC CCTGGATGAG GCGTTTCGTA TTGGGCGTCA TGGCAAAGTG CCTGTCGTGG TTTCCCACCA TAAATGCGCC GGGGCGAAAA ACTGGGGGCG GACAAAAGAA ACGTTGGCCT TTTTTGACCA AATGCGTCAA CACCAGGAAA TTGGCTGCGA TGTCTACCCT TACTCAGCCA GCTCTTCAAC GCTGGATCTC AAACAAGTCA CCGACGAATT CGATATTGTG ATCACCTGGT CACAAACACA TCCGCAGCAG GCCGGGAAAA CACTGGCGCA AATCGCCATC GACTGGCAGA TGAGTATGCT GGAGGCCGCG AAGCTGCTAA TGCCTGCCGG GGCTATCTAT TACAACATGG ACGAGCGCGA CGTCCGCCGA GTGTTGAGTT ATCCGGTCAG TATGATTGGC TCTGACGGCC TGCCCAATGA TCCCATGCCG CATCCACGTT TATGGGGCGC CTTCCCTCGC GTGCTGGGCC ACTATTGCCG TGATGAAGGC TTATTCCCAC TGACCACGGC TATCCACAAA ATGACCGGGC TTTCTGCCAG CCGTTTTCGC CTGCCCCAGC GTGGGCTGGT GAAAGTTGGC TATTTTGCGG ACCTGGTGTT GTTCGATCCA CAAACCATTC GTGATGTTGC CAGCTTTTCT GATCCCAAAC GCCCGGCAGA TGGCATTGAG GCGGTGATGG TGAACGGCGT TATGAGCTAT GGTCCCGATA AACATATTAC AGGCCGTGCG GGACGCTTCC TGCGCCGGCA GACCTCACAT TAA
|
Protein sequence | MQVDWLIKNV TVIDGSGGPE FRGDVAITGD RIVDIAPALN VTAQQVIDGE GRVLAPGFID VHTHDDINVI RIPEYLPKIS QGITTVIVGN CGISAASAKM KGEVPDPMNL LGEAEHFIYP TVESYAQAVE AARPSLNVGT LIGHTALRNN HMDDLFRPAT VDEIAAMRAD LRLALSQGAL GLSSGLAYAT AFQATTEEVM ALAEELAGEK GVYTTHLRSE FEPILDALDE AFRIGRHGKV PVVVSHHKCA GAKNWGRTKE TLAFFDQMRQ HQEIGCDVYP YSASSSTLDL KQVTDEFDIV ITWSQTHPQQ AGKTLAQIAI DWQMSMLEAA KLLMPAGAIY YNMDERDVRR VLSYPVSMIG SDGLPNDPMP HPRLWGAFPR VLGHYCRDEG LFPLTTAIHK MTGLSASRFR LPQRGLVKVG YFADLVLFDP QTIRDVASFS DPKRPADGIE AVMVNGVMSY GPDKHITGRA GRFLRRQTSH
|
| |