Gene Information |
Locus tag | EcSMS35_4895 |
Symbol | |
ID | 6143763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 5014092 |
End bp | 5015312 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619698 |
Product | putative D-serine ammonia-lyase |
Protein accession | YP_001746805 |
Protein GI | 170680265 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3616] Predicted amino acid aldolase or racemase |
TIGRFAM ID | |
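The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the function name `check_lengths` is illustrative and not part of the source database.

```python
# Values copied from the record above. The inclusive-coordinate convention
# is an assumption, not something the record states.
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_len_bp // 3         # number of whole codons in the CDS
    return {
        "span_matches": span == gene_len_bp,
        "whole_codons": gene_len_bp % 3 == 0,
        "protein_matches": codons - 1 == protein_len_aa,  # minus one stop codon
    }

result = check_lengths(5014092, 5015312, 1221, 406)
print(result)
```

Under these assumptions all three checks hold: 5015312 - 5014092 + 1 = 1221 bp, and 1221 / 3 = 407 codons, one of which is the stop codon, leaving 406 aa.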
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.333429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAATACC ACTCAGAACC TCTGGTTCCC CATAAATCTG CCCTGATGCA AATGCCCGCA AATCTCCTTG CCGAAGATGT CTGTTTACCT GCCGCGATCA TTAAAAAGCA GGCCCTGGAG AACAACATTA CGTGGATGCA ACGCTACGCT GACGCGCGCG GCGTTTCACT GGCACCGCAC GGTAAAACCA CCATGACGCC GTGGATTTTT CAGGCGCAGC AGGCGGCTGG TGCCTGGGCA ATAGGTGTCG GTAGCGCATG GCAGGCTGGT GCGGCAATGG CGAGCGGTAT TCAGCGAGTG CTGATGGTTA ACCAGCTGGT CGGCAAGGCG AATATGCAGT TGATTGCCCA GTTGCAACGA CATTACCCGA CGGTCGATTT TATCAGCTGT ATCGACAGCA TTGAGAACGC CCGCGCTTTG TCTGCCTTTT TTGCCAGTCA GCAACAGACA CTGAACGTGA TGATTGAGTT GGGTGTCCCC GGCGGGCGCT GTGGTTGCCG CTCAAGGGAT GCCGCACTGA CACTGGCTAA ACAGGCAGCC CAATTACCGG GGTTACGCCT GCGGGGGCTT GAGTTATATG AAGGCGTTTT ACACGGTGAC GATCCACAAC CGCAGGTAGA AGCTCTGCTG CGGGATGCCG CACAACTGGC CTGTGATATG GCAAGTCTGG TTGATGGCGA GTTTATTCTC ACCGGAGCAG GCACGGTCTG GTATGACGTG GTGTGCAATA TCTGGCTGGC GGCAGAAAAG CCTGATAACT GCCGCATTGT TATTCGCCCT GGCTGCTACA TCACTCACGA CACCGGGATC TACGACGCGG CACAACAACA GTTGCTTGCC CGTGATCCTG TGGCTTGCGA TCTGGCAGGC GATCTCACAT CTGCACTGGA ACTGGTCGCC ATGGTGCAAT CGGTGCCGGA AGCTGATCGT GCGGTGGTTA ATTTTGGTAA GCGTGATTGC GCCTTCGACG CCGGACTGCC GCAACCAGTC GCCCACTATC GCAACGGCAA ATCACTGGCA TTTGATCCAC AGGCGATTCG CAGTACAGGC ATTATGGATC AGCACTGCAT GTTGCAGTTG GGTGCCGACA GCGACGTGCA AGTGGGGGAT ATTCTGGTGT TTGGCACATC GCATCCGTGC CTGACCTTCG ACAAATGGAA AACGTTGTTA TTGACTGATG ACGACTACAA CGTACTGGCA GAATTAGACA CTCTCTTCTA A
Protein sequence | MKYHSEPLVP HKSALMQMPA NLLAEDVCLP AAIIKKQALE NNITWMQRYA DARGVSLAPH GKTTMTPWIF QAQQAAGAWA IGVGSAWQAG AAMASGIQRV LMVNQLVGKA NMQLIAQLQR HYPTVDFISC IDSIENARAL SAFFASQQQT LNVMIELGVP GGRCGCRSRD AALTLAKQAA QLPGLRLRGL ELYEGVLHGD DPQPQVEALL RDAAQLACDM ASLVDGEFIL TGAGTVWYDV VCNIWLAAEK PDNCRIVIRP GCYITHDTGI YDAAQQQLLA RDPVACDLAG DLTSALELVA MVQSVPEADR AVVNFGKRDC AFDAGLPQPV AHYRNGKSLA FDPQAIRSTG IMDQHCMLQL GADSDVQVGD ILVFGTSHPC LTFDKWKTLL LTDDDYNVLA ELDTLF
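The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is given 5' to 3' on the coding strand (as displayed), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, differing only in which start codons are permitted. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()   # drop the 10-base spacing used above
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]   # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence in this record.
prefix = "ATGAAATACC ACTCAGAACC TCTGGTTCCC"
print(translate(prefix))  # MKYHSEPLVP
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function should reproduce the full protein, ending at the terminal TAA stop codon.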