Gene Information |
Locus tag | EcSMS35_2986 |
Symbol | lysA |
ID | 6146940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3066270 |
End bp | 3067532 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617855 |
Product | diaminopimelate decarboxylase |
Protein accession | YP_001745007 |
Protein GI | 170683211 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0019] Diaminopimelate decarboxylase |
TIGRFAM ID | [TIGR01048] diaminopimelate decarboxylase |
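
The coordinate and length fields above can be checked against each other with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3066270
end_bp = 3067532

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1263 420
```

Under these assumptions the results agree with the Gene Length (1263 bp) and Protein Length (420 aa) fields.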
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0454398 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACATT CACTGTTCAG CACTGATACC GATCTCACCG CCGAAAATCT GCTGCGTTTG CCCGCAGAAT TTGGCTGCCC GGTGTGGGTC TACGATGCGC AAATTATTCG TCGGCAGATT GCAGCGCTGA AACAGTTTGA TGTGGTGCGC TTCGCACAGA AAGCCTGTTC CAATATTCAT ATTTTGCGCT TAATGCGTGA GCAGGGCGTG AAAGTGGATT CCGTCTCGTT AGGCGAAATA GAGCGTGCGT TGGCGGCGGG TTACAATCCG CAAACGCACC CCGATGATAT TGTTTTTACG GCAGATGTTA TCGATCAGGC GACGCTTGAA CGCGTCAGTG AATTGCAAAT TCCGGTGAAT GCGGGTTCTG TTGATATGCT CGACCAACTG GGTCAGGTTT CGCCAGGGCA TCGGGTATGG CTGCGTGTTA ATCCGGGGTT TGGTCACGGG CATAGCCAAA AAACCAATAC TGGTGGCGAA AACAGCAAGC ACGGTATCTG GTACACCGAT CTGCCCGCCG CACTGGACGT GATACAACGT CATCATCTAC AGCTGGTCGG CATTCACATG CACATTGGTT CTGGCGTTGA TTATGCCCAT CTGGAACAGG TATGTGGTGC TATGGTGCGT CAGGTCCTCG AATTCGGTCA GGATTTACAG GCTATTTCTG CGGGCGGTGG GCTTTCTATT CCTTATCAAC AGGGTGAAGA GGCGGTTGAT ACCGAACATT ATTATGGTCT GTGGAATGCC GCGCGTGAGC AAATCGCCCG CCATTTGGGC CACCCTGTGA AACTGGAAAT TGAACCGGGT CGCTTCCTGG TAGCGCAGTC TGGCGTGTTA ATTACTCAGG TGCGGAGCGT CAAACAAATG GGTAGCCGCC ACTTTGTGCT GGTTGATGCC GGGTTCAACG ATCTGATGCG TCCGGCAATG TACGGTAGTT ACCACCATAT CAGTGCCCTG GCAGCTGATG GTCGTTCACT GGAACACGCA CCAACGGTGG AAACCGTCGT CGCCGGGCCG TTATGTGAAT CGGGCGATGT CTTTACCCAG CAGGAAGGGG GAAATGTTGA AACCCGCGCC TTGCCGGAAG TGAAGGCAGG GGATTATCTG GTACTGCATG ATACGGGGGC ATATGGCGCG TCAATGTCAT CCAACTACAA TAGCCGTCCG CTGTTACCAG AAGTTCTGTT TGATAATGGT CAGGCGCGGT TAATTCGCCG TCGCCAGACC ATCGAAGAAT TACTGGCACT GGAATTGCTT TAA
|
Protein sequence | MPHSLFSTDT DLTAENLLRL PAEFGCPVWV YDAQIIRRQI AALKQFDVVR FAQKACSNIH ILRLMREQGV KVDSVSLGEI ERALAAGYNP QTHPDDIVFT ADVIDQATLE RVSELQIPVN AGSVDMLDQL GQVSPGHRVW LRVNPGFGHG HSQKTNTGGE NSKHGIWYTD LPAALDVIQR HHLQLVGIHM HIGSGVDYAH LEQVCGAMVR QVLEFGQDLQ AISAGGGLSI PYQQGEEAVD TEHYYGLWNA AREQIARHLG HPVKLEIEPG RFLVAQSGVL ITQVRSVKQM GSRHFVLVDA GFNDLMRPAM YGSYHHISAL AADGRSLEHA PTVETVVAGP LCESGDVFTQ QEGGNVETRA LPEVKAGDYL VLHDTGAYGA SMSSNYNSRP LLPEVLFDNG QARLIRRRQT IEELLALELL
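
The sequences above are printed in blocks of ten characters. The following is a minimal sketch of how the gene sequence could be cleaned up and translated. It assumes the listed gene sequence is already the coding strand (it begins with ATG and ends with TAA, although the gene lies on the minus strand) and uses the amino acid assignments of translation table 11 written in NCBI's TCAG codon order. The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11);
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGCCACATT CACTGTTCAG CACTGATACC"
print(translate(prefix))  # MPHSLFSTDT
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.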