Gene EcSMS35_2986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2986 
SymbollysA 
ID6146940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3066270 
End bp3067532 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content53% 
IMG OID641617855 
Productdiaminopimelate decarboxylase 
Protein accessionYP_001745007 
Protein GI170683211 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0454398 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACATT CACTGTTCAG CACTGATACC GATCTCACCG CCGAAAATCT GCTGCGTTTG 
CCCGCAGAAT TTGGCTGCCC GGTGTGGGTC TACGATGCGC AAATTATTCG TCGGCAGATT
GCAGCGCTGA AACAGTTTGA TGTGGTGCGC TTCGCACAGA AAGCCTGTTC CAATATTCAT
ATTTTGCGCT TAATGCGTGA GCAGGGCGTG AAAGTGGATT CCGTCTCGTT AGGCGAAATA
GAGCGTGCGT TGGCGGCGGG TTACAATCCG CAAACGCACC CCGATGATAT TGTTTTTACG
GCAGATGTTA TCGATCAGGC GACGCTTGAA CGCGTCAGTG AATTGCAAAT TCCGGTGAAT
GCGGGTTCTG TTGATATGCT CGACCAACTG GGTCAGGTTT CGCCAGGGCA TCGGGTATGG
CTGCGTGTTA ATCCGGGGTT TGGTCACGGG CATAGCCAAA AAACCAATAC TGGTGGCGAA
AACAGCAAGC ACGGTATCTG GTACACCGAT CTGCCCGCCG CACTGGACGT GATACAACGT
CATCATCTAC AGCTGGTCGG CATTCACATG CACATTGGTT CTGGCGTTGA TTATGCCCAT
CTGGAACAGG TATGTGGTGC TATGGTGCGT CAGGTCCTCG AATTCGGTCA GGATTTACAG
GCTATTTCTG CGGGCGGTGG GCTTTCTATT CCTTATCAAC AGGGTGAAGA GGCGGTTGAT
ACCGAACATT ATTATGGTCT GTGGAATGCC GCGCGTGAGC AAATCGCCCG CCATTTGGGC
CACCCTGTGA AACTGGAAAT TGAACCGGGT CGCTTCCTGG TAGCGCAGTC TGGCGTGTTA
ATTACTCAGG TGCGGAGCGT CAAACAAATG GGTAGCCGCC ACTTTGTGCT GGTTGATGCC
GGGTTCAACG ATCTGATGCG TCCGGCAATG TACGGTAGTT ACCACCATAT CAGTGCCCTG
GCAGCTGATG GTCGTTCACT GGAACACGCA CCAACGGTGG AAACCGTCGT CGCCGGGCCG
TTATGTGAAT CGGGCGATGT CTTTACCCAG CAGGAAGGGG GAAATGTTGA AACCCGCGCC
TTGCCGGAAG TGAAGGCAGG GGATTATCTG GTACTGCATG ATACGGGGGC ATATGGCGCG
TCAATGTCAT CCAACTACAA TAGCCGTCCG CTGTTACCAG AAGTTCTGTT TGATAATGGT
CAGGCGCGGT TAATTCGCCG TCGCCAGACC ATCGAAGAAT TACTGGCACT GGAATTGCTT
TAA
 
Protein sequence
MPHSLFSTDT DLTAENLLRL PAEFGCPVWV YDAQIIRRQI AALKQFDVVR FAQKACSNIH 
ILRLMREQGV KVDSVSLGEI ERALAAGYNP QTHPDDIVFT ADVIDQATLE RVSELQIPVN
AGSVDMLDQL GQVSPGHRVW LRVNPGFGHG HSQKTNTGGE NSKHGIWYTD LPAALDVIQR
HHLQLVGIHM HIGSGVDYAH LEQVCGAMVR QVLEFGQDLQ AISAGGGLSI PYQQGEEAVD
TEHYYGLWNA AREQIARHLG HPVKLEIEPG RFLVAQSGVL ITQVRSVKQM GSRHFVLVDA
GFNDLMRPAM YGSYHHISAL AADGRSLEHA PTVETVVAGP LCESGDVFTQ QEGGNVETRA
LPEVKAGDYL VLHDTGAYGA SMSSNYNSRP LLPEVLFDNG QARLIRRRQT IEELLALELL