Gene EcSMS35_3140 details

Gene Information

Locus tag: EcSMS35_3140
Symbol: amiC2
ID: 6144546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3227446
End bp: 3228705
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 52%
IMG OID: 641617999
Product: N-acetylmuramoyl-L-alanine amidase AmiC
Protein accession: YP_001745149
Protein GI: 170683287
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0860] N-acetylmuramoyl-L-alanine amidase
TIGRFAM ID:
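
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 3227446
end_bp = 3228705
gene_length_bp = 1260
protein_length_aa = 419


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # 1260 419
```

Both computed values match the Gene Length and Protein Length fields of the record.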


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGTAT CTAACTATAA TTATGTAATC AGCCGTCGTC GTTTACTGCA AGGTGCGGGC 
GCCATGTGGC TATTGAGCGT AAGTCAGGTC AGCCTGGCTG CGGTCAGCCA GGTCGTGGCG
GTACGCGTCT GGCCTGCGTC CAGCTACACC CGCGTGACGG TAGAATCAAA TCGTCAGCTG
CAATATAAGC AGTTCGCGTT AAGTAACCCT GAACGTGTGG TGGTGGATAT CGAAGATGTA
AACCTGAACT CGGTACTCAA GGGGATGGCG GCACAAATTC GTGCAGACGA CCCGTTCATC
AAGTCGGCGC GCGTCGGGCA ATTTGACCCG CAAACCGTAC GCATGGTTTT TGAATTAAAG
CAAAACGTAA AACCGCAGCT GTTTGCCCTT GCGCCGGTCG CCGGGTTTAA AGAGCGTCTG
GTGATGGACC TCTATCCGGC CAATGCACAG GATATGCAGG ACCCGCTGCT GGCGCTGCTG
GAGGATTACA ACAAAGGCGA CCTCGAAAAG CAGGTGCCGC CAGCACAAAG TGGTCCACAA
CCGGGTAAAG CAGGGCGTGA TCGTCCGATT GTCATTATGC TTGACCCCGG CCACGGTGGC
GAAGACCCCG GTGCGGTGGG GAAATACAAA ACACGCGAAA AAGACGTGGT ATTGCAAATA
GCTCGCCGTC TGCGCTCTCT GATCGAAAAA GAGGGTAATA TGAAGGTGTA CATGACGCGC
AATGAAGACA TCTTCATTCC GTTGCAAGTG CGTGTAGCAA AAGCACAGAA ACAACGTGCT
GACCTTTTTG TTTCTATCCA TGCTGATGCC TTTACCAGTC GCCAGCCGAG CGGTTCTTCC
GTGTTTGCGC TTTCGACCAA AGGCGCGACA AGTACTGCGG CAAAATATCT GGCACAAACC
CAGAACGCCT CGGACTTGAT TGGCGGCGTG AGCAAAAGCG GTGACCGCTA TGTCGACCAC
ACCATGTTCG ATATGGTGCA GTCTCTGACC ATTGCCGACA GCCTGAAGTT GGGTAAAGCG
GTGCTGAATA AGCTCGGTAA AATCAACAAG CTGCATAAAA ATCAGGTTGA ACAGGCCGGG
TTTGCCGTGC TGAAAGCACC TGATATTCCC TCCATTCTGG TCGAAACGGC GTTTATCAGT
AACGTTGAGG AAGAGCGCAA ACTGAAAACG GCAAGATTCC AACAACAAGT GGCAGAATCT
ATCCTAGAGG GAATTAAAGA GTATTTTTCG GATATAGAAG CGTTAGCGAG AAAAGCATAA
 
Protein sequence
MLVSNYNYVI SRRRLLQGAG AMWLLSVSQV SLAAVSQVVA VRVWPASSYT RVTVESNRQL 
QYKQFALSNP ERVVVDIEDV NLNSVLKGMA AQIRADDPFI KSARVGQFDP QTVRMVFELK
QNVKPQLFAL APVAGFKERL VMDLYPANAQ DMQDPLLALL EDYNKGDLEK QVPPAQSGPQ
PGKAGRDRPI VIMLDPGHGG EDPGAVGKYK TREKDVVLQI ARRLRSLIEK EGNMKVYMTR
NEDIFIPLQV RVAKAQKQRA DLFVSIHADA FTSRQPSGSS VFALSTKGAT STAAKYLAQT
QNASDLIGGV SKSGDRYVDH TMFDMVQSLT IADSLKLGKA VLNKLGKINK LHKNQVEQAG
FAVLKAPDIP SILVETAFIS NVEEERKLKT ARFQQQVAES ILEGIKEYFS DIEALARKA
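
The protein sequence can be compared with a translation of the gene sequence. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string (translation table 11 uses the same codon-to-residue assignments as the standard code; its alternative start codons are not handled here, which does not matter for this gene because it starts with ATG). Only the first and last 60-nt lines of the gene sequence are used, and the `translate` helper is a hypothetical name.

```python
from itertools import product

# Codon order T, C, A, G at each position, as in the NCBI genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence (each 60 nt, both in frame).
first_line = "ATGCTGGTAT CTAACTATAA TTATGTAATC AGCCGTCGTC GTTTACTGCA AGGTGCGGGC"
last_line = "ATCCTAGAGG GAATTAAAGA GTATTTTTCG GATATAGAAG CGTTAGCGAG AAAAGCATAA"

print(translate(first_line))  # MLVSNYNYVISRRRLLQGAG
print(translate(last_line))   # ILEGIKEYFSDIEALARKA*
```

The two translated fragments agree with the first 20 and last 19 residues of the protein sequence, with the trailing `*` marking the TAA stop codon.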