Gene EcolC_3844 details


Gene Information

Locus tag: EcolC_3844
Symbol:
ID: 6066881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4199674
End bp: 4201011
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 55%
IMG OID: 641603256
Product: N-acetylmuramoyl-l-alanine amidase II
Protein accession: YP_001726775
Protein GI: 170021821
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0860] N-acetylmuramoyl-L-alanine amidase
TIGRFAM ID:
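
The coordinate and length fields above are related by simple arithmetic. The following minimal Python sketch shows that relationship, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
start_bp = 4199674
end_bp = 4201011


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1338
print(protein_length(length_bp))  # 445
```

Both results match the Gene Length and Protein Length fields of the record.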


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000499267
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000287323
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGTATC GCATCAGAAA TTGGTTGGTA GCGACGCTGC TGCTGCTGTG CACGCCGGTG 
GGTGCCGCGA CGCTCTCTGA TATTCAGGTT TCTAACGGTA ATCAACAGGC GCGGATAACG
TTGAGTTTTA TTGGCGATCC TGATTATGCG TTTAGCCATC AAAGCAAACG CACCGTGGCG
CTCGATATCA AACAAACGGG CGTGATTCAG GGACTGCCGT TGTTGTTCAG CGGCAATAAT
CTGGTGAAGG CGATTCGCTC TGGAACGCCT AAAGATGCAC AAACGCTACG GCTGGTGGTC
GATCTTACCG AAAACGGTAA AACCGAAGCG GTGAAGCGGC AGAATGGCAG CAATTACACT
GTCGTCTTTA CGATTAACGC CGATGTGCCG CCACCGCCTC CTCCGCCGCC CGTGGTTGCG
AAACGCGTTG AAACGCCTGC GGTTGTCGCA CCGCGCGTCA GCGAACCGGC GCGCAATCCG
TTTAAAACGG AAAGTAACCG CACTACGGGT GTTATCAGCA GTAATACGGT AACGCGTCCG
GCAGCGCGCG CGACGGCTAA CACTGGCGAT AAAATTATCA TCGCTATTGA TGCCGGACAC
GGCGGTCAGG ACCCTGGCGC TATCGGCCCC GGTGGTACGC GGGAGAAAAA TGTCACCATC
GCCATCGCGC GTAAATTGCG TACTTTGCTC AATGACGATC CGATGTTTAA AGGCGTTTTA
ACCCGTGACG GGGATTACTT TATCTCGGTG ATGGGGCGCA GTGATGTGGC ACGTAAGCAA
AACGCCAATT TCCTCGTGTC GATTCACGCT GATGCCGCAC CGAACCGCAG TGCGACTGGC
GCTTCCGTAT GGGTGCTCTC TAACCGTCGC GCCAACAGTG AAATGGCCAG CTGGCTGGAG
CAGCACGAGA AACAGTCGGA GCTGCTGGGT GGGGCGGGTG ATGTGCTGGC GAACAGTCAG
TCTGACCCCT ATTTAAGCCA GGCGGTGCTG GATTTACAGT TCGGTCATTC CCAGCGGGTA
GGGTATGATG TAGCGACCAG TATGATCAGT CAGTTGCAAC GCATTGGCGA AATACATAAA
CGTCGACCAG AACACGCCAG CCTTGGCGTT CTGCGCTCGC CGGATATCCC ATCAGTACTG
GTCGAAACCG GTTTTATCAG CAACAACAGC GAAGAACGTT TGCTGGCGAG CGACGATTAC
CAACAACAGC TGGCAGAAGC CATTTACAAA GGCCTGCGCA ATTATTTCCT TGCGCATCCG
ATGCAATCTG CGCCGCAGGG TGCAACGGCA CAAACTGCCA GTACGGTGAC GACGCCAGAT
CGCACGCTGC CAAACTAA
 
Protein sequence
MMYRIRNWLV ATLLLLCTPV GAATLSDIQV SNGNQQARIT LSFIGDPDYA FSHQSKRTVA 
LDIKQTGVIQ GLPLLFSGNN LVKAIRSGTP KDAQTLRLVV DLTENGKTEA VKRQNGSNYT
VVFTINADVP PPPPPPPVVA KRVETPAVVA PRVSEPARNP FKTESNRTTG VISSNTVTRP
AARATANTGD KIIIAIDAGH GGQDPGAIGP GGTREKNVTI AIARKLRTLL NDDPMFKGVL
TRDGDYFISV MGRSDVARKQ NANFLVSIHA DAAPNRSATG ASVWVLSNRR ANSEMASWLE
QHEKQSELLG GAGDVLANSQ SDPYLSQAVL DLQFGHSQRV GYDVATSMIS QLQRIGEIHK
RRPEHASLGV LRSPDIPSVL VETGFISNNS EERLLASDDY QQQLAEAIYK GLRNYFLAHP
MQSAPQGATA QTASTVTTPD RTLPN
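
The protein sequence follows from the gene sequence by codon translation, and the GC content field is the fraction of G and C bases. The sketch below illustrates both in plain Python. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares; the extra start codons that table 11 allows are not handled, and that does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and chosen for illustration.

```python
# Codon table in the usual T, C, A, G ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
first_bases = "ATGATGTATC GCATCAGAAA TTGGTTGGTA"
print(translate(first_bases))  # MMYRIRNWLV
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked the same way.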