Gene B21_03998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03998 
SymbolamiB 
ID8115771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4300675 
End bp4302012 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content55% 
IMG OID644850150 
Producthypothetical protein 
Protein accessionYP_003001723 
Protein GI251787419 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00684197 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATATC GCATCAGAAA TTGGTTGGTA GCGACGTTGC TGCTGCTGTG CACGCCGGTG 
GGTGCCGCGA CGCTCTCTGA TATTCAGGTT TCTAACGGTA ATCAACAGGC GCGGATAACG
TTGAGTTTTA TTGGCGATCC TGATTATGCG TTTAGCCATC AAAGCAAACG CACCGTGGCG
CTCGATATCA AACAAACGGG CGTGATTCAG GGACTGCCGT TGTTGTTCAG CGGCAATAAT
CTGGTGAAGG CGATTCGCTC TGGAACGCCT AAAGATGCAC AAACGCTACG GCTGGTGGTC
GATCTTACCG AAAACGGTAA AACCGAAGCG GTGAAGCGGC AGAATGGCAG CAATTACAAT
GTCGTCTTTA CGATTAACGC CGATGTGCCG CCACCGCCTC CTCCGCCGCC CGTGGTTGCG
AAACGCGTTG AAACGCCTGC GGTTGTCGCA CCGCGCGTCA GCGAACCGGC GCGCAATCCG
TTTAAAACGG AAAGTAACCG CACTACGGGT GTTATCAGCA GTAATACGGT AACGCGTCCG
GCAGCGCGCG CGACGGCTAA CACTGGCGAT AAAATTATCA TCGCTATTGA TGCCGGACAC
GGCGGTCAGG ACCCTGGCGC TATCGGCCCC GGTGGTACGC GGGAGAAAAA TGTCACCATC
GCCATCGCGC GTAAATTGCG CACTTTGCTC AATGACGATC CGATGTTTAA AGGCGTTTTA
ACCCGTGACG GGGATTACTT TATTTCGGTG ATGGGGCGCA GTGATGTGGC ACGTAAGCAA
AACGCCAATT TCCTCGTGTC GATTCACGCT GATGCCGCAC CGAACCGCAG TGCGACTGGC
GCTTCCGTAT GGGTGCTCTC TAACCGTCGT GCCAACAGTG AAATGGCCAG CTGGCTGGAG
CAGCACGAGA AACAGTCGGA GCTGCTGGGT GGGGCGGGTG ATGTGCTGGC GAACAGTCAG
TCTGACCCCT ATTTAAGCCA GGCGGTGCTG GATTTACAGT TCGGTCATTC CCAGCGGGTA
GGGTATGATG TAGCGACCAG TATGATCAGT CAGTTGCAAC GCATTGGCGA AATACATAAA
CGCCGACCAG AACACGCGAG TCTCGGCGTT CTGCGTTCGC CGGATATCCC GTCAGTACTG
GTCGAAACCG GTTTTATCAG CAACAACAGC GAAGAACGTT TGCTGGCGAG CGACGATTAC
CAACAACAGC TGGCAGAAGC CATTTACAAA GGCCTGCGCA ATTATTTCCT TGCGCATCCG
ATGCAATCTG CGCCGCAGGG GGCAACGGCA CAAACTGCCA GTACGGTGAC GACGCCAGAT
CGCACGCTGC CAAACTAA
 
Protein sequence
MIYRIRNWLV ATLLLLCTPV GAATLSDIQV SNGNQQARIT LSFIGDPDYA FSHQSKRTVA 
LDIKQTGVIQ GLPLLFSGNN LVKAIRSGTP KDAQTLRLVV DLTENGKTEA VKRQNGSNYN
VVFTINADVP PPPPPPPVVA KRVETPAVVA PRVSEPARNP FKTESNRTTG VISSNTVTRP
AARATANTGD KIIIAIDAGH GGQDPGAIGP GGTREKNVTI AIARKLRTLL NDDPMFKGVL
TRDGDYFISV MGRSDVARKQ NANFLVSIHA DAAPNRSATG ASVWVLSNRR ANSEMASWLE
QHEKQSELLG GAGDVLANSQ SDPYLSQAVL DLQFGHSQRV GYDVATSMIS QLQRIGEIHK
RRPEHASLGV LRSPDIPSVL VETGFISNNS EERLLASDDY QQQLAEAIYK GLRNYFLAHP
MQSAPQGATA QTASTVTTPD RTLPN