Gene Daro_3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3049 
Symbol 
ID3568253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3293633 
End bp3294982 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content59% 
IMG OID637681520 
Productcell wall hydrolase/autolysin 
Protein accessionYP_286249 
Protein GI71908662 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value0.429243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0314787 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAAC GCGCTCATCC CAACCTTGGG CGCCGACAGC TTCTCCGTTA TGCCGGCGCC 
TCACTGATCC TTTCCGTTTC GCCGATTGCC GGTGCCGCGG CCAAGTTGCC GTCTGTCCTC
GCCGTGCGTA TATGGCCGGC TGCCGATTAC ACCCGTGTCA CCCTCGAACA CGACGCACCG
CTTAAATTTA CCCATTTCAT TGTCGAGAAC CCGGATCGAC TGGTCGTTGA TATCGAAGGG
GTCGAGTTCA ACAGTGTCCT TGATAGCCTT GCCCGCAAGG TGGCGACTGA CGATCCGAAC
ATCAAGCTGT TACGCGCCGG TCGCTTCAAG CCAGGTGTCG TTCGCTTGGT CATGGAGCTG
AAGGGCAAAG TTAATCCGCA GGTCTTCACG CTGGAGCCGG CAGGCGAGTA TGGCCGTCGT
CTGGTGCTTG ACGTCTATCC AGTCAACCCG CCGGACCCGA TGATGGCGCT GCTCGAAGGG
CGCAAGGACG CGGTTGAGCC GCTGAAGAAT GAGCATGATT TCCAGATCAC TGAAAAGCGG
CCCGATGAAG TTGCCGCCAA GATTCCGGAA AAACCGATCG AGGCACCTGA GGTTCAGACC
AGCAAGAAGT CCGGCAAGCC GATTGTCGAT CGCCTGGTCA CCATCATGCT CGACCCCGGC
CACGGTGGCG AAGATCCCGG TGCCATCGGC AAGGCGGGAA CCTACGAAAA GAATGTCACG
CTGGAAGTAG CTCGCCGCCT GAAGGCGCGA ATCGATGCCG AGCCAAACAT GCGCGCGGTG
CTGACGCGTG ATTCCGATTT CTTCGTGCCG CTACAGATGC GCGTCCAGAA GGCCCGCCGA
ATCCAGTCCG ATCTCTTCCT GTCGATTCAT GCCGATGCCT GGATCAAGCC GGATGCCAAG
GGTTCATCGG TGTTCGTGCT GTCCGAAAAG GGGGCCTCCA GCACCCAGGC TCGCCTGCTC
GCCCAGAAGG AGAATCAGGC CGACCTGATT GGCGGGGTAA ATATTGGTAG CAAGGATCTA
TTTCTGGCCC GTACGCTGCT CGATCTGTCG CAGACCGGGA CGATCAACGA TAGCCTGAAG
CTGGGCAAGT ACCTGCTGGG TGAACTCGGG GCGATCAATA CGCTGCACAA GGCGAACGTT
GAACAGGCCG GTTTTGCCGT GCTCAAGGCG CCGGACATCC CGTCTGCGCT GATTGAAACG
GCGTTCATTT CCAATCCGGA AGAAGAAAGC CGGCTGAACG ACGATGCGTA TCAGGAAAAA
CTGGCCGGAG CGATCGTGCG CGGTATCAGG CAGTATTTCA TCAAGCATCC GCCAGGGCCA
AAGTCCAAGC TGGCCGCGCT CGGCTGGTGA
 
Protein sequence
MSKRAHPNLG RRQLLRYAGA SLILSVSPIA GAAAKLPSVL AVRIWPAADY TRVTLEHDAP 
LKFTHFIVEN PDRLVVDIEG VEFNSVLDSL ARKVATDDPN IKLLRAGRFK PGVVRLVMEL
KGKVNPQVFT LEPAGEYGRR LVLDVYPVNP PDPMMALLEG RKDAVEPLKN EHDFQITEKR
PDEVAAKIPE KPIEAPEVQT SKKSGKPIVD RLVTIMLDPG HGGEDPGAIG KAGTYEKNVT
LEVARRLKAR IDAEPNMRAV LTRDSDFFVP LQMRVQKARR IQSDLFLSIH ADAWIKPDAK
GSSVFVLSEK GASSTQARLL AQKENQADLI GGVNIGSKDL FLARTLLDLS QTGTINDSLK
LGKYLLGELG AINTLHKANV EQAGFAVLKA PDIPSALIET AFISNPEEES RLNDDAYQEK
LAGAIVRGIR QYFIKHPPGP KSKLAALGW