Gene Daro_3500 details


Gene Information

Locus tag: Daro_3500
Symbol:
ID: 3567768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3748963
End bp: 3750315
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 64%
IMG OID: 637681972
Product: UDP-N-acetylmuramoylalanine--D-glutamate ligase
Protein accession: YP_286699
Protein GI: 71909112
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase
TIGRFAM ID: [TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase
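
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the protein length excludes the stop codon. The helper names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, not counting the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(3748963, 3750315)
print(length, protein_length_aa(length))  # prints: 1353 450
```

Both values agree with the Gene Length and Protein Length fields of the record.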


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value: 0.0719241
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.127337
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACTCA AGGGCAAACG CGTTCTGGTC GTCGGACTCG GTGAGTCCGG ACTGGCGATG 
GCCAAGTGGC TGCATCGCCA GGGCGCGCTC GTCCGCGTTG CCGACTCGCG CGACAACCCG
CCGAATATTG ACGCGCTGCA GCGCGTCGCA CCAGGGGCTG AACTGGTGGC CGGTGCCTTT
GCCGAGGCCA CTTTCGCCGG TGCCGATTTC GTCGCGCTGT CGCCGGGCGT GCCCAAGGCG
ACGCCAGAGA TTGCCGCGCT TGAGATACCG TTGATTTCCG AAATCGAACT GTTCGCTGAC
GGCGTGCGCG AACAGGTGCC GAATTCACAA ATCATCGCCA TCACCGGCAG CAACGGCAAG
ACCACGACCA CGGCGCTGAC CGCCCATCTG CTCAACGGTG CTGGCGTACC GGCTATCGCC
TGCGGCAACA TTTCTCCGTC GGCGCTTGAT GCGCTGATGG ACGCCCAGGA TGCCGGCGCT
TTGCCGCAAG TCTGGGTTGT CGAACTGTCC AGCTTCCAGC TCGAAACGAC GCATCACCTG
AATGCCGCTG CCGCGACTGT CCTCAACGTC TCGGAAGATC ATCTCGACCG CTACGAGGGC
AGCCTGGCCA ACTACGCCGC CGCCAAGTCA CGAGTTTTTC AGGGCAAGGG CGTGATGGTA
CTGAACCGTG ATGACGACTG GTCGATGGCC AATGGCCGTT GCGGCCGCAA GATGGTGACT
TTTGGCCTGA ATGCCGCACC GCGTGGTGTC GATTACGGCT ACGCCGATGG CGCTATCTGG
CGCGGCAAGG ACAAGCTGGT CGCGATCGAT GCCCTGAAGC TGTCCGGTTT GCACAATGCC
GCCAATGCCA TGGCCGCACT GGCGCTGTGT GAGGCCATCG GCGTCGATCC GCTCCGCCTG
ATTGAGCCAC TGAAGGGCTT CTCCGGCCTG CCGCACCGGG TCGAAACTGT CGCTGAAATC
GGCGGCGTGC TCTACGTTGA TGACTCCAAG GGTACCAATG TCGGCGCCAC CCTGGCGGCC
ATCGAAGGCA TGGGCCGCAA AGTTGCGATC GTCCTCGGCG GCGACGGCAA GGGGCAGGAT
TTCTCGCCGC TCAAGCCAGC GCTGGAAAAG CACGGTCGTG CTGTGGCGCT GATCGGCCGT
GATGCTGCCG CCATCGGCAT GGCGCTCGAA GGCAGCGGCG TGCCAACCCG GATTCTGGGC
GATATGGAAG CTGCCGTGCT CTGGCTGGCG GCGCAAGCCC AAGCTGGCGA CTGCGTGCTG
CTCTCGCCGG CCTGCGCCAG TCTCGACATG TACCGCAACT ACGCGCATCG CGCCCAGGCC
TTCATCGACG CGGTGGAGGG GCTGAAATCA TGA
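
The gene sequence is printed in groups of ten bases, so the grouping spaces and line breaks have to be removed before any calculation. As a minimal sketch, the hypothetical helpers below clean such a block and compute a GC percentage, which can be compared with the GC content field once the full sequence is pasted in. Only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only; substitute the full block to
# compare the result with the GC content field of the record.
first_line = "ATGGAACTCA AGGGCAAACG CGTTCTGGTC GTCGGACTCG GTGAGTCCGG ACTGGCGATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```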
 
Protein sequence
MELKGKRVLV VGLGESGLAM AKWLHRQGAL VRVADSRDNP PNIDALQRVA PGAELVAGAF 
AEATFAGADF VALSPGVPKA TPEIAALEIP LISEIELFAD GVREQVPNSQ IIAITGSNGK
TTTTALTAHL LNGAGVPAIA CGNISPSALD ALMDAQDAGA LPQVWVVELS SFQLETTHHL
NAAAATVLNV SEDHLDRYEG SLANYAAAKS RVFQGKGVMV LNRDDDWSMA NGRCGRKMVT
FGLNAAPRGV DYGYADGAIW RGKDKLVAID ALKLSGLHNA ANAMAALALC EAIGVDPLRL
IEPLKGFSGL PHRVETVAEI GGVLYVDDSK GTNVGATLAA IEGMGRKVAI VLGGDGKGQD
FSPLKPALEK HGRAVALIGR DAAAIGMALE GSGVPTRILG DMEAAVLWLA AQAQAGDCVL
LSPACASLDM YRNYAHRAQA FIDAVEGLKS
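
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is an illustration rather than the database's own procedure: it builds the codon table from the compact amino-acid string for codons in TCAG order and stops at the first stop codon. Table 11 also allows alternative start codons that are read as M at the initiation position; that case is not handled here because this gene starts with ATG. The function name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(block: str) -> str:
    """Translate a coding sequence, ignoring grouping whitespace."""
    seq = "".join(block.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate_cds("ATGGAACTCA AGGGCAAACG CGTTCTGGTC"))  # prints: MELKGKRVLV
```

Applied to the last line of the gene sequence, the same function yields FIDAVEGLKS, which matches the end of the protein sequence.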