Gene Daro_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0020 
Symbol 
ID3570044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp25900 
End bp26931 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content57% 
IMG OID637678449 
Productpeptidoglycan-binding LysM 
Protein accessionYP_283249 
Protein GI71905662 
COG category[S] Function unknown 
COG ID[COG1652] Uncharacterized protein containing LysM domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value0.732804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.419154 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCGCA TTATATCCGC GCTCATCCTG GCCGTGACGG CCGTCTGCGC ATCGGCCGCC 
GAGCCGCTAC AACTCGTCGA CAATCCGCCT GATCGTCATA TCGTCGTCAA GGGCGACACG
TTGTGGGGCA TTTCCGGCAA ATTCCTCAAG CAGCCGTGGC GCTGGCCGGA AATCTGGCAG
ATGAACAAGG AACAGATCAA GAACCCGCAC TGGATTTATC CGGGCGACGT CATCATGCTC
GATATGTCGA GCGGTACCCC GCGCCTGAAG ATTGGCAAAC CCGTCACCGG GCAAAGCGGC
AAGGTTCAGC CGACCGTCTA TAGCACCCCG GTGCAGCAGG TCATTCCGAG CATCCCCCCC
AATGCCATTG AACCGTTCCT CTCCAAGCCA CTGATTATCG AGACAACGGA TCAGAACGCG
ACAGTCAGCA TCGTCGCAAC CCAGGAAGAT CGCATGCTGG TCGGTACGGG TGATTCTTTC
TACGCCCAAG GCATTCCCGA TTCAAGCATC GAAAAATGGA ATGTATTCCG CAAGGGCAAG
CCGCTGAAAG ATCCGGATAC CGGCGAGACT ATTGCTTACG AAGCCGTTTT CCTCGGCAAT
GCCCGCTTGG TCAAGCCAGG CGAACCGGCA ACGCTGCGCG TCACCCTGGC CAAAGAAGAA
ATGAATCGCG GCGACAATCT TTTGCCCGCT CCTCCCCCGG AAATTCTGAC CTACGTACCG
CACCGCCCTG AGCAGGAAGT CTCAGCCAAA GTGCTTGGTA TTTATGGCGG GGTGCAAGAG
GGTGGCGCCA ATTCGGTCAT TTCCATCAGC CGTGGCAAGA ATAGCGGTCT CGAACTGGGA
CATGTCGTTG CGCTCTACCG GAATCGTGTT TCGGTCAGCA TTGATGAAGA CGGCCGTCGC
ACTTCAACTC CGGTACCTGA AGAACGTTAT GGCCTTGCCT TCGTTTTCCG CGTCTTTGAC
CGCGTCGCCT ACGCCTTGGT CGTCGAGTCC TCCAAGGCAG TCATCATCGG GGACTCCGCA
CTGAACCCGT GA
 
Protein sequence
MVRIISALIL AVTAVCASAA EPLQLVDNPP DRHIVVKGDT LWGISGKFLK QPWRWPEIWQ 
MNKEQIKNPH WIYPGDVIML DMSSGTPRLK IGKPVTGQSG KVQPTVYSTP VQQVIPSIPP
NAIEPFLSKP LIIETTDQNA TVSIVATQED RMLVGTGDSF YAQGIPDSSI EKWNVFRKGK
PLKDPDTGET IAYEAVFLGN ARLVKPGEPA TLRVTLAKEE MNRGDNLLPA PPPEILTYVP
HRPEQEVSAK VLGIYGGVQE GGANSVISIS RGKNSGLELG HVVALYRNRV SVSIDEDGRR
TSTPVPEERY GLAFVFRVFD RVAYALVVES SKAVIIGDSA LNP