Gene Daro_3369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3369 
Symbol 
ID3567237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3627475 
End bp3629271 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content59% 
IMG OID637681841 
Productpeptidoglycan-binding LysM 
Protein accessionYP_286568 
Protein GI71908981 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3170] Tfp pilus assembly protein FimV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAAA CAAAATATAC AGCGAGAAAC AACATGAGAA GCAAACTGAC GAGCATTGTC 
GCACTGACCG CCGCACTTGC TCCAGGCGTC GGAAACGCCA TCGGATTCGG GGAAATCACG
CTTCAGTCAC GTATCGGCGA AGCGCTCCTG GCTGAAGTGC CGATTCTGGC GACCGCAGAA
GAACAACCCA TAACGGCCTG CTTTTCCCTG GCATCGATCC GTGGCTCCGA CTTGCCGGTG
ATCACTGCAG CCAAGACAAG ATTGATCCGC CGCGGCCAGA ATTATGTCCT GCACATTCTG
GGCACCCGAC CGGTCAGCGA ACCCATTTTC GCGATCTCCC TGCAAGCGGG CTGCGGTTAC
GACATTGTGC GCGACTACGT GCTGATGCCG GAAGCACCAT TTGCGCAGGC AGAAATCCCT
GCGTCTTCAG GCGCCATTGC CGCGCCAAGA GGTAAACAGC CGAAATTTGC GGAATGGCAT
GCCCGCGAGG GCGACACGCT CGAGGACATT GCCGACGCCC AGGCGCCAGC CACTCTTTCC
GAACGTCAGC GCCTGATTGC CGCCTTAAAA CGAGCCAATC CGGACCTCGC CCCAGACACC
GTACTCAGAG AGGGCTCCAT TGTTCGCATT CCGGCAAGCA AGCAGGTCGG CGCCGAGAAA
AGAGAAAGCC CCTCTCGTGC ATCGACGCAA GCGACAAGCT TTGAGGAAAC GTCACCAAAG
CCAGTTGTAC CACGCAGCAA GCCCAGGCCA AACACCCCGC CAACGAAAGG CATGGACCAA
CTGCTCGTGG GCGCTGCGCC AGAGGAGACC AAGCCGCGCG AAAAAGGAAA TTCTGCGCTC
CTGTCATTGG CCGAGACCGA ACAGCGCCTT CTCAAGCTGG AAACCACCCT GCATTTGCTG
ACGCAGGAAG TGGAAAAAAT GGATCAAGCC CTGGACTTGG CGACCAAGGC CATTGAAGCA
CAAGGCAAAT TGCAGCGGGC GCAGGCGCCC CAGGCTGCTG TAGCGGCGAC CAGTGCCCCC
GCGCTGCCTG CCAACGCCCC AGCGAAAGCC AACTGGCTTG AATTGCTGTT GAGTGCAGCC
CTCGGCGCGG CCATTTCAGT CGGTCTTGCC CAATACCTGG GACGTCGCCA TCGTTATCCT
GGAGAGGAAG AAGCGCCACT GCTCTTCGCC CAGCATCGCG ACGCAGCGCG CCCGGCCACA
CAAGAAACGA CGGACGTATT CGACATCACA GAAAGCGAAA CATCTGACTC AAGCCCGCAA
GTCATCAAGC CATCCCTGCC GCCATCGCCT GAAGAAAGCC CAGTCTCAGC CAGTGCCCCC
AACCCGGACG AAATGCAGGT GGAAGACGAT CACTCGCTGC TGGAACTGGC GGAAATCATG
CTCTCCTTTG GGCGCCTGCG TGGCGCAGCC GACACACTGG CTGCTCACAT TGACGAGACG
CTACCCAGGA GTATTGAGCC ATGGAGCATG TTGCTCGACC TCTACCGCCG AGGCGGCATG
CGCCAGGAGT TTGACGCACT GGCGGAAAAG ATGCGCCGCC ACTTCAATAC CGAGATCCCA
GCCTGGAATG ACTCGACGAC GCCCATCTCC GGACTGAAAA CACTGGAAGA CTTCCCGCAC
GTCATTCAAA AGGCATCCCA ACTCTGGGGC ACCCAAGACA GCGTCGATTA TTTATTCAGT
CTCGTCCACG ACACCCGAAT GGGTCAGCGC AATGGCTTCC CTTTGGAAGT CGTAGAGGAA
ATCGCCTTGC TCATGCGTAT TCTGGTGGAG GCCTACGGTC TCAAACGCTG CGGCTAA
 
Protein sequence
MGKTKYTARN NMRSKLTSIV ALTAALAPGV GNAIGFGEIT LQSRIGEALL AEVPILATAE 
EQPITACFSL ASIRGSDLPV ITAAKTRLIR RGQNYVLHIL GTRPVSEPIF AISLQAGCGY
DIVRDYVLMP EAPFAQAEIP ASSGAIAAPR GKQPKFAEWH AREGDTLEDI ADAQAPATLS
ERQRLIAALK RANPDLAPDT VLREGSIVRI PASKQVGAEK RESPSRASTQ ATSFEETSPK
PVVPRSKPRP NTPPTKGMDQ LLVGAAPEET KPREKGNSAL LSLAETEQRL LKLETTLHLL
TQEVEKMDQA LDLATKAIEA QGKLQRAQAP QAAVAATSAP ALPANAPAKA NWLELLLSAA
LGAAISVGLA QYLGRRHRYP GEEEAPLLFA QHRDAARPAT QETTDVFDIT ESETSDSSPQ
VIKPSLPPSP EESPVSASAP NPDEMQVEDD HSLLELAEIM LSFGRLRGAA DTLAAHIDET
LPRSIEPWSM LLDLYRRGGM RQEFDALAEK MRRHFNTEIP AWNDSTTPIS GLKTLEDFPH
VIQKASQLWG TQDSVDYLFS LVHDTRMGQR NGFPLEVVEE IALLMRILVE AYGLKRCG