Gene Daro_2107 details


Gene Information

Locus tag: Daro_2107
Symbol:
ID: 3567003
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 2274859
End bp: 2276505
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 59%
IMG OID: 637680580
Product: activation/secretion signal peptide protein
Protein accession: YP_285320
Protein GI: 71907733
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2831] Hemolysin activation/secretion protein
TIGRFAM ID:
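
The coordinates and lengths listed above agree with one another if the start and end positions are read as 1-based and inclusive. That reading is an assumption here, not something the record states. A minimal Python sketch of the arithmetic follows; the function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 2274859
end_bp = 2276505


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints: 1647 548
```

Both values match the Gene Length and Protein Length fields of the record.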


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value: 0.385271
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGA ACAAATTCAG CACGTGCTTG GCCATGGCAC TCATGGCCTA CGTGGCGCAA 
GCCGCTGCGC AGGCAGGCTC GTCCCTGCAG GGGCCGGTAC CGCCCCAGCC CTTGCCCTCT
GACGCCGGGG CTGTGCCAGC GCTTCCCGCG ATCAAGGCGA AGGCGTCCGA GGCCGGCGGG
GTAAGCGTGC TTCTCAAGTC CGTCGAAATT ACCGGCAACA AGACCCTGGA TAGCGCCACG
CTGCTTTCCG GTCTTGGCCA AGTGTCGGGC AAGTCTTTCG ACATGGCTGG CCTGAATGCC
CTGACTGCAA ATATTGAAAC GCAGTATCGC GCTGCCGGCT ATCCATTCAC CCAGGTGATC
CTTCCGCCGC AGGACCTGAA GGACGGTGTC CTCAGAATCA ATGTCATCGA GGGGCGGTAT
GGCCTCATTC GCGCTACCGG AAAAGGTACG CTGCCGGCGG GCGCCCAGCC TTTCCTCGAC
TTTGGCTTGA ACGGCGGTGA TCCAATCGAG AACAAGGTGC TTGAACGTAG ACTGCTGATA
CTTGACGATC AGCCGGGCAT GAAAATTCGC CCGGTCATTC GGCCCGGCGA CAACTTCGGC
GAGGCTGATC TGGAGGTGAA CGTCGAGCGT GCCTCGTATA TCAGCGGCGA GGCCGGTGTG
GACAACACCG GCGCACGCTC CACCGGCGAA TATCGAGCCC GAGCCGCGTT GTATGCCAAC
AGCCCGTTCA TGTACGGCGA CAAGCTTTCG CTGAATGGCT TGTACACCGA CAAGGATATG
TGGCTCGGTT CGCTGGATTA CGAATTGCCG CTGGGCGCTT CCGGCTTGCG CGGTCAGGTC
GGTTTTGCAC ATACCAGTTA TCAGTTGGGC GCGCAGTTTG CAGCGCTGAA TGCCAAAGGT
TATGCGGACG TAGCGACGGC CAGATTGAGC TATCCACTGA TTCGCTCGCA AGCCACGAAT
GTGCTGCTGA CACTGGGTTA CCAGCACAAG GACCTTGAAG ATCGTTACGA CAGCACACAT
ACCATTCGCA ACAAGAGCAG CGATGGCATT CCGGTCGGGC TACAGTTCGA CAAGCGCGAT
ACGCTGTGGG GCGGAGGTGT GACCTACGGT TCGCTCGGCT GGCTTCCCGG AAATCTCAAA
CTCGATGCCA ACATGACGGC CACGGATAGC GAAACGGCAA AGACCAAAGG CAGCTTCAAC
AAGGTAAATC TGGACATTGC CCGCATCCAG CAAGTGTTCG GCACAGTGAC CGCCTACGCG
CGCTATTCCG GGCAATGGGC GGACAAGAAT CTGGATTCCT CCGAGAAATT CAACCTCGGT
GGTTTCTATG GTGTGCGCGC TTATCCGCTG GGAGAAGGCG TTGGCGACAA GGGATGGTTC
ACCCAGCTGG AACTGCGCTA CGCGATTGGC CAGGTAACGC CTTTCGTTTT CCACGATATG
GGCGAAACGG ACACGAATGC GAAGCCTTGG GATGCAAATT CAGCAGCCAA GCGCAAGTTG
GCCGGCTCCG GTGTCGGTAT CCGTGCCATG CTTGACGGCT GGAGTCTTGA CGCCACCGTC
GCTTGGCGCA CGCAAGGCGG CCCCTCAACC TCGGAAAACG TCGATCGCAA CCCACGTATT
TTTTTCATGC TGGGGCGCCG GTTTTAA
 
Protein sequence
MKKNKFSTCL AMALMAYVAQ AAAQAGSSLQ GPVPPQPLPS DAGAVPALPA IKAKASEAGG 
VSVLLKSVEI TGNKTLDSAT LLSGLGQVSG KSFDMAGLNA LTANIETQYR AAGYPFTQVI
LPPQDLKDGV LRINVIEGRY GLIRATGKGT LPAGAQPFLD FGLNGGDPIE NKVLERRLLI
LDDQPGMKIR PVIRPGDNFG EADLEVNVER ASYISGEAGV DNTGARSTGE YRARAALYAN
SPFMYGDKLS LNGLYTDKDM WLGSLDYELP LGASGLRGQV GFAHTSYQLG AQFAALNAKG
YADVATARLS YPLIRSQATN VLLTLGYQHK DLEDRYDSTH TIRNKSSDGI PVGLQFDKRD
TLWGGGVTYG SLGWLPGNLK LDANMTATDS ETAKTKGSFN KVNLDIARIQ QVFGTVTAYA
RYSGQWADKN LDSSEKFNLG GFYGVRAYPL GEGVGDKGWF TQLELRYAIG QVTPFVFHDM
GETDTNAKPW DANSAAKRKL AGSGVGIRAM LDGWSLDATV AWRTQGGPST SENVDRNPRI
FFMLGRRF
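
The protein sequence can be checked against the gene sequence by translating the coding strand codon by codon. The sketch below is a minimal, dependency-free illustration and not the database's own pipeline. It assumes that translation table 11 assigns amino acids to codons as the standard code does, differing only in which start codons it permits. It also assumes the gene sequence is given 5' to 3' on the coding strand. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
# Codon table built from the usual TCAG ordering of the standard code.
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above.
first_row = "ATGAAGAAGA ACAAATTCAG CACGTGCTTG GCCATGGCAC TCATGGCCTA CGTGGCGCAA"
print(translate(first_row))  # prints: MKKNKFSTCLAMALMAYVAQ
```

The 20 residues printed match the first two blocks of the protein sequence. Passing the complete gene sequence to `translate` and `gc_percent` would allow the same comparison against the full protein and the GC content field.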