Gene Daro_3646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3646 
Symbol 
ID3568291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3919231 
End bp3921285 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content62% 
IMG OID637682119 
Productoligopeptidase A 
Protein accessionYP_286845 
Protein GI71909258 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCA GCAACCCGCT CCTCGATTTC TCCGACCTGC CCCGCTTCGA CCTGATCCAG 
CCCGAACACG TCAAACCAGC CATCGAATCC CTGCTTGCCG AAGGCAAGGC GCTGATCGAG
CGCCTGACCG CCGACAGCAC GCCCGCCACC TGGCCGGAAT TCGCCGGGGC GCTTTCCGAC
GGGCTGGAGC CTTTCGGCCG CGCCTGGGGC ATTGTCGGCC ACCTGCATTC AGTCAATGAC
GTGCCTGCCT GGCGCGAGGC CTACAACGAA ATGCTGCCTG AGGTGTCGCG CTTCTACGCC
GAACTCGGCC AGAACCTGAA ACTGTTCGAA AAGTACAAGG CCCTGCGTGA AAGTGCCGAA
TACGCGACGC TGAGCGTCGA ACAGAAAAAG ATCGTCAATA ACGAGGTGCG TGATTTCCGC
CTTTCTGGTG CGGAATTACC GGAAGAGCAG AAGCCGCGCT TCCAGGCCAT CATGGAAGAG
CTATCGCAGC TGTCTGCCAA ATTTGCCGAG AACCTGCTCG ACGCGACCAA CGCGTTCGCC
GAAGTCATCA CCGACGAAGC ACAGCTCGCT GGCCTGCCCG ACGACGTCAA GGAAGCCGCG
AAGACCGCGG CCGAAAAAGC CGGCGTCGAT GGTTGGCGTT TCAGCCTGCA GGCCCCGTCC
TACGGCCCGG TCATGCAGTA TGCCGACAAC CGCGAGCTGC GTGCCCGGAT GTACCGCGCC
TACGCGACGC GCGCCGCCGA ATTCCACGAC GGCTCCAGCA AGCCGGAATG GGACAATACG
CCGATCATCC AGCGCATGCT TGAACTGCGC GAGGAAGACG CCAGGATGCT CGGCTTCAAC
AACTTCGCCG AAGTGTCGCT GGCCCCCAAG ATGGCTGACA CGCCGGCCCA GGTGCTGGCT
TTCCTACGCG AACTGGCAGC CAAGGCCAAG CCGTTCGCCG CCAAGGACAT CGCCGAGCTG
CGCGCCTTTG CCAAGGATGA ACTGGGCCTG AACGACTTCC AGCCGTGGGA TGCCGCCTAT
GTATCCGAAA AGCTGCTGCA GGCCCGCTAC GCTTTCTCCG AGCAGGAAGT GAAGCAGTAT
TTCACCGAGC CCAAGGTGCT CGGCGGCCTG TTCAAGGTCA TCGAGAGCCT GTTCAACGTC
AAGGTCAAGC CGGACACGGC GCCGGTCTGG CATGAGGACG TCCGCTTCTA CCGGCTGGAA
ACGCCGACCG GCGATCTGGT CGGCCAGTTC TACCTCGACC TCTACGCCCG CGAAACCAAG
CGTGGCGGTG CCTGGATGGA TGAAGCCCGC TCGCGCCGCC GCACCGTCAC CGGCATCCAG
AAGCCGATTG CCTACTTGAA CTGCAATTTC GCCCGCCCGG TCGGCGGCAA ACCGGCCACC
TTCACGCACG ACGAAGTGAC CACCCTGTTC CACGAAACCG GCCACGGCCT GCACCACCTG
CTGACCCGTG GCGAAGAGCT CGGCGTTTCG GGCATTCATG GCGTCGAGTG GGATGCCGTC
GAACTGCCCT CGCAGTTCAT GGAAAACTAC TGCTGGGAAT GGGAAGTCGT GTCCGGCATG
ACGGCCCACG TCGATACCGG CGCGACGCTG CCGCGCGAAC TGTTCGACAA GATGCTGGCC
GCCAAGAATT TCCAGAGCGG CATGATGGCC GTGCGCCAGA TCGAGTTCTC GCTGTTCGAC
ATGCTGATCC ATTCCGATCT TGACCCGAAA TCCGGCTTGA CCGTGATGGA CGTGCTCAAG
GACGTACGCA AGGAAGTCGC CGTGCTGATT CCGCCCGAAT GGCATCGCTT CCCGAACAGC
TTCTCGCACA TTTTTGGCGG CGGTTATGGG GCCGGTTACT TCAGCTACAA GTGGGCGGAA
GTGCTGTCGG CCGATGCCTA TGCGGCTTTC GAAGAGGCCG GCGATCCCTT CGATGCCACG
GTGGGCAAGC GCTTCCTCGA TGAAATCCTG TCGGTTGGCG GCTCGCGTCC GGCCATCGAG
TCGTTCAAGG CCTTCCGTGG GCGCGAACCG AGCGTCGATG CACTGCTCCG TCATAGCGGT
ATGATTGCGG CCTAA
 
Protein sequence
MTASNPLLDF SDLPRFDLIQ PEHVKPAIES LLAEGKALIE RLTADSTPAT WPEFAGALSD 
GLEPFGRAWG IVGHLHSVND VPAWREAYNE MLPEVSRFYA ELGQNLKLFE KYKALRESAE
YATLSVEQKK IVNNEVRDFR LSGAELPEEQ KPRFQAIMEE LSQLSAKFAE NLLDATNAFA
EVITDEAQLA GLPDDVKEAA KTAAEKAGVD GWRFSLQAPS YGPVMQYADN RELRARMYRA
YATRAAEFHD GSSKPEWDNT PIIQRMLELR EEDARMLGFN NFAEVSLAPK MADTPAQVLA
FLRELAAKAK PFAAKDIAEL RAFAKDELGL NDFQPWDAAY VSEKLLQARY AFSEQEVKQY
FTEPKVLGGL FKVIESLFNV KVKPDTAPVW HEDVRFYRLE TPTGDLVGQF YLDLYARETK
RGGAWMDEAR SRRRTVTGIQ KPIAYLNCNF ARPVGGKPAT FTHDEVTTLF HETGHGLHHL
LTRGEELGVS GIHGVEWDAV ELPSQFMENY CWEWEVVSGM TAHVDTGATL PRELFDKMLA
AKNFQSGMMA VRQIEFSLFD MLIHSDLDPK SGLTVMDVLK DVRKEVAVLI PPEWHRFPNS
FSHIFGGGYG AGYFSYKWAE VLSADAYAAF EEAGDPFDAT VGKRFLDEIL SVGGSRPAIE
SFKAFRGREP SVDALLRHSG MIAA