Gene Daro_3719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3719 
Symbol 
ID3568152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3998184 
End bp3999542 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content63% 
IMG OID637682192 
Productpeptidase M16, C-terminal:peptidase M16, N-terminal 
Protein accessionYP_286918 
Protein GI71909331 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0647985 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATGC GACAACTTGT CCCCTTTTTG CTCGCTGCCG GTCTGTCGAC CGCAGCCCTT 
GCCAACCCTT ATGAAACGAC GCTCAAGAAC GGCCTGCGCG TGATTGTCAA GGAAGACCAC
CGCGCCCCGA CTGCCGTCCA GATGGTCTGG TACCGCATCG GCAGCACGGA CGAGGTCGAT
GGCGCCTCCG GCGTCGCCCA CGTGCTCGAG CACATGATGT TCAAGGGCAC GCCCAGTGTC
GGCCCCGGCG AATTCAACAA GCGCGTCGCG GCCGCCGGCG GCAAGGACAA CGCCTTCACC
AGCCGCGATT ACACCGCCTA CTTCCAGCAG GTGCCGAAGG AAAAACTGGC CGACATGATG
CAGCTCGAAG CCGACCGAAT GCGTCACCTG AATGTCGACG CCAAGGAATT CGAGCAGGAG
ATCAAGGTCG TCATGGAAGA GCGCCGCATG CGCACCGATG ACAACCCGCA AGCCAAACTG
TTCGAGCAGA TGAACGCCGT CGCCTTCCAG GCTCACCCGT ATCGCCGGCC GATCATCGGC
TGGATGAACG ACCTGGAAAC GATGACCGCC GCCGATGCCA AGGCCTGGTA CGACACCTGG
TACGTGCCAA ACAACGCTTA CGTCATCATC ACCGGCGACG TCGATCACAA GGAAGTCTTC
GCTCAGGCCG AAAAATACTA CGGCCCGCTT GAAGGCCGCG CCCTGCCGCC CCGCCGCCAG
CAAATCGAGC CCGTACAGGA AGGCCCCCGC CACGTCACGG TCAAGGGCCC GGCCGAACTG
CCGGTGCTGA TCATGGGCTA CAAGGCGCCG ATCCTGCGCG ACATCGACAA GGACAGCGCC
CCCTACGCAC TGGAAATGCT CGCCTCCATC CTCGATGGTC ACGATGCTGC CCGCTTCAAC
AAGAAATTGG TGCGCGAGGA CAAGGTCGCG CTGTCGGCCG GCATCGACTA CGACAACACC
GCCCGCGGCC CCGGCATGCT CTACCTGCAC GGCACGCCGT CGGAAGGCAA AACCGTCGCC
GATCTGGAAG CTGCGCTGCG CGCCGAGATC GTCCGTGTCC AGAAGGATGG TGTCAGCACC
CAGGAACTCA AGCGCGCCAA GGCCCAGCTG GTGGCTGGTC AGGTCTACAA GCTTGATTCG
ATGTTCGGTC AGGCCATGGA AATCGGCCAG ATCGAATCGG TCGGCCTGCC CTACCAGAAG
CTCGACCACA TGCTGGACAA GCTGCAGAAA GTCACCGCTG CCGACGTTCA GGCCGTAGCC
AGAAAATACT TCAACGACGA TGCCCTGACC ATCGGCGTCC TCGATCCGCA GCCGCTCGAC
GGCAAACCAC GCCGTCCGGC CGTCGCTACC CGCCACTGA
 
Protein sequence
MRMRQLVPFL LAAGLSTAAL ANPYETTLKN GLRVIVKEDH RAPTAVQMVW YRIGSTDEVD 
GASGVAHVLE HMMFKGTPSV GPGEFNKRVA AAGGKDNAFT SRDYTAYFQQ VPKEKLADMM
QLEADRMRHL NVDAKEFEQE IKVVMEERRM RTDDNPQAKL FEQMNAVAFQ AHPYRRPIIG
WMNDLETMTA ADAKAWYDTW YVPNNAYVII TGDVDHKEVF AQAEKYYGPL EGRALPPRRQ
QIEPVQEGPR HVTVKGPAEL PVLIMGYKAP ILRDIDKDSA PYALEMLASI LDGHDAARFN
KKLVREDKVA LSAGIDYDNT ARGPGMLYLH GTPSEGKTVA DLEAALRAEI VRVQKDGVST
QELKRAKAQL VAGQVYKLDS MFGQAMEIGQ IESVGLPYQK LDHMLDKLQK VTAADVQAVA
RKYFNDDALT IGVLDPQPLD GKPRRPAVAT RH