Gene Daro_1720 details

Gene Information

Locus tag: Daro_1720
Symbol:
ID: 3568539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1848740
End bp: 1850194
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 58%
IMG OID: 637680189
Product: peptidase M20C, Xaa-His dipeptidase
Protein accession: YP_284937
Protein GI: 71907350
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
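
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a stop codon that is not counted in the protein length. Both are assumptions about the database's conventions, not something the record states, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 1848740
end_bp = 1850194

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1455
print(protein_length)  # 484
```

Under these assumptions the results agree with the reported Gene Length (1455 bp) and Protein Length (484 aa).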


Plasmid Coverage information

Num covering plasmid clones: 56
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATG CTGTGTTTTT TGGTCTGCAA CCGGCCGCTG TCTGGGCTCA TTTCTCTAGA 
CTGTGCGCGA CGCCACGCGC CTCCAAGCAC GAAGCCATAC TTCGTGACGC GTTGTTGCAA
TGGGCGTTCA GCCGAGGTCT GGCTGCCGAG GTTGATGCTT CAGGTAACCT GATTATCCGC
AAGCCGGCCA GCATTGGGCG CGAGCATTGT CCGGGCGTGG TGCTGCAGGC GCATCTCGAT
ATGGTCTGTC AGAAGAATGC TGACAGCGAC CATGACTTTT CTCGCGATGC GATCATTCCC
GTCTTGCGTG ATGGCTGGCT GGTGACTGAG AAAACGACCT TGGGCGCCGA CAACGGTATT
GGTGTGGCCT TGATTCTTGC CGTTCTGGAG GATGATTCGC TTCAACATGG ACCCATCGAA
GCCCTGCTGA CTGTCGATGA AGAGGCTGGC ATGGGCGGCG CGCGTGGTCT AGAGCCTCGA
GTGTTGCAAA GTCAGCTGAT GCTGAATCTA GATACAGAAG ACTGGGGTGA GTTCTATCTG
GGCTGCGCTG GAGGACTGGA TGTCAATGTA GAACGTCGTG GTGCGGCAGA GGCGGTTCCG
GCTGGCTATC AGGCCTGGCG AATCGACCTT GCTGGCTTGC GGGGCGGGCA TTCCGGCGTG
GATATCCACG AGGAGCGGGG CAATGCGATC AAGCTGCTGG TTCGCGTACT GCGGGAGTTA
GAGGCAATGT TCCCATTGCG TTTGGCTGAA CTTTCCGGAG GGACGGCCCG GAATGCCTTG
CCGCGTGAAG CTTCGGCCGT GATTTTGCTG CCCGTTGATA TGGCGGATCG CCTCGCGGTG
GCGGTGCTAG CAATTCAGCG TTGTCTGCAG GGCGAGTTGC GTGGTGTCGA TGAAGGTGTG
GAACTGAGCG TTTCGGCTTG CGATGCGTCG ATGGTGATGT CGCCGAGTGA GCAACGAATT
TGGCTGGCCT CGTTGCATGC CGCGCCACAT GGCGTACGCC GGATGAGCCG GCAGGTGCCG
GGTGTGGTCG AAACCTCCAA TAATCTGGGC ATGGTCGAAT TGCACCCGAA TGGTGGCTCG
TGCAACTTCA TGGTGCGTTC GCTGCTGGGT AGCGGCAGCA TGGCCCTGGC TGACGAAATC
GCCAGTTTGT GGGCCTTGAG CGACAGTCGG GTGGAAAAGG AAGGTTTTTA TCCGGGCTGG
GCGCCCAATC CGGATTCGCA GTTACTCAAG TTGTGCCAGT CGGTTTATCG GCGGGATTTC
GGTGCCGATT CGAAGGTTCA GGTCATCCAC GCTGGTCTGG AGTGCGGCAT CATTGGCGAC
AAGTATCCGG GGATGGATAT TGTTTCGTTT GGCCCGACGA TTCGCGGGGC ACATGCACCC
GGTGAGCGGG TTGAAGTCGC TTCGGTGGAA AAGTGCTGGA ACTTGCTGAC GGCGATTCTG
GCCGAATTGC GTTGA
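
The GC content field in the gene information could be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the space-separated 10-base blocks used here. `gc_percent` is a hypothetical helper written for illustration, not part of any database tool, and how the record rounds its percentage is not known.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace and any characters other than A, C, G, T are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Illustrative call on the first row of the gene sequence only;
# paste the full sequence to compare against the reported GC content.
first_row = "ATGACCGATG CTGTGTTTTT TGGTCTGCAA CCGGCCGCTG TCTGGGCTCA TTTCTCTAGA"
print(round(gc_percent(first_row), 1))
```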
 
Protein sequence
MTDAVFFGLQ PAAVWAHFSR LCATPRASKH EAILRDALLQ WAFSRGLAAE VDASGNLIIR 
KPASIGREHC PGVVLQAHLD MVCQKNADSD HDFSRDAIIP VLRDGWLVTE KTTLGADNGI
GVALILAVLE DDSLQHGPIE ALLTVDEEAG MGGARGLEPR VLQSQLMLNL DTEDWGEFYL
GCAGGLDVNV ERRGAAEAVP AGYQAWRIDL AGLRGGHSGV DIHEERGNAI KLLVRVLREL
EAMFPLRLAE LSGGTARNAL PREASAVILL PVDMADRLAV AVLAIQRCLQ GELRGVDEGV
ELSVSACDAS MVMSPSEQRI WLASLHAAPH GVRRMSRQVP GVVETSNNLG MVELHPNGGS
CNFMVRSLLG SGSMALADEI ASLWALSDSR VEKEGFYPGW APNPDSQLLK LCQSVYRRDF
GADSKVQVIH AGLECGIIGD KYPGMDIVSF GPTIRGAHAP GERVEVASVE KCWNLLTAIL
AELR
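
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch that builds the codon table by hand, assuming the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). `translate` and the constant names are hypothetical and written for illustration.

```python
# Standard codon assignments, listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence shown above.
first_row = "ATGACCGATG CTGTGTTTTT TGGTCTGCAA CCGGCCGCTG TCTGGGCTCA TTTCTCTAGA"
last_row = "GCCGAATTGC GTTGA"
print(translate(first_row))  # MTDAVFFGLQPAAVWAHFSR
print(translate(last_row))   # AELR
```

The two printed fragments match the first 20 residues and the last 4 residues of the protein sequence in this record. Translating the full gene sequence the same way would allow a residue-by-residue comparison.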