Gene Daro_1698 details


Gene Information

Locus tag: Daro_1698
Symbol:
ID: 3568552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1825424
End bp: 1826386
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 62%
IMG OID: 637680165
Product: prolyl aminopeptidase
Protein accession: YP_284915
Protein GI: 71907328
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
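
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed 963 bp) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1825424
END_BP = 1826386


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS that ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 963
print(expected_protein_length(length_bp))  # 320
```

Both results agree with the Gene Length (963 bp) and Protein Length (320 aa) fields.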


Plasmid Coverage information

Num covering plasmid clones: 68
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.457189
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCCC CTCTTGAACC GCTGAGAACT GGCTATCTCG ATGTCGGGGA CGGGCACCGG 
CTATATTTCG AGACCTGCGG AAATCCGCGT GGCCTGCCCG TTGTGGTGCT GCACGGCGGT
CCTGGAAGTG GCACCAGCGC CAGAATGCGC ACGCTCTATG ACCCGGAGCG CTTCCATACA
GTCCTCTTCG ACCAGCGCGG CTGCGGGCGT TCGCTGCCTC AGGGAGAACT GCACGCCAAC
CATCTGAATG CGCTGATTGC CGACATCGAG CGCCTGCGCC TTCATCTCGG AATTGCACGC
TGGCTGGTCA GCGGCGGCTC ATGGGGCGCC ACGCTGGCAC TAGCCTATAC CGCAAATACC
CCACAGGCCG TACTCGGAAT ACTGGTGCGT AGCGTATTTC TTGCCGGAGA TCACGACATT
CAATGGTTTT TTCAGGGCGC CAAAGCACTC GTTCCCCAAG CGTGGGAAGC CTTTGCCGCG
CAGGTCGCGC CAGAGAACTC CGACCTTCTG AGCGGCTTGC GCCGATACCT GAATGGCACC
GACCTGGCGC AGGCAAGACA AGCCGCAGTA GCCTGGGCCC GCTATGAGCA AAGCCTGGCG
CAGCCTGGCC TGGCACCGCC CCCCTCGCCT GAACTCGATG ATCTTGCGAC GCAGGATCGC
CTGGTCAGGA AATACCGGAT ACAAGCGCAC TATTTGGCGC AGCAATGCTT TCTTGGCGAA
ACGGGCATTT TCAGTCTCAT TGCCCGTTTG CCTGCGGTCC CCGTGGCCAT CGTTCACGGC
CAGCTGGACT GTGTGTGCCA GCCGGAGAAC GCCAGGCGCT TGCAGCAGGC GATCCCGGGG
AGTCGCCTGG CATGGGCCGA CGGCGCTGGC CACGATCCCT TTCACCCGGC CATGAGCAGC
GCCTGGATCG CGTTTCTGCA CCACTTTTCC GAAGCGGGAA ACTTCGACAT TTCCGAGGCC
TGA
 
Protein sequence
MFSPLEPLRT GYLDVGDGHR LYFETCGNPR GLPVVVLHGG PGSGTSARMR TLYDPERFHT 
VLFDQRGCGR SLPQGELHAN HLNALIADIE RLRLHLGIAR WLVSGGSWGA TLALAYTANT
PQAVLGILVR SVFLAGDHDI QWFFQGAKAL VPQAWEAFAA QVAPENSDLL SGLRRYLNGT
DLAQARQAAV AWARYEQSLA QPGLAPPPSP ELDDLATQDR LVRKYRIQAH YLAQQCFLGE
TGIFSLIARL PAVPVAIVHG QLDCVCQPEN ARRLQQAIPG SRLAWADGAG HDPFHPAMSS
AWIAFLHHFS EAGNFDISEA
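
The gene and protein sequences are printed in space-separated blocks of ten, so they need to be cleaned before use. The following is a minimal sketch of how the nucleotide block could be cleaned, translated, and checked for GC content. It uses the standard codon assignments, which translation table 11 shares for elongation. Table 11 also permits alternative start codons, which the sketch ignores; that does not matter here because this gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq_block.split()).upper()


def translate(seq_block: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(seq_block)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq_block: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq_block)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence shown above.
first_line = "ATGTTTTCCC CTCTTGAACC GCTGAGAACT"
print(translate(first_line))  # MFSPLEPLRT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence block to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.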