Gene Daro_1031 details


Gene Information

Locus tag: Daro_1031
Symbol:
ID: 3568112
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1131973
End bp: 1132869
Gene Length: 897 bp
Protein Length: 298 aa
Translation table: 11
GC content: 65%
IMG OID: 637679490
Product: peptidase S33, proline iminopeptidase 1
Protein accession: YP_284257
Protein GI: 71906670
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
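
The start, end, and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (an assumption, though it matches the listed 897 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1131973
END_BP = 1132869


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 897
print(protein_length(length_bp))  # 298
```

Both results agree with the Gene Length and Protein Length fields of the record.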


Plasmid Coverage information

Num covering plasmid clones: 80
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTCG GTGACGGGCA TATCCTGCAT ATCGAGGAAT GCGGCCCGGT TGACGGCCTG 
CCCGTCCTTT TCCTGCATGA TGGCCCGGGT AATGGCTGCC AGCCGGAATA CCGCCGTCTG
TTCGACCCCC ATCGTTTTCG TATCGTCTTC ATCGACCAGC GCGGGGCCGG CCGCAGCCTG
CCGAGCGGGG AGTTGGGCGC CAATACCACG CCCGATCTGG TGGTCGACCT CGAACATGTG
CGCGATGCAC TGGGTATTTC GAACTGGATC GTCCTTGGCG GCGGCTGGGG CAGCCTGCTT
GCCCTGGCCT ACAGCCAATT ATTTACCGAA CGGGTGTGCG GGCTGGTGCT CTGCGGCATC
TTTCTCGGTT CGCGCCAGGA GGTCGAAGCG CTGCTTCAGG CGGCACCGGC CGGTCAGGCC
GACGCGTGGC GGCAATTCGC CGTCGCCATT CCCGAAAACG AGCGCGACGA CTTGCTGGCC
GCCTACGCCA GTCGAATTCT CGGCGCAGAC CTGGCCACTG CGAACCTGGC CAGCCATGCC
TGGCTGAACT ACGGGCGAGC CCTGCGCAAC GAGGGTCCGC TGCCGGCCTG GCCGGACAGT
TTGGCCTTGG CGACAGCCCG CCTGCAGATG CACTACTTGC ACCATGACTG CTTTATCGCT
CCGGGCCAAT TGCTGGCCGG TGTCGAGCAT CTGCGCCATT TGCCTGCCGC CATCGTGCAA
GGGGTGGCCG ACCCGCTCTA TCCGACCCAC TCCGCCGAAG CGCTGCACCG CGCCTGGCCG
GAGGCAACCT GGTTTCCCGT GGCCAATGCC GGCCACGACG TACTGGCCCC GCCAATCGCC
AGAGCTTGTA TCAAGGCGCT AGGTTGGGTC GCTGAGTGCG TTGAAACAGT TGATTAA
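
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before computing anything on it. The following is a minimal sketch of how the GC fraction reported in the record could be recomputed. The helper names are hypothetical, and only the first row of the sequence is used here for brevity; pasting the full block into `row` would give the whole-gene value.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above.
row = "ATGGCGGTCG GTGACGGGCA TATCCTGCAT ATCGAGGAAT GCGGCCCGGT TGACGGCCTG"
seq = clean_sequence(row)
print(len(seq))
print(round(100 * gc_fraction(seq), 1))
```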
 
Protein sequence
MAVGDGHILH IEECGPVDGL PVLFLHDGPG NGCQPEYRRL FDPHRFRIVF IDQRGAGRSL 
PSGELGANTT PDLVVDLEHV RDALGISNWI VLGGGWGSLL ALAYSQLFTE RVCGLVLCGI
FLGSRQEVEA LLQAAPAGQA DAWRQFAVAI PENERDDLLA AYASRILGAD LATANLASHA
WLNYGRALRN EGPLPAWPDS LALATARLQM HYLHHDCFIA PGQLLAGVEH LRHLPAAIVQ
GVADPLYPTH SAEALHRAWP EATWFPVANA GHDVLAPPIA RACIKALGWV AECVETVD
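
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than using a bioinformatics library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The function name and the choice to translate only the first and last rows are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; stop codons are shown as '*'."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last rows of the gene sequence above (both start in frame).
first_row = "ATGGCGGTCG GTGACGGGCA TATCCTGCAT ATCGAGGAAT GCGGCCCGGT TGACGGCCTG"
last_row = "AGAGCTTGTA TCAAGGCGCT AGGTTGGGTC GCTGAGTGCG TTGAAACAGT TGATTAA"

print(translate(first_row))  # MAVGDGHILHIEECGPVDGL
print(translate(last_row))   # RACIKALGWVAECVETVD*
```

The two outputs match the first 20 and last 18 residues of the protein sequence, with the trailing `*` marking the TAA stop codon.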