Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daro_1031 |
Symbol | |
ID | 3568112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dechloromonas aromatica RCB |
Kingdom | Bacteria |
Replicon accession | NC_007298 |
Strand | + |
Start bp | 1131973 |
End bp | 1132869 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637679490 |
Product | peptidase S33, proline iminopeptidase 1 |
Protein accession | YP_284257 |
Protein GI | 71906670 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 80 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTCG GTGACGGGCA TATCCTGCAT ATCGAGGAAT GCGGCCCGGT TGACGGCCTG CCCGTCCTTT TCCTGCATGA TGGCCCGGGT AATGGCTGCC AGCCGGAATA CCGCCGTCTG TTCGACCCCC ATCGTTTTCG TATCGTCTTC ATCGACCAGC GCGGGGCCGG CCGCAGCCTG CCGAGCGGGG AGTTGGGCGC CAATACCACG CCCGATCTGG TGGTCGACCT CGAACATGTG CGCGATGCAC TGGGTATTTC GAACTGGATC GTCCTTGGCG GCGGCTGGGG CAGCCTGCTT GCCCTGGCCT ACAGCCAATT ATTTACCGAA CGGGTGTGCG GGCTGGTGCT CTGCGGCATC TTTCTCGGTT CGCGCCAGGA GGTCGAAGCG CTGCTTCAGG CGGCACCGGC CGGTCAGGCC GACGCGTGGC GGCAATTCGC CGTCGCCATT CCCGAAAACG AGCGCGACGA CTTGCTGGCC GCCTACGCCA GTCGAATTCT CGGCGCAGAC CTGGCCACTG CGAACCTGGC CAGCCATGCC TGGCTGAACT ACGGGCGAGC CCTGCGCAAC GAGGGTCCGC TGCCGGCCTG GCCGGACAGT TTGGCCTTGG CGACAGCCCG CCTGCAGATG CACTACTTGC ACCATGACTG CTTTATCGCT CCGGGCCAAT TGCTGGCCGG TGTCGAGCAT CTGCGCCATT TGCCTGCCGC CATCGTGCAA GGGGTGGCCG ACCCGCTCTA TCCGACCCAC TCCGCCGAAG CGCTGCACCG CGCCTGGCCG GAGGCAACCT GGTTTCCCGT GGCCAATGCC GGCCACGACG TACTGGCCCC GCCAATCGCC AGAGCTTGTA TCAAGGCGCT AGGTTGGGTC GCTGAGTGCG TTGAAACAGT TGATTAA
|
Protein sequence | MAVGDGHILH IEECGPVDGL PVLFLHDGPG NGCQPEYRRL FDPHRFRIVF IDQRGAGRSL PSGELGANTT PDLVVDLEHV RDALGISNWI VLGGGWGSLL ALAYSQLFTE RVCGLVLCGI FLGSRQEVEA LLQAAPAGQA DAWRQFAVAI PENERDDLLA AYASRILGAD LATANLASHA WLNYGRALRN EGPLPAWPDS LALATARLQM HYLHHDCFIA PGQLLAGVEH LRHLPAAIVQ GVADPLYPTH SAEALHRAWP EATWFPVANA GHDVLAPPIA RACIKALGWV AECVETVD
|
| |