Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1122 |
Symbol | |
ID | 4285140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 1228832 |
End bp | 1229851 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638140600 |
Product | proline iminopeptidase |
Protein accession | YP_756353 |
Protein GI | 114569673 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.29642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0858208 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCT TTCACCCGGC CTGCGCGCCG CACCAGAGCG GCCATCTGGA CGTTGGTGAC GGGCATGCGA TCTATTGGGA AAGCGCCGGG AACCCGGACG GCATCCCGCT GCTGGTGCTG CATGGCGGCC CGGGTTCGGG GATATCCGAC AAGTTCCGGA AGTTGTTCGA TCCGCAGCGA TTCCGCATCA TCCTGCTCGA CCAGCGCGGC GCCGGCCGCT CGACGCCGCA CCTGTCGCTG CAGGCTAATA CGACCGCCCA TCTGGTCGCT GACCTGGAAG CCCTCCGCGG GCATCTGGCC ATCAAGCGCT GGATGGTGTT CGGCCCGTCC TGGGGCTCGA CGCTGGCCCT GGCCTATGCC CAGACGCATC CACACGTCGT CAGTGGCCTC ATCGTCGGCG CCATCTTCAC GGCCCGCGCT TTCGAGCTGG ACTGGTGGCA CAGCCCCGAC GGCGCGCCGA CCATCTTTCC GGACGCCTTC GCAACCTTCA TCGCTCCGGT ACCGCAGGCA GAGCGGACCT CACCCGAAAC GATCATGCGC TGGTATCTTG CGGAGATGCA GGACGAGATC GCGCGCGGGC TTCCGGATCT GACTGAGCTG GCCGACATCT CGACCCCGCT CGATACGCTG CGCCGCTCTG CGGTCTATCG GTGGACTGAG TATGAGGACC GCCTCTCCTA TCTCGACAAT CCGCCAGAAG CTGTGCGCGC GGGACTGGCG GCTCGCGGTG CCGGCTTTAT TGCCGCGCAT TCGCTGATCG AGGTCCATTA TTTCAGCCAG GGATGCTTCC TCGAGGAGGG TGAATTGCTG GCCAAGGCCG ACCGCCTGGC AGACATTCCG ATGGGGATCC TGCACGCCCG CTATGACATG GTGTGCCCCG CCCGCACCGC CTTCGATCTC GCCGCAGCCT GCCCGCATGC CGATTTCCGG CTGGTCGCCG TGGGCGGTCA TGGCATGACC GATGCCAGCC AGGCTGAGCT GAATGTCCTT GTCGACGACG TGGTCTCCCG TATCACCTGA
|
Protein sequence | MSAFHPACAP HQSGHLDVGD GHAIYWESAG NPDGIPLLVL HGGPGSGISD KFRKLFDPQR FRIILLDQRG AGRSTPHLSL QANTTAHLVA DLEALRGHLA IKRWMVFGPS WGSTLALAYA QTHPHVVSGL IVGAIFTARA FELDWWHSPD GAPTIFPDAF ATFIAPVPQA ERTSPETIMR WYLAEMQDEI ARGLPDLTEL ADISTPLDTL RRSAVYRWTE YEDRLSYLDN PPEAVRAGLA ARGAGFIAAH SLIEVHYFSQ GCFLEEGELL AKADRLADIP MGILHARYDM VCPARTAFDL AAACPHADFR LVAVGGHGMT DASQAELNVL VDDVVSRIT
|
| |