Gene Information |
Locus tag | MCA1737 |
Symbol | pip |
ID | 3102433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 1854183 |
End bp | 1855133 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637170898 |
Product | proline iminopeptidase |
Protein accession | YP_114176 |
Protein GI | 53803927 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
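The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1854183
END_BP = 1855133


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 951
print(protein_length(length_bp))  # 316
```

Both values agree with the Gene Length (951 bp) and Protein Length (316 aa) rows.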
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.146462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACCGC TTTATCCACC GCTCGAGCCC TACGTCTTCC ACCGTTTCGG AGTGGGGGGC GGGCATGAGA TCTATGTCGA GGAATGCGGC AATCCCGAGG GCATTCCCGC GGTGTTTTTG CACGGGGGGC CAGGCTCTGG TTGCAGACAC CACCATCGCT CGTTTTTCGA TCCGGAACGT TACCGGGCGA TACTCGTCGA TCAACGGGGC TGCGGGCGAT CGACCCCGCA TGGTGCGCTC AGGAACAATA CCACCCGTCA TCTGATCGAC GACCTCGAAT CGATCCGAGG GCGCTTGAAT ATCCCGAAAT GGCTGCTTTT CGGTGGCTCC TGGGGGGCGG CCCTGGCGTT GCTCTATGCG CAGGCCTTTC CGGAGCGGGT GAGCGGGCTG ATCCTGAGGG GCAGTTTCCT GGCGCGCAAG CGCGACGTGG ACTGGTTCGT GCGCGATGGT GCCAGTCGCT TCCATCCCGA GGCATGGCAG CGGTTCAGTG ACAATTTCGA TGCCCGGGAG CGGGCCGATC CGGTCCGGGC TATCCACCGC CGGATCAAAG GCGCCGATGA GCTTGAACAG CGGCGAATGG CGAAGGAATG GTGGCTTTGG AGCAGCCGCG TCACGCTGGG TTCCGGGTTC AACCCGGCGG ATGATGATCC CCTTCCCCCC GGAGCCTTGG CGCAGTGCCG CATCGAACTC CATTATGCGG CGGCCCGCTA TTTCATCAGG GAAGGTCAGA TCCTCGAAGA CTGTCCGAAG ATCGCCCATC TGCCCGCGAT CATCGTGCAC GGCCGGCAGG ACCTGGTCTG TCCTCCCGAG GCGGCCTGGC TGCTGCATCG GGCATTGCCG CGATCCGAGT TGACGATTTT GCCGAACGCC GGTCATCTTG CCCAAGGCGA GGAAATGACC GATGCTCTGG TGAGAGCGCT GGACGGCATG GCGGAACGGC TGGGGAGCTG A
Protein sequence | MKPLYPPLEP YVFHRFGVGG GHEIYVEECG NPEGIPAVFL HGGPGSGCRH HHRSFFDPER YRAILVDQRG CGRSTPHGAL RNNTTRHLID DLESIRGRLN IPKWLLFGGS WGAALALLYA QAFPERVSGL ILRGSFLARK RDVDWFVRDG ASRFHPEAWQ RFSDNFDARE RADPVRAIHR RIKGADELEQ RRMAKEWWLW SSRVTLGSGF NPADDDPLPP GALAQCRIEL HYAAARYFIR EGQILEDCPK IAHLPAIIVH GRQDLVCPPE AAWLLHRALP RSELTILPNA GHLAQGEEMT DALVRALDGM AERLGS
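The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The sketch below shows one way to strip the grouping, compute a GC percentage, and take a reverse complement. It assumes plain A/C/G/T input. The displayed gene sequence starts with ATG and ends with TGA, so it appears to be in coding orientation even though the gene lies on the minus strand; a reverse complement would then be needed to recover the forward-strand sequence of the replicon. All function names here are hypothetical.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (unambiguous A/C/G/T only)."""
    table = str.maketrans("ACGT", "TGCA")
    return clean_sequence(seq).translate(table)[::-1]


# First two blocks of the gene sequence shown above.
first_blocks = "ATGAAACCGC TTTATCCACC"
print(clean_sequence(first_blocks))      # ATGAAACCGCTTTATCCACC
print(gc_percent(first_blocks))          # 45.0
print(reverse_complement("ATGAAACCGC"))  # GCGGTTTCAT
```

Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content row, although how the database rounds that figure is not stated in the record.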