Gene MCA1737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1737 
Symbolpip 
ID3102433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1854183 
End bp1855133 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content63% 
IMG OID637170898 
Productproline iminopeptidase 
Protein accessionYP_114176 
Protein GI53803927 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.146462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCGC TTTATCCACC GCTCGAGCCC TACGTCTTCC ACCGTTTCGG AGTGGGGGGC 
GGGCATGAGA TCTATGTCGA GGAATGCGGC AATCCCGAGG GCATTCCCGC GGTGTTTTTG
CACGGGGGGC CAGGCTCTGG TTGCAGACAC CACCATCGCT CGTTTTTCGA TCCGGAACGT
TACCGGGCGA TACTCGTCGA TCAACGGGGC TGCGGGCGAT CGACCCCGCA TGGTGCGCTC
AGGAACAATA CCACCCGTCA TCTGATCGAC GACCTCGAAT CGATCCGAGG GCGCTTGAAT
ATCCCGAAAT GGCTGCTTTT CGGTGGCTCC TGGGGGGCGG CCCTGGCGTT GCTCTATGCG
CAGGCCTTTC CGGAGCGGGT GAGCGGGCTG ATCCTGAGGG GCAGTTTCCT GGCGCGCAAG
CGCGACGTGG ACTGGTTCGT GCGCGATGGT GCCAGTCGCT TCCATCCCGA GGCATGGCAG
CGGTTCAGTG ACAATTTCGA TGCCCGGGAG CGGGCCGATC CGGTCCGGGC TATCCACCGC
CGGATCAAAG GCGCCGATGA GCTTGAACAG CGGCGAATGG CGAAGGAATG GTGGCTTTGG
AGCAGCCGCG TCACGCTGGG TTCCGGGTTC AACCCGGCGG ATGATGATCC CCTTCCCCCC
GGAGCCTTGG CGCAGTGCCG CATCGAACTC CATTATGCGG CGGCCCGCTA TTTCATCAGG
GAAGGTCAGA TCCTCGAAGA CTGTCCGAAG ATCGCCCATC TGCCCGCGAT CATCGTGCAC
GGCCGGCAGG ACCTGGTCTG TCCTCCCGAG GCGGCCTGGC TGCTGCATCG GGCATTGCCG
CGATCCGAGT TGACGATTTT GCCGAACGCC GGTCATCTTG CCCAAGGCGA GGAAATGACC
GATGCTCTGG TGAGAGCGCT GGACGGCATG GCGGAACGGC TGGGGAGCTG A
 
Protein sequence
MKPLYPPLEP YVFHRFGVGG GHEIYVEECG NPEGIPAVFL HGGPGSGCRH HHRSFFDPER 
YRAILVDQRG CGRSTPHGAL RNNTTRHLID DLESIRGRLN IPKWLLFGGS WGAALALLYA
QAFPERVSGL ILRGSFLARK RDVDWFVRDG ASRFHPEAWQ RFSDNFDARE RADPVRAIHR
RIKGADELEQ RRMAKEWWLW SSRVTLGSGF NPADDDPLPP GALAQCRIEL HYAAARYFIR
EGQILEDCPK IAHLPAIIVH GRQDLVCPPE AAWLLHRALP RSELTILPNA GHLAQGEEMT
DALVRALDGM AERLGS