Gene Mpe_A1779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1779 
Symbol 
ID4784446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1914199 
End bp1916232 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content71% 
IMG OID640090350 
Productthimet oligopeptidase 
Protein accessionYP_001020973 
Protein GI124266969 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.117155 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCCG ACACCTATCC CTTCCCGATC CGCATGAACG CTTCCGTCCT CGCTTTCCTG 
TCCCGGCTGC TTCGGCTGTC GAGTTGCGTC GCCCTGTTCG GCGTCAGCGG CTGGAGCGCC
GCGGCCGACA CCGCGCCCTA CGTCTTTCCG AGCTACCGCG ACGGCGCCGC CGTGAAGGCG
CGTTGCGACC GCACGCTGCG CGAGATCGAG GCGCAGGCGC GGCGCATCGC GGCCGGTCGG
GGCCCGGACG GCGTGCTGGT CGAGATCGAC CGGCTCGGAC AGACAGTCGA CGACAGCATG
TCGCCGGTGT TCTTCCTGGC GAACGTGCAC CCCGACAAGC CGGTGCGCGA CGCGGCCGAG
GCCTGCGAGC TGCGCTACCA GGCCTTCACC AGCCGGCTGT ACCAGAACCC CAGGATCTAC
CGGCGTCTGC AGGCGCTGCA GCCGGTCGAT GCGATCGACC GGCAGATGCG GGCCGATCTG
CTCGCGAGTT TCGAGGACGC CGGGGTCGGC TTGCCGGCCG CGAAGCGCGA CCGCGCACGC
GCGCTCAACG ACGAACTGGG CCGCCTGTCG CAGGACTTCG AGCGTCGCCT GCGCGAGGAC
AAGACCCGCG TCGCCTTCAC CGACAGCGAG CTCGACGGGG TGCCGGCCAG TGTGTGGAGC
ACCGCCCCCC GCGACGCGCA GGGCCGGGTG CTGCTGGGGC TCGACTACCC GACCTACTCG
CCGGTGGTGG AGAACGCGCG CAGCCCGGGG GCACGCGAAC GCATGTGGCG GGCCTTCCAG
GCGCGCGGCG GCCAGGCGAA CCTGAAGACG CTGGCGCGGC TGGGCGAGAA GCGCCGCAGC
TACGCGCGCC TGTTCGGTGT CGAGAGCTAT GCCGATTTCA CGCTGCGCCG TCGCATGGCG
CTGAACGTGG GCAACGTGCA GGCCTTCCTC GGCGAGGTGA AGGGCGAGCT GGGCGAGCGC
GAGGAACGCG ATCTGTCCGA ACTGCGTGCC GCCAAGGCCG CCGAGTTGAA GACGGCGCCC
GACACCACGC CGCTGAAACG CTGGGACGTG GGCTACTACC TCGAGCGCGT CAAGCGCGAG
CGCCTGGCGC TCGACCAGGA GAGCTTCCGC CGCTATTTCC CGCCGCAGGC CAGCGTCGAC
TTCGTGTTCG CGCTCGCCGG GCGGCTGTTC GGCGTGGGCT TCGAGCCGGT GCCTCAGTCG
CTGTGGCATC CCGACGCCAA GGCCTACGTC GTGGTCGACG CCGCCAGCCG CACGCCGCTG
GCCACGCTCT ACCTCGATCT CTACCCCCGT GCCGACAAGT ACGGCCACGC GGCGGTGTGG
CCGCTGCGCG GCTCGTCGAC CTGGAGCGGG CAGTTGCCGA CGGCCGCGCT GGTGACCAAC
TTCGACCGCC AGGGCCTGAC GATCGACGAG CTGGAAACGC TGCTGCACGA GTTCGGCCAT
GCGCTGCACG TCACGCTGTC GCACACGCGC TATGCCGCAC AGGCCGGGAC CGCGGTCAAG
CTCGACTTCG TCGAGGCGCC ATCGCAGATG CTGGAGGAAT GGGTCTACGA CGCCCAGGTG
CTGGCGCTGT TCCAGCAGGT CTGCGCGAGC TGCGAGCCGG TGCCAGCCGA CCTGCTGGCG
CGCGCCGTGC AATCGCGCAG CTTCGCCAAG GGGCTGCAGT TCGCGCGCCA GCATCTGTAC
GCCAACTACG ACCTCGCGCT GCACGACAAG GACGCGCCGG ACCCGATGGC GCTGTGGGCC
CGCATGGAGA GCGCCACGCC GCTCGGCTAC GAGCCCGGCT CGCTGTTCCC GGCCGGCTTC
TCGCACGTCG CCGGCGGCTA CGGCGCCGGC TACTACGCCT ACCTCTGGAG CCTGGCGATC
GCGCAGGATC TGCGCACCGC CTTCGCGGCC GACCCGCTGG ACCCGGCCGT CGGTCGTCGC
TACCGTGAGA CGGTGCTGGC CAACGGCGGC CAGGCTCCGC CCGCCGAGCT GGTGGCGCGC
TTCCTGGGGC GTGCGCCGAG CAACGCCGCG TTCTTTGAGT GGCTCGAGCG CTGA
 
Protein sequence
MIADTYPFPI RMNASVLAFL SRLLRLSSCV ALFGVSGWSA AADTAPYVFP SYRDGAAVKA 
RCDRTLREIE AQARRIAAGR GPDGVLVEID RLGQTVDDSM SPVFFLANVH PDKPVRDAAE
ACELRYQAFT SRLYQNPRIY RRLQALQPVD AIDRQMRADL LASFEDAGVG LPAAKRDRAR
ALNDELGRLS QDFERRLRED KTRVAFTDSE LDGVPASVWS TAPRDAQGRV LLGLDYPTYS
PVVENARSPG ARERMWRAFQ ARGGQANLKT LARLGEKRRS YARLFGVESY ADFTLRRRMA
LNVGNVQAFL GEVKGELGER EERDLSELRA AKAAELKTAP DTTPLKRWDV GYYLERVKRE
RLALDQESFR RYFPPQASVD FVFALAGRLF GVGFEPVPQS LWHPDAKAYV VVDAASRTPL
ATLYLDLYPR ADKYGHAAVW PLRGSSTWSG QLPTAALVTN FDRQGLTIDE LETLLHEFGH
ALHVTLSHTR YAAQAGTAVK LDFVEAPSQM LEEWVYDAQV LALFQQVCAS CEPVPADLLA
RAVQSRSFAK GLQFARQHLY ANYDLALHDK DAPDPMALWA RMESATPLGY EPGSLFPAGF
SHVAGGYGAG YYAYLWSLAI AQDLRTAFAA DPLDPAVGRR YRETVLANGG QAPPAELVAR
FLGRAPSNAA FFEWLER