Gene Mext_0413 details

Gene Information

Locus tag: Mext_0413
Symbol:
ID: 5834825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 456119
End bp: 458218
Gene Length: 2100 bp
Protein Length: 699 aa
Translation table: 11
GC content: 70%
IMG OID: 641366197
Product: peptidyl-dipeptidase Dcp
Protein accession: YP_001637906
Protein GI: 163849863
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
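
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 456119
end_bp = 458218

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bases each).
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints "2100 0 699"
```

Under these assumptions the result agrees with the listed Gene Length (2100 bp) and Protein Length (699 aa).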


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.689859
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCCG ACACCGCACG CGACCTCCCC GAGGGTACGG AGCCCGGCCG TGGCGGCGTG 
AAAGCCAACC CGTTCGACGT GTCGGACTGG GCGACGCCGT TCGGCCTGCC GGATTTCGAG
GCGATCCGGC CCGAGCATTA CGTGCCGGCC TTCGCGCACG CCCTGAAGGC GCATGAGGCG
GAGATCGCGG CGATTGTCGG GAACGAGGCG GCGCCGACCT TTGCCAACAC GGTCGCGGCG
ATGGAGCGCG CGGGCGCCGC GCTCGACCGC GTGGCCAACG TGTTCTTCAA CCTGACCGGC
AGCAACACCA GCCCCGAGCT TCAGGCGATC GAGCGGGCGG TCGCGCCGCA GCTCGCGCGG
CATTCGAGCG CCATCACCCT GAATCCGGAG CTCTGGGCCC GGCTCTCGGC CATTGATGCC
GAGGCGGAAG GTTTGGGCGC AGAGGAGCGC CGCGTGCTCG ACCGCTACCG CAGCCGGTTC
CGCCGGGCCG GCGCCGGCCT CGCTCCGGAG GCCAAGACCC GTATCGCCGA GATCGCGGTC
CGGCTGGCCG AACTCGGGAC GCAATTCTCT CAAAACGTGC TCGCCGACGA GCGCGACTTC
GTCCTGCCGC TGAACGGCGA GGCGGATCTC GCCGGCCTGC CGCCCTTCCT GCGCGATGCC
GCCGCTGAGG CGGCCAAGGA GCGCGGCGGC GGGCAGAGCC ACGTCATTAC CCTCTCGCGC
TCGCTGATCG AGCCCTTCCT CGTCTTCTCC ACCCGCCGCG ACCTGCGGGC GCTGGCCTAC
GCGGCCTGGA CCCGCCGCGG CGAGAATGGC GGCGAGACCG ACAACCGCGC CATCATCGCC
GAGATCGTGC GCCTGCGCGC CGAGCGGGCC CGGCTGCTCG GCTTCGACAG CTTCGCCCAT
CTCAAGCTCG ACGACACCAT GGCGGGCTCC CCCGACGCGG CGATGGAGCT GCTGCGCAAC
GTCTGGAAGC CGGCGCTCCA GCGCGCGGCA ACGGAACGCG AAGGGCTCCA GGCCCTGGTT
CGGGCGGAGG GGCACGACTT CGCTCTCGAA GCGCATGACT GGCGGCACTA TTCCGAAAAG
CTGCGGCGGG CGGAGCACGA CCTCGACGAG TCCGAGATCA AGCCGTACCT GCCCCTGGAA
GGCATGATCC GGGCGGCCTT CGACACCGCG TCGAGGCTGT TCGGGCTCGC CTTCGAGGAG
CTGCGCGACG TGCCGCGCTA TCACCCGGAC GTGCGCACTT GGCTCGTGCG CGACGCCGAC
GGCTCCCAGG TCGGCCTGTT CCTTGGCGAC TATTTCGCCC GCCCCTCGAA GCGCTCGGGC
GCCTGGATGA GCGCCTTCCG TTCGCAGGAG CGGCTGAACG GCGACATCCG GCCGATCATC
GTCAACGTGA TGAACTTCGC CCGCGCCCCG CGGGGCGAGC CGACGCTGCT CTCTTTCGAC
GACGCCCGCA CGCTGTTCCA CGAATTCGGC CACGCCCTGC ACGGGCTTCT CTCCGACGTC
ACCTACCCGC TGTTGTCCGG CACCGCGGTC TCGCGCGACT TCGTGGAACT GCCCTCGCAG
CTCTACGAGC ACTGGCTGCA GCAGCCGGAG GTGCTGCGCG CCCATGCCCG CCATGTGACC
ACCGGCGAGC CGATGCCCGA CGCGCTGCTC GAACGCCTGC TCGCGGCGGC CAACTTCAAC
CAGGGCTTCG CGACGATCGA GTATGCGGCC TCGGCCATCG TCGACATGAC GCTGCATCTC
TCCGCGGCCG GAGAGGACGG GCTCGACGTC GTCGCGTTCG AGGCGGACGC CCTGCGCCGC
ATCGCCATGC CGGCCGAGAT CGCGGCGCGG CACCGGGCGC CGCACTTCGC GCATATCTTC
TCCGGCGACG GCTATGCGGC CGGCTATTAC AGCTATCTCT GGTCGGAGGT GCTCGATGCC
GACGCCTTCG ATGCCTTCCG CGAAGCGGGC GACATCTTCC ATCCGGAGAC GGCGCAGCGG
CTGCGCCGCA TGATCTACGG GGCGGGCAAC CTGCGCGATG CGCGCGAGGC CTATACGGCG
TTCCGGGGCC GGCTGCCGAG CATCGAGCCG CTGCTGAAGA AGCGCGGGCT GGCGGCGTAG
 
Protein sequence
MTADTARDLP EGTEPGRGGV KANPFDVSDW ATPFGLPDFE AIRPEHYVPA FAHALKAHEA 
EIAAIVGNEA APTFANTVAA MERAGAALDR VANVFFNLTG SNTSPELQAI ERAVAPQLAR
HSSAITLNPE LWARLSAIDA EAEGLGAEER RVLDRYRSRF RRAGAGLAPE AKTRIAEIAV
RLAELGTQFS QNVLADERDF VLPLNGEADL AGLPPFLRDA AAEAAKERGG GQSHVITLSR
SLIEPFLVFS TRRDLRALAY AAWTRRGENG GETDNRAIIA EIVRLRAERA RLLGFDSFAH
LKLDDTMAGS PDAAMELLRN VWKPALQRAA TEREGLQALV RAEGHDFALE AHDWRHYSEK
LRRAEHDLDE SEIKPYLPLE GMIRAAFDTA SRLFGLAFEE LRDVPRYHPD VRTWLVRDAD
GSQVGLFLGD YFARPSKRSG AWMSAFRSQE RLNGDIRPII VNVMNFARAP RGEPTLLSFD
DARTLFHEFG HALHGLLSDV TYPLLSGTAV SRDFVELPSQ LYEHWLQQPE VLRAHARHVT
TGEPMPDALL ERLLAAANFN QGFATIEYAA SAIVDMTLHL SAAGEDGLDV VAFEADALRR
IAMPAEIAAR HRAPHFAHIF SGDGYAAGYY SYLWSEVLDA DAFDAFREAG DIFHPETAQR
LRRMIYGAGN LRDAREAYTA FRGRLPSIEP LLKKRGLAA
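
The gene and protein sequences above are printed in blocks of 10 characters, 60 per line. The sketch below shows one way to strip that layout and translate the nucleotide blocks so they can be compared with the protein blocks. It is an illustration only: the helper names `clean_sequence` and `translate` are hypothetical, the codon table is built by hand rather than taken from a library, and it assumes that translation table 11 assigns amino acids to codons in the same way as the standard code (the two differ in permitted start codons, which this sketch does not model). Only the first and last lines of the gene sequence are used.

```python
# Hand-built codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 codon-to-amino-acid assignments match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGACCGCCG ACACCGCACG CGACCTCCCC GAGGGTACGG AGCCCGGCCG TGGCGGCGTG"
last_line = "TTCCGGGGCC GGCTGCCGAG CATCGAGCCG CTGCTGAAGA AGCGCGGGCT GGCGGCGTAG"

print(translate(clean_sequence(first_line)))  # prints "MTADTARDLPEGTEPGRGGV"
print(translate(clean_sequence(last_line)))   # prints "FRGRLPSIEPLLKKRGLAA*"
```

The two outputs correspond to the start and end of the protein sequence as listed, with the trailing `*` standing for the terminal TAG stop codon. Each 60-base line holds exactly 20 codons, so translating a line on its own stays in frame.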