Gene Mext_0301 details


Gene Information

Locus tag: Mext_0301
Symbol:
ID: 5832613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 336562
End bp: 338442
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 70%
IMG OID: 641366086
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001637796
Protein GI: 163849753
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
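
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def cds_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS of this length, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length = cds_length_bp(336562, 338442)
print(length)                     # 1881
print(protein_length_aa(length))  # 626
```

Both values match the Gene Length and Protein Length fields: 1881 bp is 627 codons, of which 626 encode amino acids and one is the stop codon.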


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.111113
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGATC TGATCCCGCG CGCGCATCTC TTCGGCAACC CGACGCGCTA CGGTCATCAG 
ATCAGCCCCG ACGGGCGCCG CCTCGGTTGG GTGGCGCCCC ATGAGGGTGT GCTCAACATC
TGGTCGGCGC CGATCGACGA CCTCGACGCC GCCGTGCCCG TCACCACCGA CCGGCGCCGC
GGCATCGACG CCTACGCCTT CGCCTATGAC GGGCGCCACC TGCTCTACGT GCAGGACGCG
GACGGCGACG AGAACCACCA CCTCTACGCC GTCGATCTCA CCACGGGCGA GCGACGCGAC
CTGACGCCGA TCCCCGGCAT CGCTGCGGCG ATCGTGGGCC TCAGCCGCAT CGTGCGCGAC
CGCGTGCTCG TCGCGATCAA CGACCGCGAC CCGCGCTTCC ACGACCTGCA CAGCATCGAT
CTCGCCACCG GCGAGCGCAG CCTCGTGATC GAGAATCCGG GCTTTGCCGG TTTCCTGATC
GATGAGCGCT ACGCGGTTCG CTTCGCCTTC CGCAATCTTC CGGACGGTTC GAGCCAGTTG
ATCGCCCCGG ACGGCGCGAA CTGGAAGCCG TGGCTCACCT TCCCGCCCGA GGATGCCCGC
GTCTCCGGCG CGGAGAATCT CGACGCCGCC GGCACCGCCC TGTTCTGCCG CGACAGCCGC
GGGCGCAACA CCGCCGCGCT GACCCGCATC GATCTCGCCA CCGGCGAGAC CCGCGTGCTC
GCCGCGCACG AGGAGGCGGA TATCGGCGCG GTGCTGCAGG ATGCCGAGAC GCACGAGCCG
GTGGCCTACT CGGTCACCCA TGCCCGCAAA TCCTGGCACG TGCTCGACCC GCGTTTGACC
GACGACTTCG CCTTCCTCGA AACGCAGGGG CTCGGCGACT GGTATCCGGC GAGCCGCACC
GAGGACGATG CGCTCTGGAT CGTGGTGGCC CGCGCCGACA CCCGCGTCGG CGAGGCCGCG
ATCTACGACC GGCGGGCAAA GACGCTGCGC TCGCTCGGCA GCGCCCGGCC GGAACTGGAG
GGTGCGCCGC TCGCCCCGAT GAGCCCGGCG ATCATCCGCT CCCGCGATGG GCTCGATCTC
GTCTCGTATC TCAGCCGCCC GCTCGATGCG CAGGCCCCCG GCCCGCTGGT GCTGCTCGTC
CATGGCGGCC CGTGGGCGCG AGACAGCTTC GGCTTCGACG GCCTCCATCA ATGGCTGGCC
AATCGCGGCT ATGCCGCGCT CAGCGTCAAC TTCCGTTCCT CGACCGGCTT CGGGAAAGCC
TTCCTCAATG CGGGCGACCG CGAATGGGGT CGGCGGATGG ACGACGACCT CAGCGACGCC
GTCGCCTGGG CGGTGGCGCA GGGTGTGGCC GATCCGGCTC GCGTCGCGAT CATGGGCGGC
AGCTACGGCG GCTACGCCAC GCTGATGGCG CTGACCCGCA ACCCCGGATC GTACGCCTGC
GGCATCGACC TCGTCGGCCC GGCCAACCTC GAAACCCTGG TGCGGACGAT CCCGCCCTAT
TGGGAGGCGA TGCGGGCGCA GCTCCACCGC GCCATCGGCG ATCCCGACAC CGAGGAGGGC
ATGGCGCTGA TCCGCGAGCG CTCCCCGGTC TACTTCGCCG ACCGAATCAA GGCGCCGCTG
CTGATCGTGC AGGGGGCCAA CGATCCGCGG GTGAAACAGG CCGAGTCGGA CCAGATGGTC
GCGGCCATGG AGCGCGGCGG CATTCCCGTG ACCTACCTGC TGTTTCCGGA CGAGGGCCAC
GGCCTCGTGC GCCCGGCCAA CCGGCTGGCC TTCTTCGCGC GGGCGGAAGA GTTCCTGGCG
CGCCATCTCG GCGGGCGCTG CGAGCCGATC CGCGAGGATG AATCCGCCGG GACGTCGATG
CAGGTGGTGC GGGAGGGATA G
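
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of how the GC content field could be recomputed from such text. The helper names are hypothetical, and only the first row of the sequence is used here for brevity; the 70% figure in the record refers to the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGGTCGATC TGATCCCGCG CGCGCATCTC TTCGGCAACC CGACGCGCTA CGGTCATCAG"
seq = clean_sequence(first_row)
print(len(seq))
print(f"{gc_fraction(seq):.1%}")
```

To check the whole gene, pass all rows of the gene sequence to `clean_sequence` as one string.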
 
Protein sequence
MVDLIPRAHL FGNPTRYGHQ ISPDGRRLGW VAPHEGVLNI WSAPIDDLDA AVPVTTDRRR 
GIDAYAFAYD GRHLLYVQDA DGDENHHLYA VDLTTGERRD LTPIPGIAAA IVGLSRIVRD
RVLVAINDRD PRFHDLHSID LATGERSLVI ENPGFAGFLI DERYAVRFAF RNLPDGSSQL
IAPDGANWKP WLTFPPEDAR VSGAENLDAA GTALFCRDSR GRNTAALTRI DLATGETRVL
AAHEEADIGA VLQDAETHEP VAYSVTHARK SWHVLDPRLT DDFAFLETQG LGDWYPASRT
EDDALWIVVA RADTRVGEAA IYDRRAKTLR SLGSARPELE GAPLAPMSPA IIRSRDGLDL
VSYLSRPLDA QAPGPLVLLV HGGPWARDSF GFDGLHQWLA NRGYAALSVN FRSSTGFGKA
FLNAGDREWG RRMDDDLSDA VAWAVAQGVA DPARVAIMGG SYGGYATLMA LTRNPGSYAC
GIDLVGPANL ETLVRTIPPY WEAMRAQLHR AIGDPDTEEG MALIRERSPV YFADRIKAPL
LIVQGANDPR VKQAESDQMV AAMERGGIPV TYLLFPDEGH GLVRPANRLA FFARAEEFLA
RHLGGRCEPI REDESAGTSM QVVREG