Gene Mchl_0345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_0345 
Symbol 
ID7118645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp325399 
End bp327279 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content69% 
IMG OID643523145 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_002419210 
Protein GI218528394 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGATC TGATCCCGCG CGCGCATCTC TTCGGCAACC CGACGCGCTA CGGTCATCAG 
ATCAGCCCCG ACGGGCGCCG CCTCGGTTGG GTGGCGCCCC ATGAGGGCGT GCTCAACATC
TGGTCGGCGC CGATCGACGA CCTTAATGCG GCCGTGCCCG TCACCACCGA TCGGCGCCGC
GGCATCGACG CCTACGCCTT CGCCTATGAC GGGCGCCACC TGCTCTACGT GCAGGACGCG
GACGGCGACG AGAACCATCA CCTCTACGCC GTCGATCTCA CCACGGGCGA GCGGCGCGAC
CTGACGCCGA TCCCAGGCAT CGCCGCGGCG ATCGTGGGCC TCAGCCGCAT TGTGCGCGAC
CGCGTGCTCG TCGCGATCAA CGACCGGGAC CCGCGCTTCC ACGACCTGCA CAGCATCGAT
CTCGCCACCG GCGAGCGCAG CCTCGTGATC GAGAATCCGG GATTTGCCGG CTTTCTGATC
GACGAGCGCT ACGCCGTGCG CTTCGCCTTC CGCAACCTTC CGGACGGTTC GAGCCAGTTG
ATCGCCCCCG ACGGCGCGAA CTGGAAGCCG TGGCTCACCT TCCCGCCCGA GGATGCCCGC
GTCTCCGGCG CGGAGAATCT CGACGCCGCC GGCACCGCCC TGTTCTGCCG CGACAGCCGT
GGGCGCAACA CCGCGGCGCT CACCCGCATC GATCTCGCCA CCGGCGAGAC ACGCGTGCTC
GCCGCGCACG AGGAAGCGGA TATCGGCGCG GTGCTGCAGG ACGCTGTGAC GCACGAACCG
GTGGCCTACT CGGTCACTCA TGCCCGCAAA TCCTGGCACG TGCTCGACCC GCGCTTGACC
GACGACTTCG CCTTCCTCGA AACGCAGGGG CTCGGCGATT GGTATCCGGC GAGCCGCACC
GAGGACGATG CGCTCTGGAT CGTGGTGGCC CGCGCCGACA CCCGCGTCGG CGAGGCCGCG
ATCTACGACC GGCAGGCAAA GACGCTGCGT TCGCTCGGCA GCGCCCGGCC GGAACTGGAG
GGTGCGCCGC TCGCCCCGAT GAGCCCGGCG ATCATCCGCT CCCGCGATGG GCTCGATCTC
GTCTCGTATC TCAGCCGCCC GCTCGATGCG CAGGCCCCCG GCCCGCTGGT GCTGCTCGTC
CATGGCGGCC CGTGGGCGCG AGACAGCTTC GGCTTCGACG GCCTCCATCA ATGGCTGGCC
AACCGCGGCT ATGCCGCGCT CAGCGTCAAC TTCCGATCCT CGACCGGCTT CGGGAAAGCC
TTCCTCAATG CGGGCGACCG CGAATGGGGT CGGCGGATGG ACGACGACCT CAGCGACGCC
GTCGCCTGGG CGGTGGCGCA AGGTGTGGCC GATCCGGCTC GCGTCGCGAT CATGGGCGGC
AGCTACGGCG GCTATGCCAC GCTGATGGCG CTGACCCGCA ACCCCGGATC TTACGCCTGC
GGCATCGACC TCGTCGGCCC GGCCAACCTC GAAACCCTGG TGCGGACGAT CCCGCCCTAT
TGGGAGGCGA TGCGGGCGCA ACTCCACCGC GCCATCGGCG ATCCCGACAC CGAGGAAGGC
ATGGCGCTGA TCCGCGAGCG CTCCCCGGTC TACTTCGCCG ACCGGATCAA AGCGCCGCTG
CTGATTGTGC AGGGCGCCAA CGATCCGCGG GTGAAACAGG CGGAATCCGA TCAGATGGTC
GCGGCCATGG AGCGCGGCGG CATTCCCGTG ACCTACCTGC TGTTTCCGGA CGAGGGCCAC
GGCCTCGTGC GCCCGGCCAA CCGGCTGGCC TTCTTCGCGC GGGCGGAAGA GTTCCTGGCG
CGCCATCTCG GCGGGCGCTG CGAGCCGATC CGCGAGGATG AATCGGCCGG GACGTCGATG
CAGGTGGTGC GGGAGGGATA G
 
Protein sequence
MVDLIPRAHL FGNPTRYGHQ ISPDGRRLGW VAPHEGVLNI WSAPIDDLNA AVPVTTDRRR 
GIDAYAFAYD GRHLLYVQDA DGDENHHLYA VDLTTGERRD LTPIPGIAAA IVGLSRIVRD
RVLVAINDRD PRFHDLHSID LATGERSLVI ENPGFAGFLI DERYAVRFAF RNLPDGSSQL
IAPDGANWKP WLTFPPEDAR VSGAENLDAA GTALFCRDSR GRNTAALTRI DLATGETRVL
AAHEEADIGA VLQDAVTHEP VAYSVTHARK SWHVLDPRLT DDFAFLETQG LGDWYPASRT
EDDALWIVVA RADTRVGEAA IYDRQAKTLR SLGSARPELE GAPLAPMSPA IIRSRDGLDL
VSYLSRPLDA QAPGPLVLLV HGGPWARDSF GFDGLHQWLA NRGYAALSVN FRSSTGFGKA
FLNAGDREWG RRMDDDLSDA VAWAVAQGVA DPARVAIMGG SYGGYATLMA LTRNPGSYAC
GIDLVGPANL ETLVRTIPPY WEAMRAQLHR AIGDPDTEEG MALIRERSPV YFADRIKAPL
LIVQGANDPR VKQAESDQMV AAMERGGIPV TYLLFPDEGH GLVRPANRLA FFARAEEFLA
RHLGGRCEPI REDESAGTSM QVVREG