Gene Mchl_5118 details


Gene Information

Locus tag: Mchl_5118
Symbol:
ID: 7118993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 5484248
End bp: 5485969
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 71%
IMG OID: 643527811
Product: peptidase S10 serine carboxypeptidase
Protein accession: YP_002423810
Protein GI: 218532994
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2939] Carboxypeptidase C (cathepsin A)
TIGRFAM ID:
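
The start, end, gene length, and protein length values in the record above are consistent with one another. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions that happen to match the numbers in this record, not something the record states. The function names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that is not counted in the protein.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if includes_stop else codons


gene_len = feature_length(5484248, 5485969)
print(gene_len)                  # gene length in bp
print(protein_length(gene_len))  # protein length in aa
```

With the coordinates from this record, the sketch gives 1722 bp and 573 aa, matching the Gene Length and Protein Length fields.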


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.302178
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCGTC CGGTGGGAAG GCGCGACAAC ACGCACGACC AATCGCCCGA GGATCGCGAG 
AGACCGATGA CGCAAGCCTT CTCCCTCTCC CCCCGTGCCG GAACCGCTCG GCTCTGCTTG
GCCGGCCTCC TCTCCCTCAC CGTGGGATTG GGGCCGGTGC TCGCCCAGCA CGGCCCGGCG
CAGGGTGACG GTCCCGGCCA AGCGCAGGGG AGGGGAACGG CGCAGGGCCA GCCCCGCAAG
GCGCCGGAAG GCCGCCGTCT GCCGCCGGAC GCGACCACCG AGCACAGCAT CGACGGACCG
AGCGGTCGCG CGCTCGCCTT CACCGCCACC GCCGGAAGCC TCGCGCTGGT GGACGAGGAG
GGCAAGCTTC AGTCCGAGAT CGCCTTCATC GCCTATACCA AGGCGGGCAA GCCGGAGGAG
GCCGCCGCCC GGCCGATCAC CTTCGGCGTC AATGGCGGAC CGGGCGCAGC CTCAGCCTAT
CTCAATATCG GTGCGATCGG CCCCTGGCGC CTGCCGACCG ACGGCGCCTC GATCAGCCCG
TCGCAAACGA TCGCGCTTCA GCCGAACCCG GCGACCTGGC TCGATTTCAC CGATCTCGTC
TTCATCGATC CCGTCGGCAC CGGCTACAGC CGCGCGGCGG ATGGCGACGG CAAGAAGTAC
TGGAGCGTCG ATGCGGATGC CTCGGTGCTC GCCGCGGCCA TCGCCCGCTA TCTGCGCCAG
AACGACCGCC TCGCCTCGCC GAAATTCTTC GTCGGCGAGA GCTATGGCGG CTTCCGCGGG
CCGCTGATCG CGCAGAAGCT CCAGCAGGAT GTCGGCGTCG GCCTCTCGGG CCTCGTGCTG
CTCTCCCCCG TGCTCGACTT CGCGTGGCTA CAGCCGCCCC GCACCACGCC GTGGGGGTTT
GTCACCAAAC TCCCCTCGTT TGCCGCCGCG GCGCTGGAGC GCGCGGGCAC GACGCCGAGC
CGCGAACTCA TGAAGGAGGC CGAGACCTAC GCGTCCGGCG CCTATCTCAC CGATCTCCTG
AAAGGCCCGT CCGACCGGGC GGCGGTGGCG CGGCTCGCCG AGAGGGTCTC GGCGCTGACC
GGCCTCGATC TGGAGACCGT GCGGCGCCAG GCCGGGCGAC TCACCGCCCA CAGCTACCAG
CGCGAGATCG GGCGGGATGC CGGCCGCGTC GCCTCGGCCT ACGACACCGG CGTGACCGGC
TGGGACCCGG ACCCGACCGC GCCGCAATCG GGCTTCGAGG ATCCGGTGCT CGACGCACTG
CAGGCGCCGC TCACCACCGC CATGGTGCAG CTCTATCAGG GCCGCCTCAA CTGGCGCGTC
GAGAACATGC GCTACGAGTT GCTCAACGGC GCGGTCAACC GCGGCTGGAC CTGGGGCTCG
GGCCGCTCGG CGCCGGAAGC CATGGGGGCC CTGAAGGACG CGCTGGCGCT CGACGGGCGG
ATGCGGGTGC TCGTCGCCCA CGGCTTCACC GACCTCGTGA CGCCCTACTT CACCTCGAAG
ATGCTGCTGG ACCAGGTGCC GGTCTACGGC TCGCCCGATC GCCTCAAGCT CTCGGTTTAT
CCCGGCGGCC ACATGTTCTA CACGCGGCCG GATTCGCGCA ACGCCTTCCA CGACGACGCC
GCCGACCTGT TTGCCCGAGC GCTGGAGACC CGCTCCGACG GGAGCGCGAA GGGTGGAAGT
GCGTCGGGCG CGACCATGCC GGAGAAGAGA CCGACGCCTT GA
 
Protein sequence
MRRPVGRRDN THDQSPEDRE RPMTQAFSLS PRAGTARLCL AGLLSLTVGL GPVLAQHGPA 
QGDGPGQAQG RGTAQGQPRK APEGRRLPPD ATTEHSIDGP SGRALAFTAT AGSLALVDEE
GKLQSEIAFI AYTKAGKPEE AAARPITFGV NGGPGAASAY LNIGAIGPWR LPTDGASISP
SQTIALQPNP ATWLDFTDLV FIDPVGTGYS RAADGDGKKY WSVDADASVL AAAIARYLRQ
NDRLASPKFF VGESYGGFRG PLIAQKLQQD VGVGLSGLVL LSPVLDFAWL QPPRTTPWGF
VTKLPSFAAA ALERAGTTPS RELMKEAETY ASGAYLTDLL KGPSDRAAVA RLAERVSALT
GLDLETVRRQ AGRLTAHSYQ REIGRDAGRV ASAYDTGVTG WDPDPTAPQS GFEDPVLDAL
QAPLTTAMVQ LYQGRLNWRV ENMRYELLNG AVNRGWTWGS GRSAPEAMGA LKDALALDGR
MRVLVAHGFT DLVTPYFTSK MLLDQVPVYG SPDRLKLSVY PGGHMFYTRP DSRNAFHDDA
ADLFARALET RSDGSAKGGS ASGATMPEKR PTP
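
The gene and protein sequences above are printed in space-separated blocks of ten characters. The Python sketch below shows one way to strip that layout, compute GC content, and translate the gene. It is a minimal illustration, not the method used to produce this record. It builds the codon table from the NCBI compact amino acid string, which is the same for translation table 11 as for the standard code; table 11 differs in its permitted start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here). All function names are hypothetical.

```python
BASES = "TCAG"
# NCBI compact amino acid string; identical for tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon (TTT, TTC, ..., GGG) to its one-letter amino acid.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, in the same blocked layout.
first_blocks = "ATGCGTCGTC CGGTGGGAAG\nGCGCGACAAC"
dna = clean(first_blocks)
print(translate(dna))  # first ten residues of the protein
```

Run on the first 30 bases, the sketch prints MRRPVGRRDN, which matches the start of the protein sequence listed above. Passing the full cleaned gene sequence to gc_percent and translate would allow the GC content and Protein Length fields to be checked the same way.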