Gene Mext_4655 details

Gene Information

Locus tag: Mext_4655
Symbol:
ID: 5832385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 5206216
End bp: 5207937
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 70%
IMG OID: 641370450
Product: peptidase S10 serine carboxypeptidase
Protein accession: YP_001642094
Protein GI: 163854051
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2939] Carboxypeptidase C (cathepsin A)
TIGRFAM ID:
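
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5206216
end_bp = 5207937

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1722 bp) and Protein Length (573 aa) fields.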


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.419635
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCGTC CGGTGGGAAG GCGCGACAAC ACGCACGACC AATCGCCCGA GGATCGCGAG 
AGACCGATGA CGCAAGCCTT CTCCCTCTCC CCCCGTGCCG GAACGGCTCG GCTCTGCTTG
GCCGGCCTCC TCTCCCTCAC CGTGGGATTG GGGCCGGTGC TCGCCCAGCA CGGCCCGGCG
CAGGGTGACG GTCCCGGCCA AGCGCAGGGG AGGGGAACGG CGCAGGGCCA GCCCCGCAAG
GCGCCGGAAG GCCGCCGTCT GCCGCCGGAC GCGACCACCG AGCACAGCAT CGACGGACCG
AACGGCCGCG CGCTCGCCTT CACCGCCACC GCCGGAAGCC TCGCGCTGGT GGACGAGGAG
GGCAAGCTTC AGTCCGAGAT CGCCTTCATC GCCTACACCA AGGCGGGCAA GCCGGAGGAG
ACCGCTGCCC GGCCGATCAC CTTCGGCGTC AATGGCGGAC CGGGCGCGGC CTCGGCCTAT
CTCAATATCG GTGCGATCGG TCCCTGGCGC CTGCCGACTG ACGGCCCCTC GATCAGCCCG
TCGCAGACGA TCGCGCTTCA GCCGAACCCG GCGACCTGGC TCGACTTCAC CGATCTCGTC
TTCATCGATC CCGTCGGCAC CGGCTACAGC CGCGCGGCGG ACGGCGACGG CAAGAAGTAC
TGGAGCGTCG ATGCGGATGC CTCGGTGCTC GCCGCGGCCA TCGCCCGCTA TCTCCGCCAG
AACGACCGTC TCGCCTCGCC GAAATTCTTC GTCGGCGAGA GCTATGGCGG CTTCCGCGGG
CCGCTGATCG CGCAGAAGCT GCAGCAGGAT GTCGGCGTCG GCCTGTCGGG CCTCGTGCTG
CTCTCACCCG TGCTCGACTT CGCGTGGCTA CAGCCGCCCC GCACCACGCC GTGGGGTTTC
GTGACCAAAC TCCCCTCGTT TGCCGCCGCG GCGCTGGAGC GCGCGGGCAC GACGCCGAGC
CGCGAACTCA TGAAGGAGGC CGAGACCTAC GCGTCCGGCG CCTATCTCAC CGATCTCCTG
AAAGGCCCGT CCGACCGGGA GGCGGTGGCG CGGCTCGCCG AGAAGGTCTC GGCGCTGACG
GGTCTCGATC CGGAGACCGT GCGGCGCCAG GCCGGGCGAC TCACCGCCCA CAGCTACCAG
CGCGAGATCG GGCGCGATGC CGGCCGCGTC GCCTCGGCCT ACGACACCGG CGTGACCGGC
TGGGACCCGG ACCCGACCGC TCCGCAATCG GGCTTCGAGG ATCCGGTGCT CGACGCGCTG
CAGGCGCCGC TCACCACCGC CATGGTGCAG CTCTATCAGG GCCGCCTGAA CTGGCGTGTC
GAGAACATGC GCTACGAGTT GCTCAACGGC GCGGTCAACC GCGGCTGGAC CTGGGGCTCA
GGCCGCTCGG CGCCGGAAGC GATGGGGGCC CTGAAGGACG CGCTGGCGCT CGACGGGCGG
ATGCGGGTGC TCGTCGCCCA CGGCTTCACC GATCTCGTGA CGCCCTACTT CACCTCGAAG
ATGCTGCTGG ACCAGATGCC GGTCTACGGC TCGCCCGACC GCCTTAAGCT CTCGGTTTAT
CCCGGCGGCC ACATGTTCTA CACGCGGCCG GATTCGCGCA ACGCCTTCCA CGACGACGCC
GCCGACCTGT TCGCCCGAGC GCTGGAGACC CGCTCCAACG GGAGCGCGAA GGGCGGTGGC
GCGTCGGGCG CGACCATGCC GGAGAAGAGA CCGACGCCTT GA
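
The gene sequence above is printed in blocks of 10 bases, 60 per line, so the spacing has to be removed before the sequence is measured or analysed. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used here only as sample input.
sample = "ATGCGTCGTC CGGTGGGAAG GCGCGACAAC ACGCACGACC AATCGCCCGA GGATCGCGAG"
cleaned = clean_sequence(sample)

print(len(cleaned))
print(f"{gc_fraction(cleaned):.0%}")
```

Passing the full gene sequence instead of the sample line gives figures to compare with the Gene Length and GC content fields.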
 
Protein sequence
MRRPVGRRDN THDQSPEDRE RPMTQAFSLS PRAGTARLCL AGLLSLTVGL GPVLAQHGPA 
QGDGPGQAQG RGTAQGQPRK APEGRRLPPD ATTEHSIDGP NGRALAFTAT AGSLALVDEE
GKLQSEIAFI AYTKAGKPEE TAARPITFGV NGGPGAASAY LNIGAIGPWR LPTDGPSISP
SQTIALQPNP ATWLDFTDLV FIDPVGTGYS RAADGDGKKY WSVDADASVL AAAIARYLRQ
NDRLASPKFF VGESYGGFRG PLIAQKLQQD VGVGLSGLVL LSPVLDFAWL QPPRTTPWGF
VTKLPSFAAA ALERAGTTPS RELMKEAETY ASGAYLTDLL KGPSDREAVA RLAEKVSALT
GLDPETVRRQ AGRLTAHSYQ REIGRDAGRV ASAYDTGVTG WDPDPTAPQS GFEDPVLDAL
QAPLTTAMVQ LYQGRLNWRV ENMRYELLNG AVNRGWTWGS GRSAPEAMGA LKDALALDGR
MRVLVAHGFT DLVTPYFTSK MLLDQMPVYG SPDRLKLSVY PGGHMFYTRP DSRNAFHDDA
ADLFARALET RSNGSAKGGG ASGATMPEKR PTP
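
The protein sequence above is the in-frame translation of the gene sequence under translation table 11. The sketch below rebuilds the codon table by hand rather than calling a sequence-analysis library. It relies on table 11 assigning the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons), and the `translate` helper is a hypothetical name.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Assumes only A, C, G, T (any case) plus whitespace; other characters
    raise KeyError. Alternative start codons are not special-cased, which
    does not matter here because the gene starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCGTCGTC CGGTGGGAAG GCGCGACAAC"))
print(translate("CCGACGCCTT GA"))
```

The two printed fragments can be compared with the first ten and last three residues of the protein sequence.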