Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4655 |
Symbol | |
ID | 5832385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 5206216 |
End bp | 5207937 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641370450 |
Product | peptidase S10 serine carboxypeptidase |
Protein accession | YP_001642094 |
Protein GI | 163854051 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.419635 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGTC CGGTGGGAAG GCGCGACAAC ACGCACGACC AATCGCCCGA GGATCGCGAG AGACCGATGA CGCAAGCCTT CTCCCTCTCC CCCCGTGCCG GAACGGCTCG GCTCTGCTTG GCCGGCCTCC TCTCCCTCAC CGTGGGATTG GGGCCGGTGC TCGCCCAGCA CGGCCCGGCG CAGGGTGACG GTCCCGGCCA AGCGCAGGGG AGGGGAACGG CGCAGGGCCA GCCCCGCAAG GCGCCGGAAG GCCGCCGTCT GCCGCCGGAC GCGACCACCG AGCACAGCAT CGACGGACCG AACGGCCGCG CGCTCGCCTT CACCGCCACC GCCGGAAGCC TCGCGCTGGT GGACGAGGAG GGCAAGCTTC AGTCCGAGAT CGCCTTCATC GCCTACACCA AGGCGGGCAA GCCGGAGGAG ACCGCTGCCC GGCCGATCAC CTTCGGCGTC AATGGCGGAC CGGGCGCGGC CTCGGCCTAT CTCAATATCG GTGCGATCGG TCCCTGGCGC CTGCCGACTG ACGGCCCCTC GATCAGCCCG TCGCAGACGA TCGCGCTTCA GCCGAACCCG GCGACCTGGC TCGACTTCAC CGATCTCGTC TTCATCGATC CCGTCGGCAC CGGCTACAGC CGCGCGGCGG ACGGCGACGG CAAGAAGTAC TGGAGCGTCG ATGCGGATGC CTCGGTGCTC GCCGCGGCCA TCGCCCGCTA TCTCCGCCAG AACGACCGTC TCGCCTCGCC GAAATTCTTC GTCGGCGAGA GCTATGGCGG CTTCCGCGGG CCGCTGATCG CGCAGAAGCT GCAGCAGGAT GTCGGCGTCG GCCTGTCGGG CCTCGTGCTG CTCTCACCCG TGCTCGACTT CGCGTGGCTA CAGCCGCCCC GCACCACGCC GTGGGGTTTC GTGACCAAAC TCCCCTCGTT TGCCGCCGCG GCGCTGGAGC GCGCGGGCAC GACGCCGAGC CGCGAACTCA TGAAGGAGGC CGAGACCTAC GCGTCCGGCG CCTATCTCAC CGATCTCCTG AAAGGCCCGT CCGACCGGGA GGCGGTGGCG CGGCTCGCCG AGAAGGTCTC GGCGCTGACG GGTCTCGATC CGGAGACCGT GCGGCGCCAG GCCGGGCGAC TCACCGCCCA CAGCTACCAG CGCGAGATCG GGCGCGATGC CGGCCGCGTC GCCTCGGCCT ACGACACCGG CGTGACCGGC TGGGACCCGG ACCCGACCGC TCCGCAATCG GGCTTCGAGG ATCCGGTGCT CGACGCGCTG CAGGCGCCGC TCACCACCGC CATGGTGCAG CTCTATCAGG GCCGCCTGAA CTGGCGTGTC GAGAACATGC GCTACGAGTT GCTCAACGGC GCGGTCAACC GCGGCTGGAC CTGGGGCTCA GGCCGCTCGG CGCCGGAAGC GATGGGGGCC CTGAAGGACG CGCTGGCGCT CGACGGGCGG ATGCGGGTGC TCGTCGCCCA CGGCTTCACC GATCTCGTGA CGCCCTACTT CACCTCGAAG ATGCTGCTGG ACCAGATGCC GGTCTACGGC TCGCCCGACC GCCTTAAGCT CTCGGTTTAT CCCGGCGGCC ACATGTTCTA CACGCGGCCG GATTCGCGCA ACGCCTTCCA CGACGACGCC GCCGACCTGT TCGCCCGAGC GCTGGAGACC CGCTCCAACG GGAGCGCGAA GGGCGGTGGC GCGTCGGGCG CGACCATGCC GGAGAAGAGA CCGACGCCTT GA
|
Protein sequence | MRRPVGRRDN THDQSPEDRE RPMTQAFSLS PRAGTARLCL AGLLSLTVGL GPVLAQHGPA QGDGPGQAQG RGTAQGQPRK APEGRRLPPD ATTEHSIDGP NGRALAFTAT AGSLALVDEE GKLQSEIAFI AYTKAGKPEE TAARPITFGV NGGPGAASAY LNIGAIGPWR LPTDGPSISP SQTIALQPNP ATWLDFTDLV FIDPVGTGYS RAADGDGKKY WSVDADASVL AAAIARYLRQ NDRLASPKFF VGESYGGFRG PLIAQKLQQD VGVGLSGLVL LSPVLDFAWL QPPRTTPWGF VTKLPSFAAA ALERAGTTPS RELMKEAETY ASGAYLTDLL KGPSDREAVA RLAEKVSALT GLDPETVRRQ AGRLTAHSYQ REIGRDAGRV ASAYDTGVTG WDPDPTAPQS GFEDPVLDAL QAPLTTAMVQ LYQGRLNWRV ENMRYELLNG AVNRGWTWGS GRSAPEAMGA LKDALALDGR MRVLVAHGFT DLVTPYFTSK MLLDQMPVYG SPDRLKLSVY PGGHMFYTRP DSRNAFHDDA ADLFARALET RSNGSAKGGG ASGATMPEKR PTP
|
| |