Gene Information |
Locus tag | Mext_1220 |
Symbol | |
ID | 5831870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1350595 |
End bp | 1352100 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641367013 |
Product | Serine-type D-Ala-D-Ala carboxypeptidase |
Protein accession | YP_001638693 |
Protein GI | 163850650 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1686] D-alanyl-D-alanine carboxypeptidase |
TIGRFAM ID | |
|
|
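The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions match the reported values but are not stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Mext_1220.
length = gene_length(1350595, 1352100)
residues = protein_length(length)
print(length, residues)  # prints "1506 501"
```

Both results agree with the Gene Length (1506 bp) and Protein Length (501 aa) fields.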
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.518526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.391466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAACC GTGTTTGGGG ACGGCTGATC GGCTCGCTCA CCGCCCTCGG CCTGATGGCG GGGGCGGCCT CGGCGGTGAC GGCACCGATC CTCGTCGTCG ATGCCGAGTC CGGCAAGGTG CTCTACAGCC AGGGCGCGAC CGACCCCTGG TATCCGGCCT CGATCACCAA GCTGATGACC ACCTATGTCG CCCTCGACAT GGTGCGCCAG GGCAAGGTCT CCCTCGACAC GCTGCTGACG GTGTCGCCGG CCGCTGCCGC CGAGCCGCCC TCGAAGATGG GCTTCAAGCC GGGCACGCAG ATCACCCTCG ACAACGCCCT CAAGATCATC ATGGTCAAGT CGGCCAACGA CGTGTCCTGG GCGATCGGCG AAGGGCTCGG CGGATCCGTC GAGGGCTTCG CCGACATGAT GAACGAGACC GCGCACCGGA TCGGCATGCG CGAGAGCCGC TGGTACAACC CCAACGGCCT GCCCGAGCCG CGGCAATGGA CGAGCGCCCG CGATATGGCG GTCCTGGCCC GCGCGCTGAT GCGCGACTTT CCCAACCAGC AGAACCTGTT CTCGATCTCG GCGATCCAGT TCGGCAAGTC GGTGATGGCC AACCATAACG GCCTGCTCGG GCGCTATCCC GGCGCCGACG GCATGAAGAC CGGCTTCATC TGCTCGGGCG GCTTCAACGT GGTGGCGACC GCGACCCGCG GCGGCCGACG CATCATCACC GTGGTGATGG GCCAGCCGAG CGCACGCGAG CGCGACATCA AGGCCGCCGA TCTGTTCGAT TACGGCTTCG GCCAGTCGGC GGGCTGGACC GCGCCGACGC TCGAATCGCT GCCGCCCTCC AACGTCGCCG CCCCGCCGGA CATGCGGCCC TACATCTGCG ACAAGCGCAA GCCGATGCCG GTGGACGAAG GCCCCGGCGC GCTGACCGCG AGCGGCCCCG GCGCCAGCAG CGCGACGACG CAGCTCCTCG GCTCGGCGAC CCCGGACGCC ACCAACCTCG CCTTCGCCGC GCTCAGCAGC GCCAACGTCC GCGGACGCAC CCTGCCGCCG CGGGCGCCGC TTCAGCCGAT CCCGGTCTGG ATCGGCAGCA GCCCCACGGA AGGCGCGATG GCGCTGGCTC GCGAGGAACA GGAGGCGCAG GCGTCCAAGC AGGCGGCCCG TCAGGCCGCG CTGGAGGCTG CCCAGCGCAA GCGCGCGGAA GCCAAGGCCA CCAAGGACGC CGCCCGAGAG GCCGCCAGGC AGGCCGCCTC GGAGAAAGCA GCCAAGGCCC GCGCCGCCGC CGTGACCGCG CCCAAGCCCG CCACGAAGAC CGCCGCCGGC CGGGAGAAGA GCGTGCCGGC CACGACCAGC GCCTACACGG CGGTGGAGCC CGCTCCGAAG GCGGCCGCGA AGCCGGCGCA CAAGCCGGCC GCCAAGAAGC CCGAAGTGAC AAAGCCCGAG GCGGCCAAGC CCGCCGCCAA GAAGGCGGAC AAGAAGCCCG CCAAGGCCAA GAACGACGAG TCGTAA
|
Protein sequence | MVNRVWGRLI GSLTALGLMA GAASAVTAPI LVVDAESGKV LYSQGATDPW YPASITKLMT TYVALDMVRQ GKVSLDTLLT VSPAAAAEPP SKMGFKPGTQ ITLDNALKII MVKSANDVSW AIGEGLGGSV EGFADMMNET AHRIGMRESR WYNPNGLPEP RQWTSARDMA VLARALMRDF PNQQNLFSIS AIQFGKSVMA NHNGLLGRYP GADGMKTGFI CSGGFNVVAT ATRGGRRIIT VVMGQPSARE RDIKAADLFD YGFGQSAGWT APTLESLPPS NVAAPPDMRP YICDKRKPMP VDEGPGALTA SGPGASSATT QLLGSATPDA TNLAFAALSS ANVRGRTLPP RAPLQPIPVW IGSSPTEGAM ALAREEQEAQ ASKQAARQAA LEAAQRKRAE AKATKDAARE AARQAASEKA AKARAAAVTA PKPATKTAAG REKSVPATTS AYTAVEPAPK AAAKPAHKPA AKKPEVTKPE AAKPAAKKAD KKPAKAKNDE S
|
| |
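The sequences above are displayed in blocks of ten characters, so the spaces need to be removed before any analysis. The following is a minimal sketch of how the displayed gene sequence could be cleaned and checked against the record (length, GC content, start and stop codons). The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the standard one for translation table 11. A short placeholder string stands in for the full sequence here.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Placeholder input: substitute the full "Gene sequence" field from the record.
example = clean_sequence("ATGGTGAACC GT TAA")
print(len(example), round(gc_fraction(example), 2), looks_like_cds(example))
```

Run on the full gene sequence, the length and GC fraction can be compared with the Gene Length and GC content fields of the record.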