Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3748 |
Symbol | |
ID | 4785977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3968634 |
End bp | 3970217 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640092331 |
Product | L-proline dehydrogenase |
Protein accession | YP_001022936 |
Protein GI | 124268932 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0506] Proline dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.654089 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.809311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCTGC CCACCGCCAC GCCCGCCGCC CGCCTGCCGT CGCCGCTGCG CGAGGAGGCG TCCGTCCTCC AAGCCCGGCT GGCGGCGCTG GCCGGAGCGC TCGACTGGGC CGGCGTGCAG GCGCTGGCGC GTCCCTGGGT GCAGGCGGTG CGCGACGAAC CGGCGCCGTT CTGGGCCATG GAGTCGCTGC TGCGCGAGTA CCCGATCTCC AGCGCCGAAG GCCTGGCGCT GATGCGGCTG GCCGAGGCGC TGCTGCGCGT GCCCGATGCC GCCACCGCGA TCGCGCTGAC CGCCGACCAG CTGGGCCGCG CCGATTTCGA CACCGCCGCC ACCGGCGGCG ACGGACCGCA CAAGAGGCTG GCCAGCCTGT CGGCCAGCGC GATCGCGCTG TCCAAGAAGT TCCTGCCCGA CGGCGAGCAC CCGCCCGGCC TGCTGCAGCG CCTGGGCGCG CAGACCGTGG TGGCGGCGAC GGTGCGCGCC ATCCAGTTGC TGGGCCGGCA GTTCGTGCTC GGCCAGTCGA TCGCCGAGGC GCTGGGCGAG GCCCGCAGCC AGCGCCAGGC CCAGCCGCAA CTGCGCTTCA GCTTCGACAT GCTGGGCGAG GGCGCCCGCA CCGAGCACGA TGCGCAGCGC TACCTGCGCT CCTACCGCGA CGCCATCGCG GCCATTGCCG TCACCGCCGT GGCCGGTGCC GGCCCGGAAG CCAACGACGG CATCTCGATC AAGCTGTCGG CGCTGTTCCC GCGCTACGAG GATGCGCAGC GGGTGCGCGT CTTCGCCGAG CTGCTGCCGC GCGTGCTGGG CCTGATCGAC GACGCAGCGG CGGCCGACCT CAACCTCACC GTGGACGCCG AAGAGAGCGA CCGGCTCGAA CTCTCCCTCG AGCTGATCGA CGCTGCTGCC GCGCACATCG CCGCGCGCCA CCCGCGCTGG CGCGGCTTCG GCCTCGCGAT CCAGGCCTAC CAGACGCGCG CCGAGGAATG CGTGCACGAG GTCGCGCGCA TCGCCCGCCA CCACGGCCTG CGCTTCATGG TGCGGTTGGT CAAGGGCGCC TACTGGGACG GCGAGATCAA GCGCGCGCAG GAGCTCGGCT TGGCCGCCTA CCCGGTGTTC ACGCACAAGC ACCACACCGA CATCTCCTAC CTCGCGTGCG CGAAGGCGCT GCTCGACCAC GCCGACGTCA TCTACCCGCA GTTCGCCACC CACAACGCCG GCACCGTCGC GGCCATCGTG CAGATGGCGC GGGTGCGCGG CACGCCGTTC GAGTTGCAAC GGCTGCACGG CATGGGGGAG GGTGTGTCCG CGAGGTGTCG CGGTACACAC CTCCCGCTCC CCCCGCTCAA GCGGGGGGCT TGCCGGTGCG CATCTACGCG CCGGTCGGCG AGCACCGCGA CCTGCTGGCC TACCTGGTGC GCCGCCTGCT GGAGAACGGC GCCAACTCGT CGTTCGTGCA CCAGCTGGCC GACCCGCAGG TCGATGTCGA CGCGCTGCTC GCGTCGCCGC TGCAGGCCGC GCCCGAGCCG GGCCAGCCGT CGCCGCTGCA GCTCTACGGC AGCGCGCGGC GCAACGCGCT GGGCGTCGAC CTGA
|
Protein sequence | MPLPTATPAA RLPSPLREEA SVLQARLAAL AGALDWAGVQ ALARPWVQAV RDEPAPFWAM ESLLREYPIS SAEGLALMRL AEALLRVPDA ATAIALTADQ LGRADFDTAA TGGDGPHKRL ASLSASAIAL SKKFLPDGEH PPGLLQRLGA QTVVAATVRA IQLLGRQFVL GQSIAEALGE ARSQRQAQPQ LRFSFDMLGE GARTEHDAQR YLRSYRDAIA AIAVTAVAGA GPEANDGISI KLSALFPRYE DAQRVRVFAE LLPRVLGLID DAAAADLNLT VDAEESDRLE LSLELIDAAA AHIAARHPRW RGFGLAIQAY QTRAEECVHE VARIARHHGL RFMVRLVKGA YWDGEIKRAQ ELGLAAYPVF THKHHTDISY LACAKALLDH ADVIYPQFAT HNAGTVAAIV QMARVRGTPF ELQRLHGMGE GVSARCRGTH LPLPPLKRGA CRCASTRRSA STATCWPTWC AACWRTAPTR RSCTSWPTRR SMSTRCSRRR CRPRPSRASR RRCSSTAARG ATRWAST
|
| |