Gene Mpe_A3748 details

Gene Information

Locus tag: Mpe_A3748
Symbol:
ID: 4785977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3968634
End bp: 3970217
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 73%
IMG OID: 640092331
Product: L-proline dehydrogenase
Protein accession: YP_001022936
Protein GI: 124268932
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0506] Proline dehydrogenase
TIGRFAM ID:
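
The lengths in this table can be cross-checked against the coordinates. The sketch below is a minimal illustration, assuming that the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein; neither assumption is stated by the record itself.

```python
# Values copied from the Gene Information table.
start_bp = 3968634
end_bp = 3970217

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1584
print(protein_length)  # 527
```

Under those assumptions both results agree with the listed Gene Length (1584 bp) and Protein Length (527 aa).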


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.654089
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.809311
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCTGC CCACCGCCAC GCCCGCCGCC CGCCTGCCGT CGCCGCTGCG CGAGGAGGCG 
TCCGTCCTCC AAGCCCGGCT GGCGGCGCTG GCCGGAGCGC TCGACTGGGC CGGCGTGCAG
GCGCTGGCGC GTCCCTGGGT GCAGGCGGTG CGCGACGAAC CGGCGCCGTT CTGGGCCATG
GAGTCGCTGC TGCGCGAGTA CCCGATCTCC AGCGCCGAAG GCCTGGCGCT GATGCGGCTG
GCCGAGGCGC TGCTGCGCGT GCCCGATGCC GCCACCGCGA TCGCGCTGAC CGCCGACCAG
CTGGGCCGCG CCGATTTCGA CACCGCCGCC ACCGGCGGCG ACGGACCGCA CAAGAGGCTG
GCCAGCCTGT CGGCCAGCGC GATCGCGCTG TCCAAGAAGT TCCTGCCCGA CGGCGAGCAC
CCGCCCGGCC TGCTGCAGCG CCTGGGCGCG CAGACCGTGG TGGCGGCGAC GGTGCGCGCC
ATCCAGTTGC TGGGCCGGCA GTTCGTGCTC GGCCAGTCGA TCGCCGAGGC GCTGGGCGAG
GCCCGCAGCC AGCGCCAGGC CCAGCCGCAA CTGCGCTTCA GCTTCGACAT GCTGGGCGAG
GGCGCCCGCA CCGAGCACGA TGCGCAGCGC TACCTGCGCT CCTACCGCGA CGCCATCGCG
GCCATTGCCG TCACCGCCGT GGCCGGTGCC GGCCCGGAAG CCAACGACGG CATCTCGATC
AAGCTGTCGG CGCTGTTCCC GCGCTACGAG GATGCGCAGC GGGTGCGCGT CTTCGCCGAG
CTGCTGCCGC GCGTGCTGGG CCTGATCGAC GACGCAGCGG CGGCCGACCT CAACCTCACC
GTGGACGCCG AAGAGAGCGA CCGGCTCGAA CTCTCCCTCG AGCTGATCGA CGCTGCTGCC
GCGCACATCG CCGCGCGCCA CCCGCGCTGG CGCGGCTTCG GCCTCGCGAT CCAGGCCTAC
CAGACGCGCG CCGAGGAATG CGTGCACGAG GTCGCGCGCA TCGCCCGCCA CCACGGCCTG
CGCTTCATGG TGCGGTTGGT CAAGGGCGCC TACTGGGACG GCGAGATCAA GCGCGCGCAG
GAGCTCGGCT TGGCCGCCTA CCCGGTGTTC ACGCACAAGC ACCACACCGA CATCTCCTAC
CTCGCGTGCG CGAAGGCGCT GCTCGACCAC GCCGACGTCA TCTACCCGCA GTTCGCCACC
CACAACGCCG GCACCGTCGC GGCCATCGTG CAGATGGCGC GGGTGCGCGG CACGCCGTTC
GAGTTGCAAC GGCTGCACGG CATGGGGGAG GGTGTGTCCG CGAGGTGTCG CGGTACACAC
CTCCCGCTCC CCCCGCTCAA GCGGGGGGCT TGCCGGTGCG CATCTACGCG CCGGTCGGCG
AGCACCGCGA CCTGCTGGCC TACCTGGTGC GCCGCCTGCT GGAGAACGGC GCCAACTCGT
CGTTCGTGCA CCAGCTGGCC GACCCGCAGG TCGATGTCGA CGCGCTGCTC GCGTCGCCGC
TGCAGGCCGC GCCCGAGCCG GGCCAGCCGT CGCCGCTGCA GCTCTACGGC AGCGCGCGGC
GCAACGCGCT GGGCGTCGAC CTGA
 
Protein sequence
MPLPTATPAA RLPSPLREEA SVLQARLAAL AGALDWAGVQ ALARPWVQAV RDEPAPFWAM 
ESLLREYPIS SAEGLALMRL AEALLRVPDA ATAIALTADQ LGRADFDTAA TGGDGPHKRL
ASLSASAIAL SKKFLPDGEH PPGLLQRLGA QTVVAATVRA IQLLGRQFVL GQSIAEALGE
ARSQRQAQPQ LRFSFDMLGE GARTEHDAQR YLRSYRDAIA AIAVTAVAGA GPEANDGISI
KLSALFPRYE DAQRVRVFAE LLPRVLGLID DAAAADLNLT VDAEESDRLE LSLELIDAAA
AHIAARHPRW RGFGLAIQAY QTRAEECVHE VARIARHHGL RFMVRLVKGA YWDGEIKRAQ
ELGLAAYPVF THKHHTDISY LACAKALLDH ADVIYPQFAT HNAGTVAAIV QMARVRGTPF
ELQRLHGMGE GVSARCRGTH LPLPPLKRGA CRCASTRRSA STATCWPTWC AACWRTAPTR
RSCTSWPTRR SMSTRCSRRR CRPRPSRASR RRCSSTAARG ATRWAST
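
The sequence blocks are printed in space-separated groups of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the input shown is a toy string rather than the Mpe_A3748 sequence.

```python
def clean_sequence(text: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Toy input for illustration only (not the gene sequence from this record).
toy_block = """ATGC GGCC
AT"""
toy = clean_sequence(toy_block)
print(len(toy), gc_fraction(toy))
```

Pasting the full Gene sequence block into `clean_sequence` should give a string whose length can be compared with the listed Gene Length, and `gc_fraction` can then be compared with the listed GC content.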