Gene Mpe_A3209 details

Gene Information

Locus tag: Mpe_A3209
Symbol:
ID: 4786548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3411491
End bp: 3412618
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 71%
IMG OID: 640091782
Product: hypothetical protein
Protein accession: YP_001022397
Protein GI: 124268393
COG category:
COG ID:
TIGRFAM ID: [TIGR02098] MJ0042 family finger-like domain
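
The length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive (which matches the stated gene length) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
# Coordinates copied from the record above.
START_BP = 3411491
END_BP = 3412618


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1128
print(protein_length(length_bp))  # 375
```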


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.737407
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0103379
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTGG CCACCCGCTG CACCGCCTGC GGCACGATCT TCCGTGTCGT GCAGGACCAG 
CTCAAGGTCT CGGAAGGCTG GGTGCGCTGC GGCCGCTGCC AGGACACCTT CAACGCGCTC
GAAGGCCTGT TCGATCTGGA GCGCGAGGCA CCGCCGCAGC GTATCCCGAA GGCCGGCGCT
ACGCAGTCGG TGGTCGAGGG CATGGCCGAG TTCGTTGCGA GCCATCACCC CGGCAACAGC
GAACACGGCG CACTGCCGGC CATGCCCGCG ACCCAGGAGC ATGACGCGAT CGAGTCGCGC
TTCTTCGCGC CGCAGTCCGA CGACAGCCGC TCGGACGAGC ACCCCGATTT CGCCGATGCC
CGCTTTCCGA GCGAGTTCCC GCCCGACGCC GCGGCACTGG AACCGGACGC GACCGAAGAT
CCCGTCGATG CGCTGCCGCC AGCCAGCCCC AAGAGCGCGC CGCCGTCCAC GCCATTGCTG
CAGCGCTGGC GCGACAGCCG TGCCGCGCGA CAGGCGGCCG CGATGAGTTC GCTGCTGGAG
GCGCCGATCG GGGACGAGGC GGCGATGCCG CCGCCCGCTG CGCCGGCCGT TGCCGGCACC
CCAGGCTTCC TGCGCCAGGC CGAGGATGCG GCGCGCTGGC GCCGCCCGCG GGTGCGCGCC
TCGCTCGTCG TGGCTGCGGC ACTGCTGATC GGCACGCTGC TCACCCAGAT CGCTGTGCAG
TACCGCGACG CCTTCGCGGC ACAATGGCCG CAGGCGCGGC CGACACTGGA AACGCTGTGC
GAGGTGCTGG ACTGCCGCAT CGAGCCGCTG CGGCGCCTTG CGGCCATCAC CGTCGAATCG
AGTGGGCTGA CGCAGGTGGA AGGCAGTGAT GCCTACCGGC TGAGCCTGAC GCTGCACAAC
CGGGGCCAGG TGGATATCGC CCTGCCGTCG GTCGATCTCA GTGTGACCGA CAACAGCGGC
ACCCTGGTCT CGCGGCGCGC GCTGGCGCCG GCCGATTTCC GCACCGCCAC GGGCGGTTCA
GTGCCCGGCG TGGCGCTCGC CCCCGGATCG GAAAGTCAGT TGCAAGCACT GCTGACGGCG
CGCGGCGCAC GCATCAGTGG CTACACGGTC GAGCTGTTCT ATCCCTGA
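
The GC content reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch of one way to do that; `gc_content` is an illustrative helper, not something provided by the source database. It ignores the spaces and line breaks used in the 10-base block layout. Only the first two blocks of the sequence are used here as sample input.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the sequence can be
    pasted in the 10-base block layout used in this record.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, as sample input.
print(gc_content("ATGAGCCTGG"))             # 60.0
print(gc_content("ATGAGCCTGG CCACCCGCTG"))  # 70.0
```

Passing the full 1128 bp sequence to the same function gives the gene-wide percentage.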
 
Protein sequence
MSLATRCTAC GTIFRVVQDQ LKVSEGWVRC GRCQDTFNAL EGLFDLEREA PPQRIPKAGA 
TQSVVEGMAE FVASHHPGNS EHGALPAMPA TQEHDAIESR FFAPQSDDSR SDEHPDFADA
RFPSEFPPDA AALEPDATED PVDALPPASP KSAPPSTPLL QRWRDSRAAR QAAAMSSLLE
APIGDEAAMP PPAAPAVAGT PGFLRQAEDA ARWRRPRVRA SLVVAAALLI GTLLTQIAVQ
YRDAFAAQWP QARPTLETLC EVLDCRIEPL RRLAAITVES SGLTQVEGSD AYRLSLTLHN
RGQVDIALPS VDLSVTDNSG TLVSRRALAP ADFRTATGGS VPGVALAPGS ESQLQALLTA
RGARISGYTV ELFYP
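
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below illustrates this on the first and last codons of the gene. It builds the codon table from the conventional compact amino-acid string (base order T, C, A, G), whose codon-to-residue assignments for table 11 are the same as the standard code. The names `CODON_TABLE` and `translate` are illustrative. Table 11 also permits alternative start codons, which this sketch does not handle; that is not needed here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is removed first. A codon containing anything other than
    A/C/G/T raises KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene, then its last 18 bases (both in frame).
print(translate("ATGAGCCTGG CCACCCGCTG CACCGCCTGC"))  # MSLATRCTAC
print(translate("GAGCTGTTCT ATCCCTGA"))               # ELFYP
```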