Gene Mpe_A0897 details


Gene Information

Locus tag: Mpe_A0897
Symbol:
ID: 4787220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 945870
End bp: 948671
Gene Length: 2802 bp
Protein Length: 933 aa
Translation table: 11
GC content: 72%
IMG OID: 640089458
Product: ATP-dependent transcriptional regulator-like protein
Protein accession: YP_001020094
Protein GI: 124266090
COG category: [K] Transcription
COG ID: [COG2909] ATP-dependent transcriptional regulator
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.571021
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCGT CGTCGCCCCA CCGTCAGCTG CGATCGCACC CGCTCGAGGC CACGACGGAC 
TGGCCCACGC CGCTGGCCGA CGAGCTGGCC TCGACCGAAC GTGGCTTCCC GTCGGACGTG
CGGATGCACA AGCTGTTCAC GCCGCCGGTG TACCCGGGCG CCGTGCCGCG ACAGGCCATC
CTCGACCGTG TGCTGCAGGA CGACAGCCTG CGCGTCACGG TGCTGCAGGG GCCCGCCGGC
CACGGCAAGT CGACCACCCT GCAGCAGATC AAGACCGCCC ACGAGGCCCG CGGCTGGCGC
ACCGCCTGGC TCACCCTCGA CGACGCCGAC AACGACCCGC GACGCTTCGA GTCGCACCTG
GTCGCGGTGA TGAGCCTGCT GCACGGCCGT GCCGCGTCGC CCGGCGCCTC GCGCAGCGGC
ACCGGCGATG CGCCGCGCGA TCTGGCCGAC TGGATGCTCG ACACCCTGTC GGGCCGCATC
ACGCCGGCCT CGATCTTCAT CGACGAGTTC CAGGCGCTGC GCAACGAGGC CCTGCTGCGC
TTCTTCCGCT CGGTGCTCGC GCGCCTGCCG GCCAACGTGC ATGTGTTCAT CGGCTCCCGC
ACGCTGCCGG AGATCGGCCT GGCCACGCTG ATGGTCAACC GTGTGGCGTC GGTGGTGCGG
GCCGACGACC TGCGCTTCAC GCCGGGCGAG GTGACGCAGT TCTTCGCCGA TTCGGCCAGC
CTGCAGGTCA GCGCCGGCGA GGTGGACGCG ATCTACCGCC GCACCGAGGG CTGGCCGGCC
GGCGTGCAGC TGTTCCGGCT GGCGCTGGTG AGTCCCGAGG TGCGCATGGC GCTGGACGGC
GCCGACGACC ACGGGCCGCG CGAGCTGGCC GAATACCTGG CCGACAACGT GATGTCGCTG
CAGTCGCCGC GCATGCAGGA GTTCCTGCTG AAGACCTCGC TGCTGCAGCG CCTGTCGGCA
CCGCTGTGCA CCGCCGTGAC CGGCTTCGAG GACGCGCAGG AGCTGCTGGT GCGGCTGGAG
CGCTCCGGCC TGTTCCTGCG CGCGCTCGAC TCCGACAACC GCTGGTTCCG GTACCACGGC
CTGTTCTCGA CCTACCTGGC CGAGACCCTG CAGCGCAACG GCCCCGAAGC GCTGCGGCAG
GTGCACAAGA AGGCGGCGCA GTGGTGCCTG GCGCACGAGC TGCCCGAGGA AGCGATCCAC
CACGCGTTGT GCTGCAGGAA CTTTCCGCTC GCGGCGTCCA CGCTGACCGA CTGGTCGTCG
CAGCTGGTGG CCGGGGCCGA GCTGATCACG CTGGAGCGCT GGCACGACCG CCTGCCCTTC
CACGAGGTGG CGCAGCGGCC GGCGCTGGTG ATCCGCGCCG CCTATGCGCT GATGTTCCTG
CGCCGCCGCC CCAAGCTGCG GCCGCTGCTG GAGCTGATGG CACCGCAGGC CGGCGGCGGC
GACATCGTGC CGACCACCAA CCCCGACCTG TGCCGCGCGA TGTCCTTCCT GCTGGTGGAC
GACGACATGG CGGCGGCGGC CGACACCGTC GAGCAGGCCG GCGTGGTGCA GCGCGAGCTG
GAGGGCTTCC CGGCCTTCGA GCTGGGAGCC GCCGCGAACG TGCTCGCGCT GGGCAAGGTG
GCCAGCGGCG ATTTCGAGGG CGCGCGGCAG GCCCTGGCGC TGGCGCGTGC ACACCTCGGT
CGCGGCGGCG GCTCGTTCGT CGGCGGCTAC ACCGCCGCCA TCACCGGCAG CAACCTGCTC
GTGCAGGGCC GGCTGCAGGA GGCGCTGGCG CACCTGCGCG ACGAGAACGC GCAGGAGGCG
CCGCTCGACA CCTCGGTGGC CGGTGCGGCG CTCGCGGCCT GCCACATGTT CGCGCTGTAC
GAGGCCAACG ACCTGGCGAC GCTGGAGTCG CTGGCGCACC GCTTCCAGCG CGAGATCTCC
GAGTCGGTGA CGCTCGACTT CATCGCCGCG GCCCACATCG CCATCTCGCG CATGCACGAG
GCGCGCGGGC GCTCCGACGA GGCCGTCGCG GTGCTCGACG AACTGGAGCG CATCGGCCAC
ACCAGCCCCT GGCAACGCCT GGTGGCGGTG AGCGAGTGGG AGCGCGTGCG GCGTGCGCTG
GCGGGCGGCG AGATCGAGCG CGCGGTGGCG CTGGCGACGC GCATCGCCCC GGACTCGCGC
GACGACGCGC CGCACTGGAT CCACCTGGCC GAGGACGTGG AAGGCTCGGG CTACGGGTGG
ATCCGCCTCG CCATCGCGCG GCACGACCAC GCCGACGCCG CGCAGCGCAT CGCCCGGGAG
CGGGCGCGCC AGACCGGTCG GGTCTACCGC GACATCAAGC TGAGCGTGCT GGAGACCCTG
CTCCAGCAAC GCATGGGCGC CCGTAACGCC GCCCATCGCT GCCTGCGCAA GGCGCTGCAG
CTGGGCCGAC GGGGCCGCTA CGTGCGCTGC CTGCTCGACG AGGGGGACGG CGTCATCGAG
CTGCTGCGCG AGGCCTACCA GAACCTGCTG CGAGGTCACG AACCGGGCGG CGGCACCGGT
ACCGACCCGG ACCGCGACTA CATCGAGCTG CTGCTCGAGG CCTCGGGGAC CGACCTGGGC
CGCCAGGCCG CCGGCAACGC CCTGACCGAG GCACTGTCGG AGCGCGAAAA GGAGATGCTG
CGTTTCCTGC TCGACGGCAC CACCAACCGC GAGATCGCCG GACGGCTGTT CGTATCGGAG
AACACCGTCA AGTTCCACCT GAAGAACATC TACTCCAAGC TCGGCGTCGG CAACCGATTG
CAGGCCATCA ACACGGCGCG GGCGCTGCGG TTGATCGACT GA
 
Protein sequence
MPPSSPHRQL RSHPLEATTD WPTPLADELA STERGFPSDV RMHKLFTPPV YPGAVPRQAI 
LDRVLQDDSL RVTVLQGPAG HGKSTTLQQI KTAHEARGWR TAWLTLDDAD NDPRRFESHL
VAVMSLLHGR AASPGASRSG TGDAPRDLAD WMLDTLSGRI TPASIFIDEF QALRNEALLR
FFRSVLARLP ANVHVFIGSR TLPEIGLATL MVNRVASVVR ADDLRFTPGE VTQFFADSAS
LQVSAGEVDA IYRRTEGWPA GVQLFRLALV SPEVRMALDG ADDHGPRELA EYLADNVMSL
QSPRMQEFLL KTSLLQRLSA PLCTAVTGFE DAQELLVRLE RSGLFLRALD SDNRWFRYHG
LFSTYLAETL QRNGPEALRQ VHKKAAQWCL AHELPEEAIH HALCCRNFPL AASTLTDWSS
QLVAGAELIT LERWHDRLPF HEVAQRPALV IRAAYALMFL RRRPKLRPLL ELMAPQAGGG
DIVPTTNPDL CRAMSFLLVD DDMAAAADTV EQAGVVQREL EGFPAFELGA AANVLALGKV
ASGDFEGARQ ALALARAHLG RGGGSFVGGY TAAITGSNLL VQGRLQEALA HLRDENAQEA
PLDTSVAGAA LAACHMFALY EANDLATLES LAHRFQREIS ESVTLDFIAA AHIAISRMHE
ARGRSDEAVA VLDELERIGH TSPWQRLVAV SEWERVRRAL AGGEIERAVA LATRIAPDSR
DDAPHWIHLA EDVEGSGYGW IRLAIARHDH ADAAQRIARE RARQTGRVYR DIKLSVLETL
LQQRMGARNA AHRCLRKALQ LGRRGRYVRC LLDEGDGVIE LLREAYQNLL RGHEPGGGTG
TDPDRDYIEL LLEASGTDLG RQAAGNALTE ALSEREKEML RFLLDGTTNR EIAGRLFVSE
NTVKFHLKNI YSKLGVGNRL QAINTARALR LID
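
The lengths listed under Gene Information can be cross-checked against the coordinates and the sequence. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are inclusive coordinates and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names (`clean`, `span_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced for this example.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming the last codon is an uncounted stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of three")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Gene Information section of this record.
gene_length = span_length(945870, 948671)
print(gene_length)                  # 2802
print(protein_length(gene_length))  # 933
```

Passing the full Gene sequence block to `gc_percent` should give a value that can be compared with the listed GC content; the whitespace between the blocks of ten bases is ignored by `clean`.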