Gene Mpe_A0814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0814 
SymboltbuA1 
ID4786949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp856061 
End bp857560 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content61% 
IMG OID640089375 
Producttoluene monooxygenase alpha subunit 
Protein accessionYP_001020011 
Protein GI124266007 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTAC TTGAGCGGAT GGATTGGTAC GACCTCGCGC GGACCACCAA TTGGACGCCG 
ACCTACGTCA GCGAGGCAGA GCTGTTTCCG ACCGAGATGT CGGGGGACAT GGGAATCCCG
ATGTCCGAAT GGGAGAAGTA CGACGAGCCG TACAAGCAGA CGTATTCGGA GTACGTGAAG
ATCCAGCGTG AGAAGGATTC AGGCGCCTAT TCGGTGAAGG GCGCGCTGGA GCGGAGCAAG
ATGCTGGAAA ACGCTGACCC AGGCTGGATC AGCGTGATCA AGGCGCACTA CGGTGCGATC
GCGCGTGCGG AGTATGCGGC AGCGTCCGCG GAGTCGCGCA TGGCGCGCTT CGCCAAGGCG
CCGGGTCAGC GCAACATGGC GACGATGGGC ATGCTCGACG AGATCCGCCA TGGCCAGATC
CAGCTGTTCT TCCCGCACGA GCACGTGTCG AAAGATCGTC AATTCGACTG GGCATTCAAG
GCCTACGACA CGAATGAATG GGGCGCCATT GCCGCGCGCC ACATGTTCGA CGACATGATG
AACACGCGCA GTGCGGTGGC CATCGGGTTG ATGTTGACCT TCGCCTTCGA GACCGGTTTC
ACCAACATGC AGTTCCTCGG TCTGGCAGCG GACGCCGCTG AGGCGGGCGA CTGGACCTTC
GCCAGCATGA TCTCCAGCGT CCAGACGGAT GAGTCTCGTC ATGCTCAGAT CGGCGGTCCG
CTGGTGCCGA TCCTGATCGC GAACGGCAAG AAGGCCGAGG CTCAACGCAT GATCGACGTT
GCGTTCTGGC GCTCTTGGAA GCTGTTCACT GTGCTGACCG GACCGATGAT GGACTACTAC
ACGCCGCTCG CGCATCGCAA GCAGTCGTTC AAGGAGTTCA TGCAGGAGTT CATCGTCACG
CAGTTCGAGC GCTCCATCCT GGATCTGGGA CTCGAACGAC CCTGGTACTG GGACCAGTTC
CTGGCGGAAC TGGATTACCA GCACCATGGC ATGCACCTGG GGGTGTGGTT CTGGCGTCCA
ACCGTCTGGT GGAACCCGGC TGCAGGCGTC ACGCCCGAGG AACGTGCGTG GCTGGAGGAA
AAGTATCCGG GCTGGAACGA CACGTGGGGC AAGAGCTGGG ACGTCATCGT CGACAACCTG
TTGAAAGACA AGCGCGAGCT CACGTATCCG GAAACGCTTC CGGTGGTGTG CAACATGTGC
AATCTGCCGA TCAACGCAAC ACCTGGCGAT CCCTGGAAGG TCCGAGACCA TTCGTTGGAG
CGCAAGTCGC GCTGGTATCA CTTCTGTTCA GAAGGCTGCA AATGGTGCTT CGAGCAGGAG
CCTGAGCGCT ACGAGGGCCA CCTGTCGCTG ATCGATCGCT TCCTCGCAGG GTTGATCCAG
CCGATGGACC TGGGCGGAGG TCTCAAGTAC ATGGGCCTGG CACCCGGCGA GATCGGTGAT
GACGCCCATG GCTATGCCTG GCTCGATGCC TACCGCCAAG TGCCCAAGGC AGCCGCCTGA
 
Protein sequence
MALLERMDWY DLARTTNWTP TYVSEAELFP TEMSGDMGIP MSEWEKYDEP YKQTYSEYVK 
IQREKDSGAY SVKGALERSK MLENADPGWI SVIKAHYGAI ARAEYAAASA ESRMARFAKA
PGQRNMATMG MLDEIRHGQI QLFFPHEHVS KDRQFDWAFK AYDTNEWGAI AARHMFDDMM
NTRSAVAIGL MLTFAFETGF TNMQFLGLAA DAAEAGDWTF ASMISSVQTD ESRHAQIGGP
LVPILIANGK KAEAQRMIDV AFWRSWKLFT VLTGPMMDYY TPLAHRKQSF KEFMQEFIVT
QFERSILDLG LERPWYWDQF LAELDYQHHG MHLGVWFWRP TVWWNPAAGV TPEERAWLEE
KYPGWNDTWG KSWDVIVDNL LKDKRELTYP ETLPVVCNMC NLPINATPGD PWKVRDHSLE
RKSRWYHFCS EGCKWCFEQE PERYEGHLSL IDRFLAGLIQ PMDLGGGLKY MGLAPGEIGD
DAHGYAWLDA YRQVPKAAA