Gene Mpe_A3207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3207 
Symbol 
ID4786546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3409191 
End bp3410540 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content67% 
IMG OID640091780 
Productputative biotin carboxylase protein 
Protein accessionYP_001022395 
Protein GI124268391 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.301064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00428909 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTCAAGA AGATCCTGAT CGCCAACCGA GGCGAGATCG CCCTCCGCAT CCAGAGAGCC 
TGCCGCGAGC TCGGCGTGCG CGCCGTCATC GTCTACTCGG AGGCCGACCG CGACGCCAAG
TACGTGAAGC TCGCCGACGA GGCGGTCTGC ATCGGCCCGC CGGCCTCGGC GCAGAGCTAT
CTCAACATGC CGGCCATCAT CGCCGCCGCC GAGGTGACGG ATGCCGAGGC CATCCACCCC
GGCTACGGCT TCCTCAGTGA GAACGCCGAC TTCGCCGAGC GCGTGCAACA GAGCGGCTTC
ACCTTCATCG GCCCGACGCC GGAGTCGATC CGCGTGATGG GCGACAAGGT GGCTGCCAAG
CAGGCCATGA TCAAGTCGGG CGTGCCCACG GTGCCGGGCG CCGAGGGCGC GTTGCCGGAC
GACCCGAAGG AGATCATCCG CCAGGCGCGC GCGATCGGCT ACCCGGTCAT CATCAAGGCC
GCCGGTGGTG GCGGCGGACG CGGCATGCGG GTGGTGCACA CCGAGGCGGC GCTGATCCAC
GCGGTGCAGA CGACGCGCGC CGAGGCCGGC GCGGCCTTCG GCAACCCGAC CGTCTACATG
GAGAAGTTCC TCGAGAATCC GCGCCACATC GAGATCCAGG TGCTGGCCGA CACCCACCGC
AACGCGGTGT GGCTGGGCGA GCGCGACTGC TCGATGCAGC GTCGCCACCA GAAGATCATC
GAGGAAGCTC CGGCGCCCGG CATCCCGCGG CGCGTGATCG AGCGCATCGG CGAACGCTGC
GTCGCCGCCT GCAAGAAGAT CGGCTATCGG GGCGCCGGTA CCTTCGAGTT CCTGTACGAA
AACGGCGAGT TCTACTTCAT CGAGATGAAC ACCCGCGTGC AGGTCGAGCA CCCGGTGACC
GAGCTCGTGA CCGGCGTCGA CATCGTGCAG ATGCAGATCA AGATCGCCGC CGGCGAGAAG
CTTCCGTTCA CGCAACGCCA GATCGAGATG CGGGGCCACG CGATCGAGTG CCGCATCAAC
GCCGAGGACC CTTACAAGTT CACGCCGTCA CCGGGCCGCA TCACGATGTG GCATCCGCCG
GGCGGCCCCG GCGTGCGGGT CGATTCGCAC GCATACACCA ACTACTTCGT GCCGCCGAAC
TACGACTCGA TGATCGGCAA GATCATCACT CACGGCGACA CCCGCGAGCA GGCCTTGGCC
CGCATGCGCA CGGCGCTGCT GGAGACGGTG ATCGAAGGGA TCCAGACCAA CACGCCGCTG
CACCGCGAGT TGGTGGTCGA CGCGAAATTC GTCGAGGGCG GCACGAGCAT CCATTACCTC
GAAGGCTGGA TGGCCCAGCG CAAGCGCTGA
 
Protein sequence
MFKKILIANR GEIALRIQRA CRELGVRAVI VYSEADRDAK YVKLADEAVC IGPPASAQSY 
LNMPAIIAAA EVTDAEAIHP GYGFLSENAD FAERVQQSGF TFIGPTPESI RVMGDKVAAK
QAMIKSGVPT VPGAEGALPD DPKEIIRQAR AIGYPVIIKA AGGGGGRGMR VVHTEAALIH
AVQTTRAEAG AAFGNPTVYM EKFLENPRHI EIQVLADTHR NAVWLGERDC SMQRRHQKII
EEAPAPGIPR RVIERIGERC VAACKKIGYR GAGTFEFLYE NGEFYFIEMN TRVQVEHPVT
ELVTGVDIVQ MQIKIAAGEK LPFTQRQIEM RGHAIECRIN AEDPYKFTPS PGRITMWHPP
GGPGVRVDSH AYTNYFVPPN YDSMIGKIIT HGDTREQALA RMRTALLETV IEGIQTNTPL
HRELVVDAKF VEGGTSIHYL EGWMAQRKR