Gene Mpe_B0545 details

Gene Information

Locus tag: Mpe_B0545
Symbol:
ID: 4787388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008826
Strand:
Start bp: 495214
End bp: 496236
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 65%
IMG OID: 640092972
Product: carbonic anhydrase
Protein accession: YP_001023550
Protein GI: 124263080
COG category: [R] General function prediction only
COG ID: [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily
TIGRFAM ID:
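
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 495214
END_BP = 496236
GENE_LENGTH_BP = 1023
PROTEIN_LENGTH_AA = 340


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under those assumptions, both reported lengths agree with the coordinates: 496236 - 495214 + 1 = 1023 bp, and 1023 / 3 - 1 = 340 aa.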


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.421322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00175013
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTCTTG GGCTCAACCT GATTGCGTAC CAGGGTACGC TCCCCGCGTT CGCCTCTCAG 
CCCGCGGCGG CGCTCCCTGG TCACGCCGTC GTAGGCCGCG TCTCGCTGGG TCGCAACGCA
TGGCTGGGCG CTGGCGCGGT CATCCGTGCC GACGGACACT TCGTGCGGAT CGGTGACGAC
TTGCACATGG GCCGAGGCGC GACAATCCAT ATCGCCCACG AGGTCTATCC AACCTTGGTT
GGCGCTAGGG TGTCCATCGG CGCTGATGCG GTCGTGCATG CCTGCACCGT GGGTAACGAT
GTCGTCGTCG AGCGGGGATC CGTGATCCTT GACGGTGCGA AGGTCGGTGA CGGTGCCGTC
GTCGAGGCTG GGAGCATCGT CTATCCGCGA AGCACGCTGG AGCCTGGCAT GTTGTACGCG
GGTCGGCCCG CCAAGCTGTT ACGGGCGCTT GGCCTGCATG AAGTGCAGAG CCGAGCCGAA
CTGCAGCGAG CGCGCAATGA GGCATGCGAC GTCCGCTGGA CGTGCCAACC GATTCCGACC
GGAGCTGCGC CCGATTCATT CGTCGCCGGC ACCTGCGATC TCTCGGGCAG CGTTCACTTG
GCCGAGGGCG CCAGCGTGTG GTTTGGCTGC CGTCTGGATG GTCGCGAGGG GCCGATTTCG
ATCGGCAGAC TCTGCAACGT GCAGGACAAC TCGGTGCTTC GGGCCGGATC CCTGGGGATG
TCGTTGGGCG ATCAAACCAC GGTCGGACAC AACGTCCAGC TGGTGGACTG CAGCGTCGGT
TCCCGCTGCC TCGTCGGCAT CGGCAGTAGC ATCGCACCCG GCACGCGCAT CGATGACGAC
ACCTTCGTCG CTGGCGGCAG CGTCACCGAA CCCGGCCAGC ACCTGACGGG CGGACGGGTT
TGGGGCGGCG ATCCTGCACG GCCTATCGGC GAGATGAACG AGGCCAAGCG GACGGCGATC
TCGAACATCG CGATCGTTTA CGAAAGTTAT GCGCGAGCGT TGCATGCCAG CGTACTAGGC
TGA
 
Protein sequence
MTLGLNLIAY QGTLPAFASQ PAAALPGHAV VGRVSLGRNA WLGAGAVIRA DGHFVRIGDD 
LHMGRGATIH IAHEVYPTLV GARVSIGADA VVHACTVGND VVVERGSVIL DGAKVGDGAV
VEAGSIVYPR STLEPGMLYA GRPAKLLRAL GLHEVQSRAE LQRARNEACD VRWTCQPIPT
GAAPDSFVAG TCDLSGSVHL AEGASVWFGC RLDGREGPIS IGRLCNVQDN SVLRAGSLGM
SLGDQTTVGH NVQLVDCSVG SRCLVGIGSS IAPGTRIDDD TFVAGGSVTE PGQHLTGGRV
WGGDPARPIG EMNEAKRTAI SNIAIVYESY ARALHASVLG
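
The relationship between the gene sequence, the protein sequence, and the reported GC content can be sketched in a few lines of Python. This is an illustrative sketch, not the method used to produce the record. It assumes that the sequences are pasted in with their 10-character spacing intact, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order (first base slowest, third base fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
prefix = "ATGACTCTTG GGCTCAACCT GATTGCGTAC"
print(translate(prefix))  # MTLGLNLIAY
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content of 65%.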