Gene Mpe_A0176 details

Gene Information

Locus tag: Mpe_A0176
Symbol:
ID: 4784139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 190794
End bp: 191924
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 69%
IMG OID: 640088724
Product: hypothetical protein
Protein accession: YP_001019373
Protein GI: 124265369
COG category: [S] Function unknown
COG ID: [COG3268] Uncharacterized conserved protein
TIGRFAM ID:
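
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminating stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 190794
end_bp = 191924

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1131 bp
print(protein_length, "aa")  # 376 aa
```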


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.668245
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACA AGAAGCCCGT CGTCGTCTAC GGCGCCTCCG GCTACACCGG CCGCCTGATC 
TGCGAATACC TTCGCGAGTA CGGCATCCCG TTCATCGCCG CGGGTCGCAG CGCGGACAAG
CTGCAGACGG CGATGCAGTC CAACGTGCCG GGCATCGAGA CCGCGAGCTA CGAGATCGCC
GAGGTGCCGC ACAGCGTCGC GGCGCTGACC GAGCTGTTCC GCGGCGCGTC GGTGGTGCTG
AACACCGTCG GACCGTTCGC GAAGTTCGGC CCGGAGGTCG TCGAGGCCTG CCTGGCCGCC
CGCTGTCACT ACACCGACAC CACCGGCGAG CAGGACTGGC TGATCACGCT CGACGAACAG
TACGGCGCCC AGTTCGCCGC CGCCGGCCTG CTGCTGTCGC CCGGCCTGGC GCACATGTAC
ACCACCGGCG AGATCGCCGC GCAGCTGTGC CTCGAGACGC CGGGGCTCGA CACGCTCGAC
ATCGCCGTGT TCTGGGGCGG CAGCCCGACG ATCGCGTCGA CGCAGACCAT CCTGGTCAAT
GCTGCGACGT CCAAGGCCTA CTACCTCGAC CAGAACAGGT ACGTCGAGTG GCAGCCCGAC
GCGGGTCTCT ACAACGTGAC CATCCCGGGC CAGCACGAGG CCGCGCTGGC GCTGCCCTGG
GGCGGCACCT CGCATCCGGT GTGGTTCAAG CGCGACCCGC GTGTGGCCAC CGTCAAGGTG
CTGGGCGGCG TGTTCAACAA GCCGCTGATG CAGGGCGTGC CGCTGATCGT CGCGGCGGCG
CTGAAGGCGA CCGAGGGCAT GAACCCCGAG GAGCGCTACG CGGCGCTGGC CCAGACGGCC
GCCGGCGTGA TGAACACCAT GCCACCACGC GAGAACCCGC GTCTCAACAA GTCGGTCGAC
TCGGTCCACG CGTCCGGCCC GCTGGCGCGC GCGCACTGCG TGATCTTCGG CAACTGCAAC
TACAAGCAGA CCGGGTTGCT GCAGGCGTTT GCGGCGGCCT CGCTGCTGCA GCAGGCGCCC
CGGCGCGTCG GCTTCGCCTC CGGCTGCCAG GCCTTCGGAC ATCACGAACT GCTCGGCGCC
CTGCGCAGCT TCGGCCTGGT GCAGGCGCCG ATCCTGACCG TCCACCGCTA G
 
Protein sequence
MSNKKPVVVY GASGYTGRLI CEYLREYGIP FIAAGRSADK LQTAMQSNVP GIETASYEIA 
EVPHSVAALT ELFRGASVVL NTVGPFAKFG PEVVEACLAA RCHYTDTTGE QDWLITLDEQ
YGAQFAAAGL LLSPGLAHMY TTGEIAAQLC LETPGLDTLD IAVFWGGSPT IASTQTILVN
AATSKAYYLD QNRYVEWQPD AGLYNVTIPG QHEAALALPW GGTSHPVWFK RDPRVATVKV
LGGVFNKPLM QGVPLIVAAA LKATEGMNPE ERYAALAQTA AGVMNTMPPR ENPRLNKSVD
SVHASGPLAR AHCVIFGNCN YKQTGLLQAF AAASLLQQAP RRVGFASGCQ AFGHHELLGA
LRSFGLVQAP ILTVHR
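
The correspondence between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ only in which codons may act as start codons). The helper `translate` and the constant names are hypothetical, written for this example only; the input strings are the first three blocks and the final line of the gene sequence above.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG order.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (first three blocks of the first line).
first_blocks = "ATGAGCAACA AGAAGCCCGT CGTCGTCTAC"
# Final line of the gene sequence, which is in frame because each full
# line holds 60 bases.
last_line = "CTGCGCAGCT TCGGCCTGGT GCAGGCGCCG ATCCTGACCG TCCACCGCTA G"

print(translate(first_blocks))  # MSNKKPVVVY
print(translate(last_line))     # LRSFGLVQAPILTVHR
```

Both outputs match the first block and the last line of the protein sequence listed above, with the trailing TAG read as the stop codon.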