Gene Mpe_A1030 details


Gene Information

Locus tag: Mpe_A1030
Symbol:
ID: 4785633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1100292
End bp: 1101413
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 74%
IMG OID: 640089593
Product: hypothetical protein
Protein accession: YP_001020227
Protein GI: 124266223
COG category: [R] General function prediction only
COG ID: [COG0579] Predicted dehydrogenase
TIGRFAM ID:
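
The coordinates and lengths listed in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1100292
end_bp = 1101413
gene_length_bp = 1122
protein_length_aa = 373


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(start_bp, end_bp) == gene_length_bp)
    print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, both checks agree with the reported Gene Length and Protein Length.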


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.950675
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCAGG TCGATGCGGT GGTTGTCGGG GCCGGGGTCG TGGGGCTGGC GGTCGGTCGC 
GCGCTGGCGC GGCGCGGCTT CGAGACCGTG GTGCTTGAAA GCGAGACCGC GATCGGCACC
GTCACCAGCG CGCGCAACAG CGAGGTGATC CATGCCGGCC TCTACTACCC CTCCGGTTCG
CTGAAGGCGC GGCTGTGCGT GGCCGGCAAG GCGGCGCTCT ACGCCTACTG CGCCGAGCGC
GGCATCGCGC ACCGGCGCTG CGGCAAGCTG ATCGTCGCGA CCGGGCCGAC GCAGCATGCG
GCGCTGCACG CGCTGAGCCG GCGTGCGGCC GACAACGGCG TCGACGACCT GCAACTCCTG
ACGCCGGACG CTGCGCGTGC GCTCGAACCC GCGCTCGCCT GCAGCGAGGC ACTGCTGTCG
CCGTCGACCG GCATCGTCGA CAGCCACGGC CTGATGCTCG CGCTGCAGGG CGACCTGGAG
TCGGCCGGCG GCGCGGTGGC GCTGGCCTCG CGCGTCGAGC GCATCGAGGT CGGGCGACCG
CACCGTGTGC AGGCAGCCGG CATGACGCTG GGCGCGCGCA TCGTCGTCAA TGCCGCCGGG
CTGTGGGCGC CGGCGCTCGC ACGGCGCACC GAGGGGCTGG CGCCGGCCTT CCAGCCGCCG
GGCCGGTTCG CGAAGGGCAG CTACTTCGCG TTGCCGGGCC GGGCGCCGTT CTCGCATCTC
ATCTACCCGA TGCCGGAGGT GGCCGGCCTC GGCGTCCACC TGACGCTCGA TCTCGGCGGC
CAGGCGCGCT TCGGGCCCGA TGTGGAATGG GTCGAGCCCG GTCCCGCCGC CGCGGGCGGT
GACGGCACGC TCGACTACCG CGTCGACGTT CGGCGCGCCG ATGGCTTCTA TGCGGAGATC
CGCCGCTACT GGCCGGCGCT TCCCGACGGC GCGCTGCAGC CGGCCTACAG CGGCGTGCGA
CCCAAGCTGT CGGGCCCGGG CGAGCCGGCG GCCGACTTCC GCATCGACGG CCCGGCCGAG
CACGGCATCG AGGGCCTGGT GAACCTGCTC GGCATCGAGT CGCCGGGCCT GACGGCCAGC
CTCGCGCTGG CCGACGAGAC GCTGCGGCGC CTGGCTGCGT GA
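
The GC content reported in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming that GC content means the percentage of G and C bases over the whole gene sequence; the record does not say how the value is computed or rounded, and the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full sequence to
    # compare against the reported GC content.
    first_line = "ATGGATCAGG TCGATGCGGT GGTTGTCGGG GCCGGGGTCG TGGGGCTGGC GGTCGGTCGC"
    print(f"{gc_percent(first_line):.1f}%")
```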
 
Protein sequence
MDQVDAVVVG AGVVGLAVGR ALARRGFETV VLESETAIGT VTSARNSEVI HAGLYYPSGS 
LKARLCVAGK AALYAYCAER GIAHRRCGKL IVATGPTQHA ALHALSRRAA DNGVDDLQLL
TPDAARALEP ALACSEALLS PSTGIVDSHG LMLALQGDLE SAGGAVALAS RVERIEVGRP
HRVQAAGMTL GARIVVNAAG LWAPALARRT EGLAPAFQPP GRFAKGSYFA LPGRAPFSHL
IYPMPEVAGL GVHLTLDLGG QARFGPDVEW VEPGPAAAGG DGTLDYRVDV RRADGFYAEI
RRYWPALPDG ALQPAYSGVR PKLSGPGEPA ADFRIDGPAE HGIEGLVNLL GIESPGLTAS
LALADETLRR LAA
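
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, assuming the amino acid assignments of translation table 11 (the value given in the Gene Information section), which match those of the standard genetic code; alternative start codons are not handled, and the function and variable names are illustrative.

```python
from itertools import product

# Amino acid assignments in TCAG codon order. Translation table 11 uses the
# same assignments as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGGATCAGG TCGATGCGGT GGTTGTCGGG"))
```

Applied to the first 30 bases of the gene sequence, this yields MDQVDAVVVG, the first block of the protein sequence above.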