Gene Mpe_B0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_B0104 
Symbol 
ID4787707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008826 
Strand
Start bp96307 
End bp97728 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content59% 
IMG OID640092513 
Producthypothetical protein 
Protein accessionYP_001023118 
Protein GI124262648 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000595624 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCTCG AACACATCCC GGAACCACGA CTACGGTTCG CGTCCGGCGA ACACATCTGC 
CCGCGCAGGG GCATCGCAGC TTACGGCGTG TTCGATCGGA GCATGGACTC CCGCCGCACT
GACGTCATGA TTGGCGGGGT AGGCACCGCT ACATGCATCG AAGCGCTCGG CCGGTGGGTT
GAGCGCTGCA GTTCGGAGAT TCCTGCGCCG GAGACGGCGA AGCAACCGAA CCTGCGGGTG
CCGTTCCCCG GGGTCGGCCG CGGCCACGCT TTCGATGCCA AGCTGGTCTT TGGCAGTGAC
CTCGCTCGGA CACTGAAGAA AAGCGAGGTC GACGAAATCG TCGCCATTGG CGACCGAACC
ACCAGGCTGT CGAAGGCTAT TGACCTTTAC TACGAGCACA TAAAGTTCCT TGCGCAGAAT
CGGCAGATCG ACGCCGTGGT CTGCGTGATT CCCGATGCTC TCTACAAGGT GGTAGCGACG
GAGGAGTCGA ATCCGCTCGA AGAGACACTT GATGCAAGTG TCGAGGTGGC GTCGGAGCTG
AACTTCCGGC GCGCACTGAA GGCCAAGGCC ATGCACTTGG GCAAGCCGTT GCAGCTCATA
CGAGCCTTCT CGCTTGAGAG CAACAAGAAG GGACAGCAGG ACGATGCCAC TAAGGCATGG
AACTTCTGCA CGGCGCTCTA CTACAAGGCT GGGCCACGCG TTCCATGGAA GTTGTCAGCC
GACGACAGGC GACCTTCATC TTGCGCGGTC GGGATTGCGT TCTATCGCAG TAGAGATCGA
CAGGTGCTCA ACACCAGCTT GGCGCAGATC TTCGATGAGT TGGGCAACGG TCTGATCCTT
CGTGGCACCC CGATCGACAT GACTCGGGAT GACCGAGTTC CCCACCTCAA TGCCCAGCAG
GCCTACGACC TTCTAACTGC CGCACTCAAC GAATACAGAG TCGCGTTGCG CAACTTTCCA
GCGAGGATCG TGGTCCATAA GTCGTCGAAC TTCTCAGCGG AAGAGATCGA CGGCCTCAGC
GAGGCGGCCT CCGACCTGAG GATTGATACC GTTGATTTGG TCACCGTGAT GGACTCGAGG
TTGCGTCTCT TTCGGGAGGG AAACTATCCT CCGTATCGCG GGACAAGGAT TGAGATGGAC
GACCGCCGCC ACGTCCTGTA TTCCCGGGGC TCGGTTTGGT ACTACAAGAC CTACACCGGG
CTCTACATCC CCGAGCCTAT TGAGTTGCGA ATCGTGCGGT CCGAGGAGTC TCCGTCGTTC
ATCGCTCGCG AGATTCTGGG ACTGACCAAA ATGAACTGGA ACAACACGCA GTTCGATGGA
AAGTACCCTG TTACGCTCGG ATGCGCGAGA AAGGTCGGCG AGATCATGAA GTACCTGAGC
GACCGGGACG ATCCGCAGAT TCGCTACGGC TTCTACATGT GA
 
Protein sequence
MKLEHIPEPR LRFASGEHIC PRRGIAAYGV FDRSMDSRRT DVMIGGVGTA TCIEALGRWV 
ERCSSEIPAP ETAKQPNLRV PFPGVGRGHA FDAKLVFGSD LARTLKKSEV DEIVAIGDRT
TRLSKAIDLY YEHIKFLAQN RQIDAVVCVI PDALYKVVAT EESNPLEETL DASVEVASEL
NFRRALKAKA MHLGKPLQLI RAFSLESNKK GQQDDATKAW NFCTALYYKA GPRVPWKLSA
DDRRPSSCAV GIAFYRSRDR QVLNTSLAQI FDELGNGLIL RGTPIDMTRD DRVPHLNAQQ
AYDLLTAALN EYRVALRNFP ARIVVHKSSN FSAEEIDGLS EAASDLRIDT VDLVTVMDSR
LRLFREGNYP PYRGTRIEMD DRRHVLYSRG SVWYYKTYTG LYIPEPIELR IVRSEESPSF
IAREILGLTK MNWNNTQFDG KYPVTLGCAR KVGEIMKYLS DRDDPQIRYG FYM