Gene Mpe_A2859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2859 
Symbol 
ID4785553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3046609 
End bp3047883 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content70% 
IMG OID640091430 
Productbacteriophytochrome-like protein 
Protein accessionYP_001022048 
Protein GI124268044 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCGA TTGACAACGA CCTCAGGCGT GCACTGTCCC GGATCGCGGA AGCCTCGGGC 
ATCGGACTCG GCGCGTTGCT GCTGCCGGTC GACGCGCGAG CTGCCTTCGA CGCTGCCGAG
GGTCTGCCCT GGCATGCCTT CGGGCTGTTG CTGGTCGGCG CGCTGCTGGC ACTGGGCGTC
CTGCTCGGTC TCCTGGCGTG GCAACAGCGC CAGCGGCGAG ACGCCGCTGC ACTGCGCGAC
GAGCGCGCCC GCGTGCGACG CCTGCTCGGC TTGATCGAGG GCCCGCTGTG GCGCACCGAC
GCACAGCACC GCCTGACCGC GCTGCGCGCG ACGGCGCTGC CTGCCGACCA TGCGCTGCAC
GCCGGCCGCG ACAGTCAGGC GTTCTGGCAG CTGTTCGACG AGACGTCGAT GCCCGGCCTG
CGCCAGGCTC TCGAGTCGGA AGCCGATTTC CAGGCCATCG AGGTCGGGCT CGCGGCGCCT
GGCGGCGGGC CGCTGCAACG CTGGCGATTG AAGGCATGCG TGTGCCTCGA CGCCTCGGGC
CGCTTCGCTG GCCACGAGGG GAGCGCGACG CTCCTGACGC CCCCACGCCC GTCGCCGGCG
GGTGCGGACG AGGATGCGGC GTCGTTCGGC ACGCTGGTGT CGCACGATCT GCGCGCACCG
ATCCGGGTGG TCGAGGGCTT CACGAAGATT GTCAAGGAGG ACTACGGTCG CCTGCTCGAT
CGCGTCGGCA ACGACCACCT TGACCGCGTA CTCGGGGCCG CGGCCCGCAT GAACGGCATG
ATCGACGCGC TGCTGGCGCT GTCGAGACTC GGCGCGCTGC CTCTGGCGCG CCAGACGGTC
GACCTGTCGC AACTCGCGAG CTACGTGATC GACGATCTGC AGCGCCAGGC GCCCGAACGC
CGCGTCGATG TCCGCATCGC GCCCGGCCTG CTGGTCACCG GCGATCCGAC CCTGTTGCGC
GTGGTGCTCG AGAACCTGCT GGGCAACGCT TGGAAATACA GCGCCCATCG CCACGTTGCC
CGGATCGAAC TGGGCCGCGT GGTCGAAGGC GAGCACAGTG CGTTTTGCGT GGCCGACAAC
GGCGCCGGTT TCGACATGCG CTTCATCGAT CGGCTGTTCG GCGTGTTTCA GCGACTGCAC
AGCAGCAGCG ATTTCGCCGG CACCGGGGTC GGCCTTGCGT CGGTGCGGCG CATCGTGCAA
CGCCATGGCG GCGAGATCTG GGCGGAGGCC GAGGTCGACC GCGGAGCCCG CTTCTACTTC
ACCCTGTCCA CCTGA
 
Protein sequence
MISIDNDLRR ALSRIAEASG IGLGALLLPV DARAAFDAAE GLPWHAFGLL LVGALLALGV 
LLGLLAWQQR QRRDAAALRD ERARVRRLLG LIEGPLWRTD AQHRLTALRA TALPADHALH
AGRDSQAFWQ LFDETSMPGL RQALESEADF QAIEVGLAAP GGGPLQRWRL KACVCLDASG
RFAGHEGSAT LLTPPRPSPA GADEDAASFG TLVSHDLRAP IRVVEGFTKI VKEDYGRLLD
RVGNDHLDRV LGAAARMNGM IDALLALSRL GALPLARQTV DLSQLASYVI DDLQRQAPER
RVDVRIAPGL LVTGDPTLLR VVLENLLGNA WKYSAHRHVA RIELGRVVEG EHSAFCVADN
GAGFDMRFID RLFGVFQRLH SSSDFAGTGV GLASVRRIVQ RHGGEIWAEA EVDRGARFYF
TLST