Gene Mpe_A0274 details


Gene Information

Locus tag: Mpe_A0274
Symbol:
ID: 4786943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 296634
End bp: 298979
Gene Length: 2346 bp
Protein Length: 781 aa
Translation table: 11
GC content: 71%
IMG OID: 640088826
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001019471
Protein GI: 124265467
COG category: [T] Signal transduction mechanisms
COG ID: [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation
TIGRFAM ID:
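
The coordinates and lengths in this record are mutually consistent, and the arithmetic can be sketched in a few lines of Python. This is an illustrative check, not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(296634, 298979)
print(length)                  # 2346
print(protein_length(length))  # 781
```

Under those assumptions, the result matches the listed gene length (2346 bp) and protein length (781 aa).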


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.276593
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00464659
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACCCCGG AACGCCTGCG CCGCCGCTTG GCGCTGCTGC TCACGCGGCA CTCGGACCGC 
CGAATCTCGC GCGGCGTCTG GGTCGTCGCG TTGGCCGCGA CCACGGCGGC GGCGCTGGTG
CTCACCTTCC TGCTGGCCAT CGCCACCAAC AACCGGGAGT TCTACGAGCG GTACTACAAC
GGTCTCCTGT GGGCCAACGT CGTGATCGCG ACCGTGCTGG TCGGTGTGAT CGTGGTCGGT
GCGGTGCGGC TCGCTGCGCG GGTGCTGCGC GGACGCTTCG GTTCGCGGCT GCTGCTGCGG
CTCGCGGCGA TCTTCGGCGT GGTGGCGGTG GTGCCCGGGG CGCTGGTCTA CACGGTGAGC
TACCAGTTCG TGTCTCGCAG CATCGAGAGC TGGTTCGACG TGCGCGTCGA GGGTGCGCTG
GAGGCCGGCC TGAACCTCGG GCGCGGCACG CTCGACACAC TGGTCAATGA TCTCGCCGGC
AAGACGCGCG TGGCCGCCGA GCGGCTCGGC GAGGCCCCCG AGCGACAGCA GCCGCTGGCC
CTGGAGCGCT TGCGGGAGCA ACTCACGGCG CAGGACGTGG CGGTGCTCGG CGCGGCCGGC
CAGGTGCTGT CGAGCGCCGG CGCCGCCGCG GTGCTGGCGC CCGAGCGACC GGGTCTGGCT
CTGCTGCGGC AGGCACGCAC GGCGCGCGTG ACCACGCAGA TCGAGGGGCT CGACGACGAC
GGCGCTGCCA CGCGCATCCC CGGCGTCTCG GTTGCCGACC AGCCTACGAG CCAGCCGCGC
GTGCGCGTCA TTGCCCTGAT CCCGGCCACC GGCTTCTCGC TCAGCCGCGA GGACCGCTAC
CTGGTGGTGA GCCAGCTGTT GCCGGCCACG CTGGCAGCCA ACGCGCTCGC GGTGCAGAGC
GCCTACCGCG AGTACCAGCA ACGCTCGCTG GCGCGCGAGG GCCTGCGCAA GATGTACATC
GGCACGCTGA CGCTGACACT GATCCTTGCC GTGTTCGGCG CGATGCTGCT GGCGGTGACC
TTCGGCAACC GGCTGGCGCG TCCTCTGCTG CTGCTCGCCG ACGGCGTGCG CGAGGTGGCC
CGCGGAGACC TGAGCCCGAA GCCGGTGTTC GCGTCGGGCG ACGAACTCGG CGGCCTGACG
CGCTCGTTCG CCGACATGAC CCAGCAACTC GCCGACGCCC GCGGCCTGGT CGAGCAGAGC
GTGGCCGAGC TCGAGACGGC GCGCGGCAAG CTGCAGGCCA TCCTCGACAA CATGAGCGCC
GGGGTGATCG TGTTCGATCG CGACCGTCGC ATCGACAGCG TGAACGCCGG TGCGACCCGC
ATCCTGCGCG TGCCCTTGTC GGCCCATCTC GGCCGCGGGC TCGCCGAAGT GCCCGGCCTC
GAGGGCTTCG ACGCGGCGCT GACCCGACGC TTCGACCAGA TCGAGAGCGG CTCCGAACTC
GGCGACCGAG ACCACTGGCA GGATTCCTTC GAACTCACGA CGGCCCCCGA CCGCGATCCG
CTGACCCTGC TGGTGCGCGG CGCGCCGCTG CCGGCCGGCG GCCAGGGCAG TGCCGCGCGG
CTGGTGGTGT TCGACGACAT CACCGAGCTC GTCTCGGCGC AGCGCGCCGA GGCCTGGGGC
GAGGTGGCGC GCCGGCTCGC GCACGAGATC AAGAACCCGC TCACGCCGAT CCAGCTGTCG
GCCGAGCGCA TCCAGCACCG GCTCGAAGCC AAGCTGGAGG TACCGGACCA GCAGATGCTG
GCCAAGTCTG TGAGCACCAT CGTCACCCAG GTGCAGGCGA TGAAGCAGCT GGTCAACGAG
TTCCGCGATT ACGCCCGCCT GCCGGCGGCC CAGCTGGCGC CGCTGGACCT CAACGCGCTG
GTGTCCGAGG TGCTGGGCCT GTACGCGGAG GTGCAGGAGT CGGGCCGCCT GCGCGTCGAG
CTGGCCGAGG GACTGCCGCG GATCCAGGGC GACGCCACAC AGTTGCGGCA GGTGGTGCAC
AACCTGGTGC AGAACGCGCT GGACGCGGTG GCCGAGCGGC CCGACGGCCA GGTCCGGCTG
CGCACCGAGG GGGCACTCAA CGAGCGCGGC GAGTGGCGCG CTGTGCGACT GGTCGTCGCC
GACAACGGCC CCGGCTTCAG CGAACGCATG CTCAAGCGTG CGTTCGAGCC CTATGTGACC
ACCAAGACCA AGGGCACAGG GCTCGGGCTT GCCGTGGTCA AGAAGATCGC CGACGAGCAT
GGCGCGCGCA TCCGCCTGGC CAACCTCGCG GGCGACGCGG TGGCCAGGGG CGACGCGCCG
TCTGTCGGGG GGGCACAAGT TTCGCTATCA TTTTCGAAAT GGCTGCCCCC TCGGGGTCAG
CCTTGA
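
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such text. The helper names are hypothetical, and the example runs only on the first line of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGACCCCGG AACGCCTGCG CCGCCGCTTG GCGCTGCTGC TCACGCGGCA CTCGGACCGC"
print(f"{gc_content(first_line):.0%}")  # 75% for this 60-base excerpt
```

Passing the complete gene sequence to `gc_content` should give a value close to the 71% listed in the record, assuming that figure was computed over the same bases.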
 
Protein sequence
MTPERLRRRL ALLLTRHSDR RISRGVWVVA LAATTAAALV LTFLLAIATN NREFYERYYN 
GLLWANVVIA TVLVGVIVVG AVRLAARVLR GRFGSRLLLR LAAIFGVVAV VPGALVYTVS
YQFVSRSIES WFDVRVEGAL EAGLNLGRGT LDTLVNDLAG KTRVAAERLG EAPERQQPLA
LERLREQLTA QDVAVLGAAG QVLSSAGAAA VLAPERPGLA LLRQARTARV TTQIEGLDDD
GAATRIPGVS VADQPTSQPR VRVIALIPAT GFSLSREDRY LVVSQLLPAT LAANALAVQS
AYREYQQRSL AREGLRKMYI GTLTLTLILA VFGAMLLAVT FGNRLARPLL LLADGVREVA
RGDLSPKPVF ASGDELGGLT RSFADMTQQL ADARGLVEQS VAELETARGK LQAILDNMSA
GVIVFDRDRR IDSVNAGATR ILRVPLSAHL GRGLAEVPGL EGFDAALTRR FDQIESGSEL
GDRDHWQDSF ELTTAPDRDP LTLLVRGAPL PAGGQGSAAR LVVFDDITEL VSAQRAEAWG
EVARRLAHEI KNPLTPIQLS AERIQHRLEA KLEVPDQQML AKSVSTIVTQ VQAMKQLVNE
FRDYARLPAA QLAPLDLNAL VSEVLGLYAE VQESGRLRVE LAEGLPRIQG DATQLRQVVH
NLVQNALDAV AERPDGQVRL RTEGALNERG EWRAVRLVVA DNGPGFSERM LKRAFEPYVT
TKTKGTGLGL AVVKKIADEH GARIRLANLA GDAVARGDAP SVGGAQVSLS FSKWLPPRGQ
P
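
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand from the standard amino acid assignments, which translation table 11 shares with the standard code for elongation (the tables differ in permitted start codons). The function and variable names are hypothetical, and only short excerpts of the gene are translated here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGACCCCGG AACGCCTGCG CCGCCGCTTG GCGCTGCTGC TCACGCGGCA CTCGGACCGC"
print(translate(first_line))  # MTPERLRRRLALLLTRHSDR
print(translate("CCTTGA"))    # P
```

The first call reproduces the first 20 residues of the protein sequence, and the second shows that the final six bases encode the last residue followed by a TGA stop codon.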