Gene Mpe_A3217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3217 
SymbolpilS 
ID4786556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3421858 
End bp3423786 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content71% 
IMG OID640091790 
Productsignal transduction histidine kinase 
Protein accessionYP_001022405 
Protein GI124268401 
COG category[T] Signal transduction mechanisms 
COG ID[COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.478958 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGCA CGACCGGATC GGGTCGCCGT CGCGCCCCTG CGGATGGCGG GGCGTCGTGG 
TTCGGGTCGA TCGATCTCGG CCTGCCCGGT GACGACGATG CGGCGCAGTC GCGCTTCGAC
CGTGGCGAGT GGCGCGAGAC CGATGCTGCC GATTCGCGCT TCCTGTCGCG TCAGGCGCGG
CGGATCGCCG GGTCGGGCCA GAGCGCGGTG TACCGGCTCT ACCGTGCCTT CGTGGCCTCG
CGCGCGGTGC TCGGTCTCGC GCTGCTGGCC ACCGAGGTGG CGATCACCTG GCTGAGTCCG
CGGCCACAGC GAGAGCTGGT GCTGTCGCTG TGTTCGCTTT ACGCGATGGC CGCGCTCCTG
TTGTGGGTGC TGCCCAACCT GCTGGGTCCG ATGCAGCCGG CGCTGCAGTC GCGTCTGCGA
CGCCGCCACT GGATGGCGAC GATCGGGGTC GACATGTTGA GCTTCGGCGT CCTGCATCTG
CTGGCCGGCG GCGGCGTGCT GAACTACTCG GCGCTGCTCG TGCTGCCGGT GCTGATGGCC
GGTGTGCTGA CGCCGCGTCT GCAGGCGTTG GGAGTGGCCG CCGGGTGCAC CTTGATCCTG
CTTGCTGCCG CGGGGCTCAA TGTGGACGTG ACCGGCGAGG CAACGTTGCA GCTCACGCAG
GCGGGCCTGG CCGGCGTCGG CCTGTTCGTG GTCAGCCTGA TGGCGGGCGA GTTGTCCGGT
CGGCTGGCGC GCGAGGAGCT GACGGCGCGC GGCAGCCTCG AATTGGCGCG CCAGCAGGCG
CAACTCAATC GCCTCGTCCT CGAGGAAATG CAGGACGGCG TGATGGTCGT CGACCGCCGT
GGCCGGGTGC GCGCCGCCAA CCCGGCGGCG CGCCATCTGC TCGACGAGCC GCTGATCAGC
GCCGCCGACA GCTTCTCGCT GACCGGTGTG CAGGCCTGGG AGCCCCTGAT CAGCGCCGCC
GACAGCTTCT CGCTGACCGG TGTGCAGGCC TGGGAGCCCC TGGTCAGGGC CGTGGACCGG
GCCTTCGGGG AGGGCCACTG GCCCGAGGGG GGGCGCGACG TGGTGTTGCC CCGTGTGGCC
TCCAGCGATA CGGGGCCACG ACAGCTGCGC CTGCGCGTGC GCTTCACGCG TCGCCGCGAG
ACGGGGGCGC CGGAGGACTA CTGCGTGCTC TTCCTGGAGG ACCTGCGCAC GGTGCAGGCG
CGCGTGCGCC AGGAAAAGCT GGCGGCGATG GGTCGCGTGT CGGCCGGCAT TGCGCACGAG
ATCCGCAATC CGCTGGCGGC GATCATGCAA GCCAATGCGC TGCTGGCCGA AGACGCCAGC
AGCGCGCAGC AGGTGCAGCT CACGCGCATG GTGGGTGAGA ACGCCGAGCG CCTGAAGCGC
ATCGTCGACG ACGTGATGGA GGTTGCACCG AGCCTGCTGC CCGAGCCGGC GCCGCTCGAC
GCGAGTCTGC AGGTCGCCAC CATCTGCGGC GAGTGGGCCC GCACCGCGGG CCTGGCGATC
GGTGCCGACA GCGTGCTGCG GGTCGACCTG CCGAGCGAGC CGCTCGGCGT GGTGTTCGAT
GGCGAGCACC TGCGCCGCGT GCTGGTGAAT CTGCTCGACA ACGCGCTGCG GCACGGAAGC
CGGACGCCCG GTGCGGTCCA ACTGCGGCTC GCTGCGGCGA GCGAAAGCCG AGCCCTGCTC
ACGGTGGGAA GCGACGGCGA GCAGATCGCG CCGGAGGTCG AGCGTTACCT GTTCGAGCCC
TTCTTCTCGA CGCGCAGCCG CGGCACCGGA CTGGGACTGT ATATTTGTCG TGAGCTGTGC
GAGCGCTACG GCGCCAGCAT CGAATTCAGC TCACGCGGCG CGCCGGAGCG CCACCGCAAC
GTGTTCTCGG TCGCCATGCG GCGCACGCTG CTTCCGGACG GCGATTCCCG GCTTCACTTC
AGCGCATGA
 
Protein sequence
MAGTTGSGRR RAPADGGASW FGSIDLGLPG DDDAAQSRFD RGEWRETDAA DSRFLSRQAR 
RIAGSGQSAV YRLYRAFVAS RAVLGLALLA TEVAITWLSP RPQRELVLSL CSLYAMAALL
LWVLPNLLGP MQPALQSRLR RRHWMATIGV DMLSFGVLHL LAGGGVLNYS ALLVLPVLMA
GVLTPRLQAL GVAAGCTLIL LAAAGLNVDV TGEATLQLTQ AGLAGVGLFV VSLMAGELSG
RLAREELTAR GSLELARQQA QLNRLVLEEM QDGVMVVDRR GRVRAANPAA RHLLDEPLIS
AADSFSLTGV QAWEPLISAA DSFSLTGVQA WEPLVRAVDR AFGEGHWPEG GRDVVLPRVA
SSDTGPRQLR LRVRFTRRRE TGAPEDYCVL FLEDLRTVQA RVRQEKLAAM GRVSAGIAHE
IRNPLAAIMQ ANALLAEDAS SAQQVQLTRM VGENAERLKR IVDDVMEVAP SLLPEPAPLD
ASLQVATICG EWARTAGLAI GADSVLRVDL PSEPLGVVFD GEHLRRVLVN LLDNALRHGS
RTPGAVQLRL AAASESRALL TVGSDGEQIA PEVERYLFEP FFSTRSRGTG LGLYICRELC
ERYGASIEFS SRGAPERHRN VFSVAMRRTL LPDGDSRLHF SA