Gene Mpe_A1687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1687 
Symbol 
ID4785479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1811968 
End bp1813899 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content74% 
IMG OID640090260 
Productsignal transduction histidine kinase, nitrate/nitrite-specific, NarQ 
Protein accessionYP_001020884 
Protein GI124266880 
COG category[T] Signal transduction mechanisms 
COG ID[COG3850] Signal transduction histidine kinase, nitrate/nitrite-specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.461201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.659068 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCG CCTTCCTGCG CGCGAGCCGT CGCCTGCAGC GCGCCTCGCT GGCGGCCAAG 
CTGGGGGCCC TGGGCTCGCT GATCATGGTC GTGGCCCTGG GCTCCATCGC CTTCACCGTG
TGGGTCACCT GGCAGCTCGA GGGCGGCGCC GCCGCAGTCA ACGAGGCCGG CCGCCTGCGC
ATGCAGACCT GGCGTTATGC CCAGGTGGCG TCCGCGCAGG ACCTGCCGCA GCTGCCGGCC
CTGGCCCAGC AGTACGACGC CACCCTGGCC CTGCTGGACG CCGGCGACCC GGCCCGCCCC
CTCTTTGTGC CGCGAGACGA GGACACCCGC GCAGCCTTGC AGGCCGTGCG CAGCCGCTGG
CAGGCGCTGC AGCGCGCCAT CTCCGAGGGC GCCACGCCGC TGCAGGCGGG CCAGCTGGCG
GGCGACCTGG TGATCGAGAT CGACCACCTC GTCACCGGCA TCGAGGCGCA CCTTCAGCGC
TGGACGTCGA TCCTGAGCGC CCTGCAGTTC ACCCTGCTGG CCCTGGCGCT GGCCGCCGGC
GTGGCGCTGC TTTATGCCGC GCACCTGTTC ATCTTCCAGC CGCTGGCCCG CCTGCAGCGC
GGGCTGGAGG CGGTGGCGCG CGGCGACCTG GGCGCGCGTG TGGACGTCGA GACCCACGAC
GAGTTCGGCG AGGTGGCCGA CGGTTTCAAC CACATGGCCA TGCGCCTCGA AGACCTGTAC
CGCGGGCTCG AGACCAAGGT CGCGGAAAAG ACCGAACACC TGCGCACCGA GCGCGAGCGC
CTGGCGGCCC TGTACGAGGC CAGCCGGCGG GTGGCCCAGG CCGCCAGCCT GGACGAGCTG
GCGCAGGGCT TCGTGCGCCA GTTCCGCGAG GTGGCCCGGG CCGACGCCGC GCTGCTGCGC
TGGCACGACG CCGAACAGGG GCGCATGCTG ATGCTGGCCG CCGACCGCGT GCCTGCGGCC
ATGCTGGAAC ACGAACGCTG CCTCGCCCCT GGCGACTGCC ACTGCGGCAA CGGCGCCGCC
CCGGGCGGCC CGCGCACCAT CACCATCCAC GCCCTGACGG ACGGCGACAC CGGCGACAGC
GCCGGCACCT GCCGCCGCTT CGGCTTCCAG CGCGTGCTAA CCTTGCCCGT GCTGCTGCAG
GACCAGACCC TGGGCGAGGT CGACCTGCTG TGGCGCGACG CCACGCGCCC CGTGCTCGAC
GACGACCGCG CGCTGCTCGA GGGCCTGGCC GCCCAGCTGG CGGGCGGCAT CGAGAGCGTG
CGCGCGCAGG CCCTGCAGCG CGAGGCGGCT GTCTCCGAGG AGCGCGGCTT CATCGCCCGG
GAACTGCACG ACTCCATCGC CCAGGCGCTG GCCTTCATGA AGATCCAGCT GCAGATGCTG
CGCGGCGCCC TGCGCCGCGG CGACCCGGCG GCCACCACGC GGGTGGTCGA CGAGCTCGAT
GCCGGCGTGC GCGAGAGCCT GGCCGACGTG CGCGAGCTGC TGCTGCACTT CCGCACCCGC
AGCGACGGCG CCGAGCTGGC GGCGGCCCTG CGCGCCACGC TGCAGAAGTT CCAGCACCAG
ACCGGCCTGC CCGCGCAGCT GGACGTGCGC GGCCCCGTGC TGCCACTGGC GCCCGACGTG
CAGGTGCAGC TGCTGCACGT GGTGCAGGAG GCCCTGTCCA ACGTCCGCAA GCATGCCCAG
GCGCGGCAGG TGTGGGTGAC GCTGGACCCG CGCCCCGAGC TCAGCGTGAC CGTGCGCGAT
GACGGCCGCG GCTTCGACCT GGCGGCCCGC AACAGCGACG ATGGCGACGA CGGCCACGTG
GGCCTGCGCA TCATGCGCGA GCGCGCGGCC GGCATCGGGG CCGAGGTCGA GATCACCTCG
ACACGCGGCG AGGGCACCCA GGTGCGCATC GCGGTGCCCT CGCGCCCGGC CTCCGTGGCC
CTGGCGGCCT AG
 
Protein sequence
MNAAFLRASR RLQRASLAAK LGALGSLIMV VALGSIAFTV WVTWQLEGGA AAVNEAGRLR 
MQTWRYAQVA SAQDLPQLPA LAQQYDATLA LLDAGDPARP LFVPRDEDTR AALQAVRSRW
QALQRAISEG ATPLQAGQLA GDLVIEIDHL VTGIEAHLQR WTSILSALQF TLLALALAAG
VALLYAAHLF IFQPLARLQR GLEAVARGDL GARVDVETHD EFGEVADGFN HMAMRLEDLY
RGLETKVAEK TEHLRTERER LAALYEASRR VAQAASLDEL AQGFVRQFRE VARADAALLR
WHDAEQGRML MLAADRVPAA MLEHERCLAP GDCHCGNGAA PGGPRTITIH ALTDGDTGDS
AGTCRRFGFQ RVLTLPVLLQ DQTLGEVDLL WRDATRPVLD DDRALLEGLA AQLAGGIESV
RAQALQREAA VSEERGFIAR ELHDSIAQAL AFMKIQLQML RGALRRGDPA ATTRVVDELD
AGVRESLADV RELLLHFRTR SDGAELAAAL RATLQKFQHQ TGLPAQLDVR GPVLPLAPDV
QVQLLHVVQE ALSNVRKHAQ ARQVWVTLDP RPELSVTVRD DGRGFDLAAR NSDDGDDGHV
GLRIMRERAA GIGAEVEITS TRGEGTQVRI AVPSRPASVA LAA