Gene Mpe_A2074 details

Gene Information

Locus tag: Mpe_A2074
Symbol:
ID: 4783653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2219615
End bp: 2220736
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 68%
IMG OID: 640090642
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_001021265
Protein GI: 124267261
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID: [TIGR00229] PAS domain S-box
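
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2219615, 2220736)
    print(length, "bp")                      # gene length from the coordinates
    print(protein_length_aa(length), "aa")   # protein length from the gene length
```

With the values in this record, the coordinates give 1122 bp and the derived protein length is 373 aa, matching the Gene Length and Protein Length fields.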


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.673281
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGAGC GGCGCGGCAT CGCCGTCGCG GCCGGCACGA CAAGCACGAC ACTGCCGGCT 
GCGGCGCACT GGAGCGAGGC CTTCGACCTG CTCGCCACCA TGGTGGCGGT GGTCCAGCCC
GACGGGCGCT GCCTGTTCGC CAACGCGGCG TTCGAGAACG TCCTGGGCCT GTCGCGCCGC
AACGTGCTGC GGCGCGACTG GTTCGAATGG TTCGTCGATC CCTCGCTGCT GCGGGACACC
GTCGCCGCGG TGGCCTGCAA CGAGTTCTCG ACCAGCCGCC TCGACGCCCA GCTCAAGCGG
CCCTTCGTGT CACCCGGTGA GCCGCTGCTC GTGCACGTGA TCGTCAACCA GATGGAGCAG
TCCGACGTCA CGCTGGTCGA GCTGATCGAG ATCGAGCAGC AGACCCGGCA GGACCGCGAG
GAGCGCGCGC TGGACCAGGT GCGCGCCACC AAGGAGCTGA TTCGCAACCT GGCGCACGAG
ATCAAGAACC CGCTGGGCGG CATCCGTGGC GCGGCGCAAC TGCTGCAGAT GGAGGTCGAG
TCCAAGGGCC TGAGCGAATA CGCCCAGGTC ATCATCCACG AGGCCGACCG CCTGCAGGCG
CTGGTCGACC GGCTGCTGGC GCCGCACCGC CGCCCGCACG TGGTGGGCGA CGTCAACATC
CACGAGGTGT GCGAGCGGGT GCGCGCCCTG ATCGTGGCCG AGTTTCCCCG CGGGCTCCGG
ATCGAACGCG ACTACGACAC ATCCATCCCC GACTTCCGCG GCGACCGCGA GCAGCTGATC
CAGGCGGTGC TGAACATCGC CCACAACGCC GCCCAGGCGC TGGGCGAGCG CATGGTGGCC
GGTGACGCCC GCATCGTGCT GCGCACGCGG ATCGCCCGGC AGGTCACGCT CGGCAAGCAG
CGCTTTCGTT TGGCACTGGA ATTGCATATT GAGGACAACG GTCCGGGCGT TCCCGAGGCC
ATCCGTGATC GCATCTTCTA CCCCCTGGTT TCTGGGCGCG ATGGCGGATC GGGCTTGGGG
CTCACACTGG CACAGACCTT CGTGGCGCAA CATCAGGGCA CGATCGAATG CGAGAGTGAA
CCGGGCAGGA CGCTGTTCAA AATCACGATG CCGTTACCCT GA
 
Protein sequence
MAERRGIAVA AGTTSTTLPA AAHWSEAFDL LATMVAVVQP DGRCLFANAA FENVLGLSRR 
NVLRRDWFEW FVDPSLLRDT VAAVACNEFS TSRLDAQLKR PFVSPGEPLL VHVIVNQMEQ
SDVTLVELIE IEQQTRQDRE ERALDQVRAT KELIRNLAHE IKNPLGGIRG AAQLLQMEVE
SKGLSEYAQV IIHEADRLQA LVDRLLAPHR RPHVVGDVNI HEVCERVRAL IVAEFPRGLR
IERDYDTSIP DFRGDREQLI QAVLNIAHNA AQALGERMVA GDARIVLRTR IARQVTLGKQ
RFRLALELHI EDNGPGVPEA IRDRIFYPLV SGRDGGSGLG LTLAQTFVAQ HQGTIECESE
PGRTLFKITM PLP
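
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG can serve as an initiator codon and is read as methionine. The following is a minimal sketch of how the two sequences relate, not the pipeline that produced this record. It builds the codon table from the compact amino-acid string of the NCBI genetic code (tables 1 and 11 share the same codon assignments), and it assumes the input is a complete CDS that starts at an initiator codon. The function names `translate_cds` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; the first codon is read as M (assumed initiator)."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if protein:
        protein[0] = "M"  # initiator codons such as GTG are read as methionine
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGGCTGAGC GGCGCGGCAT CGCCGTCGCG"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields MAERRGIAVA, the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.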