Gene Mpe_A2224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2224 
Symbol 
ID4785356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2378512 
End bp2379639 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content72% 
IMG OID640090792 
Productmolybdate metabolism transcriptional regulator 
Protein accessionYP_001021415 
Protein GI124267411 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG1910] Periplasmic molybdate-binding protein/domain
[COG2005] N-terminal domain of molybdenum-binding protein 
TIGRFAM ID[TIGR00637] ModE molybdate transport repressor domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAGCA ACAACAACGA CGACTCGCCT GCGCCCACCC GTTCGCCGCA GCGCGTGCAG 
CTCACCTATT CGCTCGGGAC CGACGGCACC GCCGGGCCGG TGCATCACCC GCTGTTCGCG
CTGCTCGACG CGCTGCACCG CGGCGGTTCG ATCTCGGCCG CGGCCACGGC GCTGGGGTTC
TCCTACCGGC ATGTCTGGGG TGAACTGCGG CGCTGGGAGA CCGAACTGGG CCGCTCGCTG
ATCATCTGGA ACAAGGGCCA GCGCGCGGTG CTCACGTCCT TCGGCGACAA GCTGCTGTGG
GCCGAGCGTC GGGCGCAGGC GCGGCTCGCG CCGCAGATCG AGTCGCTGCG CATGGAGCTG
GAGCGCGCCT TCGCGGATGC CTTCGACGAC CGCGTCGACG TGCTCAGCGT CTGCGCCAGC
CATGACCAGG CGTTGCCGCT GCTGCGAGAA CTGGCGCTGG CGGAACAGCT GCACCTCGAC
ATCGAGTTCG CCGGCAGCCT CGACGCGTTG CACACCCTCG ACGCCGGCGG CTGCCTGCTC
GCCGGCTTCC ACGTGCTGGA CGGCGTGGCG CGCGGCTCGG TCAGTGCACG CACCTACCGC
GCACGGCTGA AGCCGGGCCA CCACAAGCTG ATCGGCTTCG CGCAGCGCGT TCAGGGCGTG
ATGACGGCGC CCGGCAACCC GCTGAAGGTG GGGTCGCTGC ACGACCTGTC GCGGCCCGGT
CTGCGCTGGG TCGGGCGCCC CGAGGGCACC GGCACGCGGG TGCTGCTGGA GGAACTGATC
GAACAGGCCG GCCTGAAGAT GCCGGAGGCC TTCGCGCTGA TCGAGCCGTC GCACGGCGCG
GCCGCGCAGG CCGTGGCCAG CGGCGCGGCC GACGCGGCCT TCGGGCTGGA GGCCGCGGCG
CGCGCCGCCG GACTGGGCTT CGTGCCGCTG GCCCGCGAGC GCTACTTCCT CGTGACGCTG
AAGTCCACGC TGGAGCAGCC AGCGGTGCAG CGCCTGGTGA GCCTGCTGGG CTCCACGACC
TGGGCCCGCA CGCTGGCCGG CCTGCCCGGC TACCGCGCCA CCGAGCCCGG CGCGGTGCTG
GCATTGACGA AGGTACTGCC GTGGTGGAGC TACCGCAGCA AGCACTGA
 
Protein sequence
MHSNNNDDSP APTRSPQRVQ LTYSLGTDGT AGPVHHPLFA LLDALHRGGS ISAAATALGF 
SYRHVWGELR RWETELGRSL IIWNKGQRAV LTSFGDKLLW AERRAQARLA PQIESLRMEL
ERAFADAFDD RVDVLSVCAS HDQALPLLRE LALAEQLHLD IEFAGSLDAL HTLDAGGCLL
AGFHVLDGVA RGSVSARTYR ARLKPGHHKL IGFAQRVQGV MTAPGNPLKV GSLHDLSRPG
LRWVGRPEGT GTRVLLEELI EQAGLKMPEA FALIEPSHGA AAQAVASGAA DAAFGLEAAA
RAAGLGFVPL ARERYFLVTL KSTLEQPAVQ RLVSLLGSTT WARTLAGLPG YRATEPGAVL
ALTKVLPWWS YRSKH