Gene Mpe_A2041 details


Gene Information

Locus tag: Mpe_A2041
Symbol:
ID: 4784618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2181985
End bp: 2183319
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 75%
IMG OID: 640090611
Product: GntR family transcriptional regulator
Protein accession: YP_001021234
Protein GI: 124267230
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
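
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; neither convention is stated in the record itself, but both are consistent with the listed values.

```python
# Values copied from the record above.
start_bp = 2181985
end_bp = 2183319

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1335 0 444
```

The result agrees with the Gene Length (1335 bp) and Protein Length (444 aa) fields.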


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCAGA CCCGGTACAA GCAGGTCGTC GACCGGTTCG CCGCTGAGAT CACGGGCGGC 
CGCCTGCCCC CCGGCACGCG CCTGCCCACG CACCGGCAGC TCGCCGCCGA CGAGGGCCTG
GCGCTGGTGA CGGCCAGCCG CGTCTATGCC GAGCTGGCGG CCATGGGCCT GGTGAGCGGC
GAGACCGGCC GCGGTACCTT CGTCAAGGAA ACGGCGGTTC CGCGCGGCCA GGGCGTGGAC
CAGCACGCGG TGGCGGCCAA CATGCTCGAC CTGAACTTCA ACTACCCGTC GCTGCCGGGC
CAGGCCGAAC TGTTGCGGAA CGCGCTGCGC CAGCTGGCCG CGGCCGGTGA CCTGGAGGCG
CTGCTGCGCT ACCAGCCCCA TGGCGGGCGC CCGCACGAGC GCGCCTCGGT GGCGCGGCAT
CTGGCGCGCC GCGGGCTCAC GCTGCCGGGC GAGCAGGTGA TGCTGGTCGA CGGCGCGCAG
CACGGCCTGG CCACCACCGT GATGGCGCTG CTGCAGCCCG GCGACGTGGT GGCGGTCGAC
GCCCTCACCT ACCCCGGCTT CAAGCTGCTC GCCGAGGCGC ACCGCCTGGA GCTGGTGGCG
GTGCCCGCCG GCACCGACGG CCCCGACCCG GACGCGCTGG CTGCGCTGTG CCAGCGCCGC
CGGGTCAAGG CCCTGTACGC CATGCCGACC ATGCACAACC CGCTGGGCTG GGTGATGAGC
GCCAGCCACC GCCGTGCCCT GGTGGCGGTG GCGCGGCGAC ACGGGCTGCT CGTCATCGAG
GACGCCGCCT ACGCCTTCCT CGTGGAGAAG GCGCCGCCGC CGCTGGCGGC GCTGGCACCG
GAGCGCACGG TCTACGTGTC GGGCTTCTCC AAGAGCGTGG CCACCGGCCT GCGCGTGGGC
TGCGTGGCCG CGCCGCCGCA GTGGGTGGGC GCGATCGAGC GCGCGATCCG CGCCACCACC
TGGAACACGC CGGGCGTGAT GACCGCCATC ACCTGCGGCT GGATCGACGA CGGCACGGTG
GACCGGCTGG AAGCCGGCAA GCGCCGGGAC GCACGCGCGC GCCAGCAACT CGCCGCGCAG
GTGCTGGGCG CGCTGCAGCG CGTGAGCCAC CCGGCGTCCT ACTTCGTCTG GCTGCCGCTC
GCCGAAGAGG TGCGCGCCGA CCAGGTGGCC GCGGCGCTGA TGCGCGAGCG CATCTCGGTG
TCCACAGCCG AACCCTTCGC CACGTCGGCC CGGGTGCCGC ACGCGATCCG GCTGGCGCTG
GGCTCGGTGG ACTTCGACAC GCTGCGCGAG GCGCTGGAGA AGGTGGCGGC GGTGATTGCC
GCGCGCACGT ACTGA
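
The GC content field can be recomputed from the gene sequence. The helper below is a minimal sketch: the name `gc_percent` is hypothetical, and it assumes the sequence is pasted in the space-grouped layout shown above, so whitespace is stripped before counting.

```python
def gc_percent(seq_text):
    """Return the percentage of G and C bases in a whitespace-grouped DNA string."""
    seq = "".join(seq_text.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First two 10-base groups of the gene sequence above.
print(gc_percent("ATGGCCCAGA CCCGGTACAA"))  # 60.0
```

Passing the complete gene sequence instead of the two-group excerpt gives the value to compare against the GC content field.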
 
Protein sequence
MAQTRYKQVV DRFAAEITGG RLPPGTRLPT HRQLAADEGL ALVTASRVYA ELAAMGLVSG 
ETGRGTFVKE TAVPRGQGVD QHAVAANMLD LNFNYPSLPG QAELLRNALR QLAAAGDLEA
LLRYQPHGGR PHERASVARH LARRGLTLPG EQVMLVDGAQ HGLATTVMAL LQPGDVVAVD
ALTYPGFKLL AEAHRLELVA VPAGTDGPDP DALAALCQRR RVKALYAMPT MHNPLGWVMS
ASHRRALVAV ARRHGLLVIE DAAYAFLVEK APPPLAALAP ERTVYVSGFS KSVATGLRVG
CVAAPPQWVG AIERAIRATT WNTPGVMTAI TCGWIDDGTV DRLEAGKRRD ARARQQLAAQ
VLGALQRVSH PASYFVWLPL AEEVRADQVA AALMRERISV STAEPFATSA RVPHAIRLAL
GSVDFDTLRE ALEKVAAVIA ARTY
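
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand; the name `translate` is hypothetical. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
# Codon table in the conventional TCAG ordering (standard amino acid assignments,
# which translation table 11 shares; alternative start codons are not handled).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna_text):
    """Translate a whitespace-grouped DNA string, stopping at the first stop codon."""
    dna = "".join(dna_text.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First and last lines of the gene sequence above (both start in frame).
print(translate("ATGGCCCAGA CCCGGTACAA GCAGGTCGTC"))  # MAQTRYKQVV
print(translate("GCGCGCACGT ACTGA"))                  # ARTY
```

Both outputs match the corresponding ends of the protein sequence listed above.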