Gene Mpe_A3279 details


Gene Information

Locus tag: Mpe_A3279
Symbol:
ID: 4786498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3487777
End bp: 3488754
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 72%
IMG OID: 640091852
Product: AraC family transcriptional regulator
Protein accession: YP_001022467
Protein GI: 124268463
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
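
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values, but the record does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3487777, 3488754)
print(length)                  # 978
print(protein_length(length))  # 325
```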


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.853616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCACG ACCGTGCGTT CTCCACCCTC CACCTGCCGC CGGGCCAGCG CGTCGCGCGC 
TGGACGGAGG CCGCCTCGGA CCGCTTCGTC GAATCCCGCT TCAAGGTGCA GGACCCGGAC
CGCTTCGTCG CCTCGATGCT GCACCGCGAC CTCGCCGAAC TGTCGGTGAC CCGCATCACC
TCGGTCGGCC ACGGCTTCAA GCACATCACC CGCTCGCAGC GCCAGGTGGC CCGCGCGCAC
GAGGACTTCT TCCTGGTCAG CGTGCAGCTC GAGGGTTCGT GCTGGATCGC GCAGGGCGGC
CGCGAGACGC GGCTGGCGCC AGGGCAGTTC GCGATCTACG ACACCCGGCG CCCCTACGAA
CTGCTGCTCG AAGAGGACTA CCAGCAGGCC GTGCTGCGCA TCCCCTGCGC CACGCTGATG
GCGCGTGCGC CCGATTGCGA TGCGCAGACG GCACAGGCCA TCTCGGCGGC CAGCAGCTCC
GCACGACGGC TGATCCACCA GGTCCGCGAA GCCTGTCGTG GCACGCGCCT GTCGCGTCCG
GCGCTGGCCG AGGCCTTGCT GGGCGCCGTC GGCGGCGGCC TGCGCGGCGA CGCCGACAGC
CGTGCAGCGA CGCCGCATTC GCGCCGCACG CTGCTGGCGC GCATCAAGGC CCATGTGGTC
GCCCACCTGG GTGATCCGCA GCTGTCGGTG CCGGGCATCG CCGCGACGCT GGGGCTGTCG
ACCAGCTACC TGCACCAGCT GTTCCGCTCC GAGGGCAGCA CGCTGGAACG CTGGATCTGG
GCTCAGCGCC TGGCCGCCTG CGAACGCGCC CTGATCGACC CGCGCGCGGC GCGGCACACG
CTGACGCAGA TCGCCTACAG CCATGGCTTC AGCGATGCGG CGCATTTCAG CCGCAGCTTC
CAGCAGCGCT ACGGCGCCTC GCCGCGCGAG TACCGCAAGT CGGCCGCCAC GGTGCCCGCG
GCCGGACCAC GCGACTGA
 
Protein sequence
MGHDRAFSTL HLPPGQRVAR WTEAASDRFV ESRFKVQDPD RFVASMLHRD LAELSVTRIT 
SVGHGFKHIT RSQRQVARAH EDFFLVSVQL EGSCWIAQGG RETRLAPGQF AIYDTRRPYE
LLLEEDYQQA VLRIPCATLM ARAPDCDAQT AQAISAASSS ARRLIHQVRE ACRGTRLSRP
ALAEALLGAV GGGLRGDADS RAATPHSRRT LLARIKAHVV AHLGDPQLSV PGIAATLGLS
TSYLHQLFRS EGSTLERWIW AQRLAACERA LIDPRAARHT LTQIAYSHGF SDAAHFSRSF
QQRYGASPRE YRKSAATVPA AGPRD
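
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a minimal sketch, not the tool that produced the record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is written in NCBI "TCAG" order; translation table 11 shares its amino-acid assignments with the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG.

```python
# Amino acids for all 64 codons in NCBI "TCAG" order (table 11 assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains anything other than A, C, G or T.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence shown above (60 nt, 20 codons).
first_line = "ATGGGCCACG ACCGTGCGTT CTCCACCCTC CACCTGCCGC CGGGCCAGCG CGTCGCGCGC"
print(translate(first_line))  # MGHDRAFSTLHLPPGQRVAR
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison against the protein sequence and the GC content value listed in the record.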