Gene Mpe_A2651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2651 
Symbol 
ID4785876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2823631 
End bp2824587 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content72% 
IMG OID640091222 
ProductAraC family transcriptional regulator 
Protein accessionYP_001021840 
Protein GI124267836 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0391115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGATC ACCAATCGGC CGATGTCGAC GACCAGGCGC AGGCGCTGAG CGGTTGGCAG 
CAGCGCTACG AGCAACTCGG CTGCGGCCGC TTCCGGGGGT CGGCGCGGCA GGTGGTGATG
GCCGGCGGCA CGGTGCTGCG CGAGTCCACC AACCGCCAGC TGCGCGAGCA GATCCGTCCG
CCCTCCGACT GCCTGGTGCT GGCCATCCCG CTGTCGGTCG CGCCCGGTTC GGTGTTTGCC
GGCCGGCCGC TGGACGGCGA TGCATTGATG GTCATCTCGG GCCACGAGGA GTACGAGCTG
GTGGCGGCCG GCGAACTCGA CCTGCTGGCG CTGTCGGTCG ACCGACGGCG GCTGGGCGGC
ATGCTGGCGC CCGAGGAGAT CGAGTGGCTG GCGCGGGCCG AACGCCAGCG GCGCTGGGCG
CTGGCCCCCG ACACCGCCGG CGCGGTGCGC AGCCAGCTGC TCGCCGTGTG TTCAGCCGCT
GGCCGCTGTG CGCCCGGTGC GGTGATCGAC ATCGAGAACG AGCCGGCGCT GATCGGCGCC
ACGCTCGCGC ACACGGTGGC GCTGGCGATG TCGGACGGCG GCGCCGACCG CGGCGCGGTC
GGCATTCCGC GGCGTGCCGA CTCGCGGCTG CGGGTGGTGA AGCGCGCCAT CGAATTCATC
CGCGCCAACC TGCAGGAGGA CATCGGCATC CCCGAGATCT GCGCGGCCGC CTGCGCCAGC
CGCCGCAGCC TGCAGTATTG CTTCGAGGAG TTCCTGCACA CCACGCCGCA GGCCTATCTG
CGCGCGCTGC GCCTGAACGA GGCGCGGCGT CGCCTGAAGC AACCGGGCGA TCAGCCCATC
ACGCTGCTGG CGTGCGCCAT GGGCTTCAGC AGCGCGAGCC ATTTCACTCG CCACTACAAG
CTGATGTTCA ACGAGCTGCC GTCGCAGACG CAGCGGCGGC GCACGCGCGA CGCCTGA
 
Protein sequence
MLDHQSADVD DQAQALSGWQ QRYEQLGCGR FRGSARQVVM AGGTVLREST NRQLREQIRP 
PSDCLVLAIP LSVAPGSVFA GRPLDGDALM VISGHEEYEL VAAGELDLLA LSVDRRRLGG
MLAPEEIEWL ARAERQRRWA LAPDTAGAVR SQLLAVCSAA GRCAPGAVID IENEPALIGA
TLAHTVALAM SDGGADRGAV GIPRRADSRL RVVKRAIEFI RANLQEDIGI PEICAAACAS
RRSLQYCFEE FLHTTPQAYL RALRLNEARR RLKQPGDQPI TLLACAMGFS SASHFTRHYK
LMFNELPSQT QRRRTRDA