Gene Mpe_A2658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2658 
Symbol 
ID4785883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2830286 
End bp2831392 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content71% 
IMG OID640091229 
ProductAraC family transcriptional regulator 
Protein accessionYP_001021847 
Protein GI124267843 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGGCGCG CGGCGCCGCG CTCCTATGAT GGTCCGCACC CACCCCGGCC GTCGGCGGTC 
GCGAGCAGGA GACCCGAAGT GGACAGGCAG ACCTCATCGC TGGTCGATTG CTGGGAGGAC
CCCGCCGTCT GGTCGACCGA GCGCGTGGCC CCGAGCAAGC AGTTCGATTG CTGGCGCGAC
TTCGTCATCG ACGCCCACCT GCACTGGTCG ATCCGGCCGA TCCGCTGCGA GCGCTTCCCG
GCCTTCATCC GCCAGGGCCG CTTCGACGGC TTTCGCGTCA CCCACCTCAC CTCCGCCCAG
GGCGGCATCG TCGGCACGCG CGGCGCGCGC GAGATGGCGC AGGACAGCGA GGCGCTCTAC
AACCTGATCT ACATCGCCGA GGGGTCGATC TGCCTGGTCA TCGACGACGA GGAACTCACG
CTGACCCCGG GTAGCTTCGC GCTCTGGGAC AGCGCGCGCC CGATGACCTT CATCACCGGC
GCCGGCCTGC GGCAGATCAC CCTCGCGGTG CCGCAGCGCG AACTGCAGCT CGCGCTGCCG
CGCGCCGGCG AGTTCGTCGG CCGCCGCTTC GCGGCCACCA GCGGCCTCAG CCGGCTGTTC
GTCGACCACC TCATCTCGCT CGACGCGCGC TTCGGCGAGC TGCCGCGCGG CAACGCGGGC
CACGTGCTGC ATGCCAGCGT GGAATTGCTG GCCTCCACGC TGAGCGCGCA GGCCGAGCCC
TGCGCCGGAC GCAGCGGGAA GATCGTGTTG CAAGGGGTGA TGGCCTACAT CGACCGCCAC
CTCGACGATC CGGAACTCGA CACACGCCGC GTCGCGAGCG ACTGCGGCAT CACCGAGCGG
CATCTGCACC GGCTGTTCGA ACGCGCCGAC ACCACGGCGG CAGCCTGGAT ACGGCGACAG
CGGCTGGACC GCTGCCGCCA GGACCTGCGC GCAGCCGAGA CCGCGCACCT CAGCATCACG
CAGATCGCCT ACCGCTGGGG TTTCGGCGAC TCCAGCAGCT TCAGCAAGAT CTTCAAGCGC
GAGTTCGCCA GCAGCCCGAA GGACTACCGC GCCGCCGGCG GCTTCTCCAG CCAGGCCCGG
CGTGCCTCCA GCGGCTGGGC CACGTAG
 
Protein sequence
MRRAAPRSYD GPHPPRPSAV ASRRPEVDRQ TSSLVDCWED PAVWSTERVA PSKQFDCWRD 
FVIDAHLHWS IRPIRCERFP AFIRQGRFDG FRVTHLTSAQ GGIVGTRGAR EMAQDSEALY
NLIYIAEGSI CLVIDDEELT LTPGSFALWD SARPMTFITG AGLRQITLAV PQRELQLALP
RAGEFVGRRF AATSGLSRLF VDHLISLDAR FGELPRGNAG HVLHASVELL ASTLSAQAEP
CAGRSGKIVL QGVMAYIDRH LDDPELDTRR VASDCGITER HLHRLFERAD TTAAAWIRRQ
RLDRCRQDLR AAETAHLSIT QIAYRWGFGD SSSFSKIFKR EFASSPKDYR AAGGFSSQAR
RASSGWAT