Gene Mpe_A0902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0902 
Symbol 
ID4787225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp953473 
End bp954549 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content72% 
IMG OID640089463 
ProductAraC family transcriptional regulator 
Protein accessionYP_001020099 
Protein GI124266095 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAGG ACACCGTCTC GATCTATTTC GTGCGGGCCG CGCTGGCGCA TCTTGCACCC 
GAGGCGCTGC CCGCGGTGCT GCGCGCGGCC GGCATCCCGG CCGAGATGCT GGCCCACCGC
CAGGCGCGCG TGCCGGCCCG GGCCTTCGCG GCGCTGTGGC TGGCCGTCGC GCACCAGCTC
GACGACGAAT TCTTCGGCCT CGACGCGCGC CGCATGAAGG TCGGCAGCTT CGCCCTGATG
TGCCATGCGG TGCTGCACTG CGGCACGCTC GGCCGGGCCC TGCGCCGCTG CCTGCGCAGC
TTCTCGGCCT TCCTCGACGA CCTGCACGGC GAGCTGGCGG TGGACGACGA CCGGGCGGTG
ATCACGCTGC ACAACCGCAT CGCCGACCCG GCCGACCGCC GCTTCGCCGA GGAGACCTAC
CTGGTGATGC TGCACGGCCT GATGTGCTGG CTGATCGGGC GGCGCATCCA GCTCGCCCAC
ATCGCCTTCG GCCACGCGCT GCCCGAGGAA GCGCGCGAGT ACCGCGTGAT GTACGGCGAA
GACCTGGCGT TCGGCGCCGA ACGCACCACC ATCGAGTTCG ACGCCGCGCT GCTGGAGGCG
CCGATCATCC AGACCGAGGC CACGCTGAAG AGCTTTTTGC GCTCGGCGCC GCAGTCGGTG
TTCCTCAAGT ACAAGAGCAC CGACAGCTGG ACCGCGCGCG TGCGGCGCCG GCTGCGCCGC
TGCCTAGGCG GCCAGGCCGG CTGGCCGACG CTGGACGACC TGGCGCTCGA GTTCCACGTC
GCCCCGTCGA CGCTGCGGCG CCGGCTGGAG GCCGAGGGTG GCACCTACCA GAGCGCCAAG
GACGACCTGC GGCGCGATGC GGCGATCCAC CACCTGTGCC ACAGCCGGCT CAGCATCGCC
GAGATCTCCA CGCTGCTCGG CTTCCAGGAG CCCAGCGCCT TCCACCGCGC CTTCAAGAAA
TGGACCGGGT CGCAGCCGGG CGAGTACCGC GCGTTGCGTG CCGACGGCAT GTCGGCGCCG
GGCGCCGACG CCACCGCGCG GCGGCCGGCC ACCGCCCCGG CACTGCTCAG CGGTTGA
 
Protein sequence
MDKDTVSIYF VRAALAHLAP EALPAVLRAA GIPAEMLAHR QARVPARAFA ALWLAVAHQL 
DDEFFGLDAR RMKVGSFALM CHAVLHCGTL GRALRRCLRS FSAFLDDLHG ELAVDDDRAV
ITLHNRIADP ADRRFAEETY LVMLHGLMCW LIGRRIQLAH IAFGHALPEE AREYRVMYGE
DLAFGAERTT IEFDAALLEA PIIQTEATLK SFLRSAPQSV FLKYKSTDSW TARVRRRLRR
CLGGQAGWPT LDDLALEFHV APSTLRRRLE AEGGTYQSAK DDLRRDAAIH HLCHSRLSIA
EISTLLGFQE PSAFHRAFKK WTGSQPGEYR ALRADGMSAP GADATARRPA TAPALLSG