Gene Information |
Locus tag | Mpe_A3599 |
Symbol | |
ID | 4786125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3808230 |
End bp | 3810764 |
Gene Length | 2535 bp |
Protein Length | 844 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640092181 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001022787 |
Protein GI | 124268783 |
COG category | [C] Energy production and conversion [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
|
|
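The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Mpe_A3599.
length_bp = gene_length(3808230, 3810764)
print(length_bp)                  # 2535
print(protein_length(length_bp))  # 844
```

Under these assumptions the computed values agree with the listed Gene Length (2535 bp) and Protein Length (844 aa).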
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.675813 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACG ACCTCCGCCG CTTCTCCACC GACGCCTTCC CGGAAGACCT GCGCCCGGCC GCCTGGGCCG AGGTGATGGC CAAGGTGCTG ATGGCCCCGG CCGCGCCGGC CTACGACGCA CCGCCGCTGG CCGGTTACGT GTCGGCGCGC AGCTCGGCGC TGGAATCGGT GTTCGCGCGG CTCGGCTCGT CGCCGCAGGT GTGGCTGCCG CACCCCGGCA ACGCACGGCG CGGCGGCCAC ACGGTGCTGA TGCTGGCGCT GCTGGAGGGC AGCGGCCTGG TGGTGGAGGG GCAGGAGTCG GTGGCGCTGG CGGCCGGCGG CATCGTGCTG CTCGACCCGG CGCGGCCCTG GCGTGTGGAG CTCAATACCG ATTTCCGGGC GGTGGTGATC CGGCTCGAAT CGGCGAGCTT CGTGCTGCGC CTGGTGCGCA CCAGCGGGCA GGACCTCAAC ACCATCTCGC CGAAGACCGG CGTGGGCGCG GTCTGCCTGA ACCTGGTGCG CTCGATCGCC GACGAGCTGG ACCAGCTGGG CCGCCACGAG CTGCTGCCGA TCGAGGCGAC GTTGATGGAG CTGCTGGTCA CCTGCCTGTC GCACGTGGAC GATGAAGACG CGCAGGCCGC GGCGTCACCG GACGACTCCA CGTCGGTGCA GCTCGGCCAT CTGCGGCGCG TGTGCCGCAC GGTCGAGGCG CGGCTCGGCG ACGCCGAGCT GACGCTGGAG GCGATCGCGG CGCTGGAGGG GCTGTCGCCG CGCTACATCC AGAAGCTGTT CAAGGCCGCG TCGGCCAGCT TCGGCGAGTA CCTGAAGGCG CGCCGCCTGG AGCGCTGCCG GCTCGACCTG GCCAACCGCG CGCTGGCGCA CTTCACGATC GCCGAGCTGT GCTTCCGCTG GGGCTTCGGC GATGCGGCCA ACTTCAGCCG CGCGTTCACT GCGCGCTTCG GTGTCTCGCC CAAGGCCTAC CGCGCCGCGC CGCCGGCCGA TGCGGCGGCG GGCCCCACGC AGCGTGGCGC GCCGGCGGCC GGCGTGGCGA CCGCGAGCTC GGCGCGCGCG GCCGCGGTGG CCGACACGCC CGAGGCGCGC CGCCGGCAGT TCGAGGCGGT GCTGACGGAC CACGAGCGCT ACGCGCTGGC GCTGGCGCTC GCGCCGAAGC GCGCGGTGCC ACACGCCGCG GAGGCGGGGT TCCCGGAGGC GCTGCCCGGC GCCGGGCATG CGCCCGAATA CCACTACATC CCGGTGTCCG ACCGGACGGT GCACTGGGGC TACCTGAGCC GCACGCTGAA GCCGGTGATC AGCGTGCGCT CGGGCGACGT GGTGACGATC GAGACGCTGA CCCAGCACGC CAGCGACGAC CGCGAACGCA TGATCGACGG CGACCCCGGC GCCGAGAGCG TGTTCCACTG GACCGCCACG CGCAAGGCGG TGGCGCGGCG CGGCGCCGGG CCGATGGACG CGTCGGTGTT CGGCCGCGGC GCGGGCGAGG GCTTCGGCGT GCACATCTGC ACCGGGCCGG TCCACGTGCG GGGCGCCGAG ACCGGCGACG TGCTGGAGGT GCGCATCCTC GACATCCGTG CACGCCCGAG CTGCCACCCG CAGCATCGCG GCAAGCTGTT CGGCAGCAAT GCCGCGGCCT GGTGGGGCTA CCAGTACCAC GACCTGCTGA CCGAGCCGAA GAAGCGCGAG GTGGTCACGC TCTACGAGCT GCAGACCGAC GTGGCGGAGC CGTATGCGAA GGCCGTCTAC AGCTTCCGCT GGACGCCGCA GACCGACCCC GACGGCGTGC GCCACGAGAC CATCGACTAC 
CCCGGCGTGC CGGTCGACCC CGCCAGCGTC GACAAGCGCT ACGGCGTGCT GGCGAAGGCG CGCGTGCCGA TCCGCCCGCA CTTCGGCCTG CTGGCGGTGG CGCCGAAGGA GAGCGGGCTG ATCGATTCGG TGCCACCGGG CTACTTCGGC GGCAACCTCG ACAACTGGCG CGCCGGCAAG GGCGCGCGGC TGTTCCTGCC GGTGTCGGTG GAGGGCGCGC TGTTCTCGGT GGGCGACCCG CACGCCTCGC AGGGCGATGG CGAGGTCTGC GGCACCGCGA TCGAGTGCTC GCTGACCGGC AGCTTCCAGC TCGTGCTGCA CAAGCGCGCG CAGATCGCCG ACGGCTTCCT CGCCGACCTG AACCACCCCT TCCTCGATGC CGAGGACGCC TGGGTGGTGC AGGGCTTCAG CTTCGCCAAC CACCTGGCCG AGCTGGGCGC GCAGGCACAG ACCCAGGTCT ACCAGAAGTC CTCGCTGGAC CTGGCGATGC GCGACGCCTT CCGCAAGGCG CGCCGCTACC TGATGCAGGC CCACGGGCTG GACGAGGACG AGGCGCTGTC GCTGATGTCG GTGGCGGTGG ACTTCGGCGT GACACAGGTC GCCGACGGCA ACTGGGGCGT GCACGCGGTG ATCCGCAAGG CGATCTTTCC GCCGCAGGCG GAGGGTGACG GGGCGGGCGC GGCACCCGCC TCGTCGTCGG CGCCGGCTCA GCCGTTGCGG AACAGGAAGC TGTAG
|
Protein sequence | MLDDLRRFST DAFPEDLRPA AWAEVMAKVL MAPAAPAYDA PPLAGYVSAR SSALESVFAR LGSSPQVWLP HPGNARRGGH TVLMLALLEG SGLVVEGQES VALAAGGIVL LDPARPWRVE LNTDFRAVVI RLESASFVLR LVRTSGQDLN TISPKTGVGA VCLNLVRSIA DELDQLGRHE LLPIEATLME LLVTCLSHVD DEDAQAAASP DDSTSVQLGH LRRVCRTVEA RLGDAELTLE AIAALEGLSP RYIQKLFKAA SASFGEYLKA RRLERCRLDL ANRALAHFTI AELCFRWGFG DAANFSRAFT ARFGVSPKAY RAAPPADAAA GPTQRGAPAA GVATASSARA AAVADTPEAR RRQFEAVLTD HERYALALAL APKRAVPHAA EAGFPEALPG AGHAPEYHYI PVSDRTVHWG YLSRTLKPVI SVRSGDVVTI ETLTQHASDD RERMIDGDPG AESVFHWTAT RKAVARRGAG PMDASVFGRG AGEGFGVHIC TGPVHVRGAE TGDVLEVRIL DIRARPSCHP QHRGKLFGSN AAAWWGYQYH DLLTEPKKRE VVTLYELQTD VAEPYAKAVY SFRWTPQTDP DGVRHETIDY PGVPVDPASV DKRYGVLAKA RVPIRPHFGL LAVAPKESGL IDSVPPGYFG GNLDNWRAGK GARLFLPVSV EGALFSVGDP HASQGDGEVC GTAIECSLTG SFQLVLHKRA QIADGFLADL NHPFLDAEDA WVVQGFSFAN HLAELGAQAQ TQVYQKSSLD LAMRDAFRKA RRYLMQAHGL DEDEALSLMS VAVDFGVTQV ADGNWGVHAV IRKAIFPPQA EGDGAGAAPA SSSAPAQPLR NRKL