Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3480 |
Symbol | |
ID | 4786242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3691046 |
End bp | 3692491 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640092061 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001022668 |
Protein GI | 124268664 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.324042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATCAA TGCGCCATCC CGACCCCCTG CCCTCGCCCT TCATCCCCAT GGCCGCCGCC AAACCCATGC AGCACTGGCT GCGCCGACTC GAGGGCAGCG ACCGGCCCGC CTACCTGCTG ATCGCCGACC TGATCGCCGA GGACCTGCGC ACCGGCCGGC TCGGCGCGCA GGAGCGCCTG CCCACGCTGC GCCAGTTGGC CGACAGCCTG CGGCTCAACT ACACCACGGT GGCGCGCGCC TATGGCGAGG CGCGCAAGCG CGGGCTGATC GACTCGCGTC CCGGCATGGG CACCTACGTG CGCGGGCGCA GCCCGGCCGT GCCGCTGCGC GGCGGCAGCG GCGCGGAGAT GACGATGAAC CTGCCGCCGG AGCCCGAGGA CCCGGCGCTG GTCGAGCGCC TGCGCGACAG CGCCCGCGAG CTGATGGCCC GCAGCGACCT GTACACGCTG ATGCGCTATC AGGATTTCGG CGGCTCGCCC GAGGACAAGG ACGCCGCGGT GCAGTGGCTG CGCCACCGTT TGCCGGACTG CAGCGCCGAG CGCGTGCTGG TCTGCCCCGG CATCCACAGC GCGCTGGCCG CACTGGTGTC GCAGCTGGCG CGGCCGGGCG AGCTGGTGTG CGTGGAGTCG CTCACCTACC CGGGCATCAA GGCGATCGCC ACCCAGCTCG GCGTGCAGCT GCACGCGCTG GCGCTCGACG ACGAAGGCCC GAGTGCGGCC GACTTCGAAC AGGCCTGCAA GACCCTCAAG CCCAAGGCGC TGTACTGCAA CCCGACGCTG CTGAACCCGA CCACGCTCAC CACCTCGAAG CGGCGGCGCG AGGCGCTGGC CGACATCGCG CTGCGCTACA GCGTGCCGAT CGTCGAGGAC GACGCCTACT CGATGCTGCC GCGCGAGGTG CCGCCGCCGC TGGCGCTGCT GGCGCCGGAG CTCACCTACT ACGTCACCGG TTTCAGCAAG TGTCTCGGCG CGGGCCTGCG CACCGCCTAC GTCAGTGCGC CCAGCGAACG CCAGGCCCAG CGGCTGGCCG GCGCGCTGCG CGCCACCACG GTGATGGCGT CGCCGGTGAC CAATGCGCTG GCCACCCGCT GGGTCGTCGA CGGCAGCGCG CAGGCGATGC TGCAGGCGAT CCGCAACGAG TCGATCGCCA GGCAGGCGAT GGCGGCCCGC CACCTGGCGC GCCACGCGGT GCAGGCGCAG CCGGAGGGCT TCCACCTCTG GCTGCCGCTG TCCTCTTCGT GGAGCACGGT GGAGTTCGCG TCCTACCTGC GCACCCAGGG CGTCGGCGTG GTGGCCAGCG CCGCCTTCTC GACCGACGGC GATCCGCCGG ACGCGGTGCG CATCTGCCTC GGCGGCCCGC TGACGCGCGA GGACTGCGAC GCCGCGCTGC GGCTGATCGC CGACACGCTG GACCACCCGC TGCATCCGCA CGCCACCGTG ATGTAG
|
Protein sequence | MQSMRHPDPL PSPFIPMAAA KPMQHWLRRL EGSDRPAYLL IADLIAEDLR TGRLGAQERL PTLRQLADSL RLNYTTVARA YGEARKRGLI DSRPGMGTYV RGRSPAVPLR GGSGAEMTMN LPPEPEDPAL VERLRDSARE LMARSDLYTL MRYQDFGGSP EDKDAAVQWL RHRLPDCSAE RVLVCPGIHS ALAALVSQLA RPGELVCVES LTYPGIKAIA TQLGVQLHAL ALDDEGPSAA DFEQACKTLK PKALYCNPTL LNPTTLTTSK RRREALADIA LRYSVPIVED DAYSMLPREV PPPLALLAPE LTYYVTGFSK CLGAGLRTAY VSAPSERQAQ RLAGALRATT VMASPVTNAL ATRWVVDGSA QAMLQAIRNE SIARQAMAAR HLARHAVQAQ PEGFHLWLPL SSSWSTVEFA SYLRTQGVGV VASAAFSTDG DPPDAVRICL GGPLTREDCD AALRLIADTL DHPLHPHATV M
|
| |