Gene Information |
Locus tag | Mpe_A1005 |
Symbol | |
ID | 4787181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1068464 |
End bp | 1069477 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640089567 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001020202 |
Protein GI | 124266198 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
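The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 1068464
END_BP = 1069477


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "1014 337"
```

Both results agree with the Gene Length (1014 bp) and Protein Length (337 aa) rows.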
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.829176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAA TGCCGCCCGT CTTGCCCGAA ACGCCGCCGG CCCCGCAGGC GTCCGACCTG CTGTCGGACC TGCTCGGCAG CATGCATCTG GCCGGCATGG TGCTGTTCCG CGCCGCGTTC CGCGAGCCCT GGGCGGTGCT CACGCCGGGG TCGGCACAGC TGGCGCGGGT GCTGCCGTTC CGCACCGAGC ACATCATTCC CTTCCATGTC ATCGCGACCG GCAGTTGCCG GCTGGCGCTG CCTGGGCACG AGCCGGTGTG GCTGCGCGAA GGCGATGCCG TGTTGCTGCC CTACGGCGAC AGCCACCGCC TCGAAGGCCT CGAGGAGACC GCGCCGGTGC AGGTCGGCCA GCTGCTGCCC TGCCAGCCCT GGAAGGACAT GCCGCAGGTG GAGCATGGCG GCGGCGGCGC CGTCACGCAC ATCATCTGCG GCTTCCTGCA GTGCGACGAG CTGCTGTTCC GCCCCATCCT GCGGCATCTG CCGACCCTGC TGCACGTGAG CCCCGACGCC ACGCCGGCCG ACCACTGGCT GGGCAGCACC ATCCGCCACA CCGCCGCCGA GGCCAGCCGG ACCACGCCGG GCTCGCGCAG CATGCTGCCG CGGCTGACCG AGCTGATGTT CGTGGAGATC CTGCGCAAGC ACATGCAGGG CCTGTCGGAC GACGAGAACG GCTGGTTCGC CGCGTGCAGG GACCCGGTGA CCGGCTCGGC GCTGAAGCTG CTGCACGATG CGCCGCTGCA GGACTGGAGC GTGGAGCGGC TGGCACGCGC AGTCGGCGTC TCGCGCACGG TGCTGGCCGA GCGCTTCCGT CACTACCTCG ACCAGCCGCC GATGCAGTAC CTGGCGCACT GGCGGCTGCA GCTCGCCGCC CAGCAGCTGA AGACCGGCGA CCAGCCGCTG AAGACCATTG CCGACCGGGT GGGTTACGAG TCGGAGGCCG CCTTCAGCCG CGCCTTCAAG CGCCACTTCG GCCTGCCGCC CGGCAGCTGG CGGCTGCGGC AGGGCACGCA CTGA
|
Protein sequence | MTEMPPVLPE TPPAPQASDL LSDLLGSMHL AGMVLFRAAF REPWAVLTPG SAQLARVLPF RTEHIIPFHV IATGSCRLAL PGHEPVWLRE GDAVLLPYGD SHRLEGLEET APVQVGQLLP CQPWKDMPQV EHGGGGAVTH IICGFLQCDE LLFRPILRHL PTLLHVSPDA TPADHWLGST IRHTAAEASR TTPGSRSMLP RLTELMFVEI LRKHMQGLSD DENGWFAACR DPVTGSALKL LHDAPLQDWS VERLARAVGV SRTVLAERFR HYLDQPPMQY LAHWRLQLAA QQLKTGDQPL KTIADRVGYE SEAAFSRAFK RHFGLPPGSW RLRQGTH
|
| |
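The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below, under stated assumptions, strips the spacing, computes a GC fraction, and translates a coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used as sample input.

```python
# Codon table in the usual TCAG ordering; amino-acid assignments are those
# of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    prefix = clean("ATGACCGAAA TGCCGCCCGT CTTGCCCGAA")
    print(translate(prefix))  # prints "MTEMPPVLPE"
```

The translated prefix matches the first block of the protein sequence. Running the same functions on the full cleaned gene sequence is one way to compare against the GC content and Protein Length rows.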