Gene Information |
Locus tag | Mpe_A0902 |
Symbol | |
ID | 4787225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 953473 |
End bp | 954549 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640089463 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001020099 |
Protein GI | 124266095 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAAGG ACACCGTCTC GATCTATTTC GTGCGGGCCG CGCTGGCGCA TCTTGCACCC GAGGCGCTGC CCGCGGTGCT GCGCGCGGCC GGCATCCCGG CCGAGATGCT GGCCCACCGC CAGGCGCGCG TGCCGGCCCG GGCCTTCGCG GCGCTGTGGC TGGCCGTCGC GCACCAGCTC GACGACGAAT TCTTCGGCCT CGACGCGCGC CGCATGAAGG TCGGCAGCTT CGCCCTGATG TGCCATGCGG TGCTGCACTG CGGCACGCTC GGCCGGGCCC TGCGCCGCTG CCTGCGCAGC TTCTCGGCCT TCCTCGACGA CCTGCACGGC GAGCTGGCGG TGGACGACGA CCGGGCGGTG ATCACGCTGC ACAACCGCAT CGCCGACCCG GCCGACCGCC GCTTCGCCGA GGAGACCTAC CTGGTGATGC TGCACGGCCT GATGTGCTGG CTGATCGGGC GGCGCATCCA GCTCGCCCAC ATCGCCTTCG GCCACGCGCT GCCCGAGGAA GCGCGCGAGT ACCGCGTGAT GTACGGCGAA GACCTGGCGT TCGGCGCCGA ACGCACCACC ATCGAGTTCG ACGCCGCGCT GCTGGAGGCG CCGATCATCC AGACCGAGGC CACGCTGAAG AGCTTTTTGC GCTCGGCGCC GCAGTCGGTG TTCCTCAAGT ACAAGAGCAC CGACAGCTGG ACCGCGCGCG TGCGGCGCCG GCTGCGCCGC TGCCTAGGCG GCCAGGCCGG CTGGCCGACG CTGGACGACC TGGCGCTCGA GTTCCACGTC GCCCCGTCGA CGCTGCGGCG CCGGCTGGAG GCCGAGGGTG GCACCTACCA GAGCGCCAAG GACGACCTGC GGCGCGATGC GGCGATCCAC CACCTGTGCC ACAGCCGGCT CAGCATCGCC GAGATCTCCA CGCTGCTCGG CTTCCAGGAG CCCAGCGCCT TCCACCGCGC CTTCAAGAAA TGGACCGGGT CGCAGCCGGG CGAGTACCGC GCGTTGCGTG CCGACGGCAT GTCGGCGCCG GGCGCCGACG CCACCGCGCG GCGGCCGGCC ACCGCCCCGG CACTGCTCAG CGGTTGA
|
Protein sequence | MDKDTVSIYF VRAALAHLAP EALPAVLRAA GIPAEMLAHR QARVPARAFA ALWLAVAHQL DDEFFGLDAR RMKVGSFALM CHAVLHCGTL GRALRRCLRS FSAFLDDLHG ELAVDDDRAV ITLHNRIADP ADRRFAEETY LVMLHGLMCW LIGRRIQLAH IAFGHALPEE AREYRVMYGE DLAFGAERTT IEFDAALLEA PIIQTEATLK SFLRSAPQSV FLKYKSTDSW TARVRRRLRR CLGGQAGWPT LDDLALEFHV APSTLRRRLE AEGGTYQSAK DDLRRDAAIH HLCHSRLSIA EISTLLGFQE PSAFHRAFKK WTGSQPGEYR ALRADGMSAP GADATARRPA TAPALLSG
|
| |
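The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene sequence is a stop codon (the listed sequence ends in TGA). The variable names and the `gc_percent` helper are hypothetical and not part of the source database.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 953473
end_bp = 954549

# Gene length in base pairs from inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Number of codons, then protein length.
# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1


def gc_percent(seq: str) -> float:
    """Return the GC percentage of a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        return 0.0
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields. Passing the Gene sequence field to `gc_percent` would give a figure to compare against the reported GC content; the helper strips the spaces that separate the 10-base blocks.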