Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0994 |
Symbol | |
ID | 4787170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1055244 |
End bp | 1056200 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640089556 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001020191 |
Protein GI | 124266187 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACCC TGTTCACCAC TGTCGGTGTC GAGCCCGCCC AGCGCCTGGC GTACTGGACC GATGGCGTCT GCGACACCTA TGTGCAGCTG GACTGCGACG CGATCGGCGA CGCCTCGCGG TTCGAAGGCG AGATCGCCGC CGATGCGCTG TCCACGCTGA CGCTGTCGCG CGTGACGGCC AGCGCGCAGC GCGTGCGCCG CACGCCGGCC AAGATCGCGC GGGCGAGCGA GGACTACTTC CTCGTCAGCA TCCAGACGCA GGGCTGCGGC GCGGTCGTGC AGGACGGACG CGTGGCCCAT CTGTCGCCCG GCGACTTCGC CCTCTACGAC AGCACGCGGC CCTACGAGCT GCGTTTCGAC GCGGCATTCC AGCAGTACGT GCTGATGCTG CCCGGCCCCA CGCTGCGCAC CGCCCTGACC GACACGCCGT CGCTCACCGC CTGCGCGGTC AGCGGCCGGC GCGGGGCCGG TCACCTGATG ATCGGCATGA TCGAGACGCT GTCGGCCGAG ATCGCCACGC TGGCGCCCGA GTCGGCTGCG GCCGTGGCGG ACAGCGTGAC GCAGATCCTG ATCGCCGGCT TGTCGACGTT GCCGGGCGCC CGGCCGCCGC CGGTGTCGCG GCTCGTCGCG CTGCACCGGC AGCAGGTGAA GGCGCTGGTG CGCGAGCGGC TGCGCGATCC GGATCTCACG GTGGTCCGCA TCGCCGAGCG CCTCGGCGTG TCACCGAGCA CCTTGCACCG GGCCTGGGCC GGCGAGGCCT GCTCGCTGGC CGACTGGATC CAGACCGAGC GCCTGGAAGC CGCGCGGCGC GACCTGTGCG CCGCGCAGCA GCGTCTGCGC CGCGTGAGCG AACTGGCGTA CTCGTGGGGC TTCAAGGATG CCGCCCACTT CAGCCGCGCC TTCCGCCGGC GCTTCGGCTG TTCACCGCGC GAGCTGCGGG CCCGGATCGA AACCTGA
|
Protein sequence | MPTLFTTVGV EPAQRLAYWT DGVCDTYVQL DCDAIGDASR FEGEIAADAL STLTLSRVTA SAQRVRRTPA KIARASEDYF LVSIQTQGCG AVVQDGRVAH LSPGDFALYD STRPYELRFD AAFQQYVLML PGPTLRTALT DTPSLTACAV SGRRGAGHLM IGMIETLSAE IATLAPESAA AVADSVTQIL IAGLSTLPGA RPPPVSRLVA LHRQQVKALV RERLRDPDLT VVRIAERLGV SPSTLHRAWA GEACSLADWI QTERLEAARR DLCAAQQRLR RVSELAYSWG FKDAAHFSRA FRRRFGCSPR ELRARIET
|
| |