Gene Information |
Locus tag | Mpe_A3662 |
Symbol | |
ID | 4786069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3872672 |
End bp | 3873646 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640092244 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001022850 |
Protein GI | 124268846 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
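The coordinate and length fields in the record above can be cross-checked against one another. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with that reading) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3872672
end_bp = 3873646
gene_length_bp = 975
protein_length_aa = 324

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

All three checks hold for this record: 3873646 - 3872672 + 1 = 975 bp, which is 325 codons, leaving 324 residues once the stop codon is excluded.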
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCGA GCGCCTCGCC GACCGTCCCC GCCTCGTTGC CCGAGCCGCC CTGGTGGTCG ACCACGGACG TAGCCCTGGC CGACCGCACC GACGCCTGGG AGGCCGCGCT GAGCGACAAC TTCCGCCGCT GGCAGGTGCC GGACCGCGTG GCACCGGAGT TCAACGCGCG TCTGCGGCAC CGCGATGTGG CCGGGCTGCG CGTGGTCGAG TGCATCTGCG ACCCCTGCGG CGGGCGCCGG CTGCCGCGCC ATGTGAGCCA GGAAGGGGCG GCCTACATCG GCGTGCAGAT CACCCGTTCG GGGCGCGAGT CGCTGCGTGC CGGTGACGAG CGCGTGGCCG TCGGGCCGGG CGATCTGGTC ATCTGGACCA GCGACCAGCC GATGGAATTC ACGGTGACCG AGCGGCTGCA CAAGGTCTCG CTGATCCTGC CGTGGGACGA CGTGAAGGAG CGGCTGCCAC GCACCGGGCG CTTCCGCGGC ACGGTGCTCG ACAGCCGTTC GGGCATCGGC GCGGTGCTGT ACTCGCACAT CGAGTCGCTG GCCTCGCAGC TCGAGGATCT GGAGGCGGGC GAGCTGAGCG CGGTGCGGCG CGCCACCGTC GAGCTGCTGA CGGCGGCGAT GTCCTGCCGC CTCGAACCGG CGCCGCGCGG CCTGTCGCTG CAGCAGCTCA AGCGGGTGCA GGACTACATC CTCGACCATC TGCAGGACGA GACGCTGACG CCGGGCAGCA TCGCGCAGGC GCACCACATC TCGCCGCGCT ACCTGCACTT GCTGTTCGGC AAGACCGGGC AGAGCGTGTC GGCCTACATC CGCCAGCAGC GTCTCGAGCG CTGCGGCGAG GCGCTGAGCA ACCCGTCCTA CCGCGAACAC AGCGTGGCCG AGATCGCCTA CCAGTGGGGC TTCACCGACC CGGCGCACTT CAGCCGCGTC TTCAAGCAGC ATTTCGGCCA TCCGCCGGGG CATTGCCGCC GCTGA
|
Protein sequence | MEASASPTVP ASLPEPPWWS TTDVALADRT DAWEAALSDN FRRWQVPDRV APEFNARLRH RDVAGLRVVE CICDPCGGRR LPRHVSQEGA AYIGVQITRS GRESLRAGDE RVAVGPGDLV IWTSDQPMEF TVTERLHKVS LILPWDDVKE RLPRTGRFRG TVLDSRSGIG AVLYSHIESL ASQLEDLEAG ELSAVRRATV ELLTAAMSCR LEPAPRGLSL QQLKRVQDYI LDHLQDETLT PGSIAQAHHI SPRYLHLLFG KTGQSVSAYI RQQRLERCGE ALSNPSYREH SVAEIAYQWG FTDPAHFSRV FKQHFGHPPG HCRR
|
| |
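The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the GC content field and the basic shape of the CDS could be checked. The helper names (`clean`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the start-codon test is deliberately narrow: translation table 11 also permits alternative start codons, but this gene begins with ATG.

```python
def clean(seq: str) -> str:
    """Join the ten-character blocks into one uppercase string."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Check for whole codons, an ATG start, and a terminal stop codon.

    Assumption: only ATG is accepted as a start codon here, which is
    stricter than translation table 11 requires.
    """
    s = clean(seq)
    return len(s) % 3 == 0 and s.startswith("ATG") and s[-3:] in stops


# First two blocks of the gene sequence, used only as a small demonstration.
fragment = "ATGGAAGCGA GCGCCTCGCC"
print(clean(fragment))
print(gc_fraction(fragment))
```

Passing the full gene sequence string to `gc_fraction` gives a value that can be compared with the reported GC content, and `looks_like_cds` should accept it because it starts with ATG and ends with TGA.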