Gene Information |
Locus tag | Mpe_A3779 |
Symbol | |
ID | 4785948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3998171 |
End bp | 4000093 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640092362 |
Product | hypothetical protein |
Protein accession | YP_001022967 |
Protein GI | 124268963 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2766] Putative Ser protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00146351 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATGTGA TCAGCAGCTT TGCAGCGCGC TACGAGCGCA GCCGCGAGGA AGAGTTCACG CTCGAGGAGT ACCTCGACAT CTGCCGGCGT GAGCCGGTGG CCTATGCCAC GGCGGCCGAG CGCATGCTCA AGGCCATCGG CGAACCCGAA CTCGTGGACA CCCGCAACGA TCCGCGCATG TCGCGCCTGT TCGCGAACAA GGTGATCAAG CGCTATCCGG CCTTCGCCGA GTTCTACGGC ATGGAAGATT CGATCGAACA GGTCGTCTCC TACTTCCGCC ACGCGGCGCA GGGGCTCGAG GAGAAGAAGC AGATCCTCTA CCTGCTGGGA CCGGTGGGCG GCGGCAAGAG TTCGATCGCC GAACGGCTGA AGCACCTGAT GCAGGAAGTG CCGTTCTACG CCATCAAGGG CTCGCCGGTG AACGAATCGC CGCTCGGCCT GTTCGACATG GCCGAGGACG GCCCCATCCT GGAGAAGGAG TACGGCATCC CGCGGCGCTA TTTGAACCGC ATCCTGTCGC CCTGGGCCGT GAAACGGCTC GACGAGTACG GCGGCGACAT CCGACAGTTC AGAGTCGTGA AGCGCTATCC GTCGATCCTG AAGCAGGTCG GCGTTGCCAA GACCGAGCCG GGCGACGAGA ACAACCAGGA CATCAGCTCG CTGGTCGGCA AGGTCGACAT CCGCAAGCTC GAGACCTATG CACAGGACGA TCCCGACGCC TACAGCTACT CCGGCGGCCT GTGCCTGGCC AACCAGGGTC TGCTCGAGTT CGTCGAGATG TTCAAGGCGC CGATCAAGGT GCTGCACCCG CTGCTGACGG CGACCCAGGA AGGCAACTAC AAGGGGACCG AGGGTTTCGG CGCGATCCCC TTCGACGGCA TCATCCTCGC GCACAGCAAC GAGAGCGAGT GGAAGACCTT CCGCAACAAC AAGAACAACG AAGCCTTCCT CGACCGCATC TACATCGTCA AGGTGCCGTA CTGCCTGCGC ATCACTGAAG AGATCAAGAT CTACGACAAG CTGATTCGAG GCTCCTCGCT CAGCGAGGCG AAGTGCGCGC CCGGCACATT GAAGATGATG GCGCAGTTCG CGGTGCTGTC GCGCCTGAAG GAGCCGGAGA ACTCTTCGCT GTTCAGCAAG GCGCTGGTCT ATGACGGTGA GAGCCTCAAG GACACCGATC CCAAGGCCAA GAGCTACCAG GAGTACCGCG ACTACGCCGG CGTCGACGAG GGCATGAGCG GCATCTCGAC GCGCTTTGCC TTCAAGATCC TGTCCAAGGT GTTCAACTTC GACTCGTCGG AAGTCGCAGC GAACCCGGTG CATCTGATGT ACGTGCTGGA GCAGCAGATC GAGCGCGAGC AGTTCCCCAC CGAGACCGAG CAGAAGTACC TCGGCTTCAT CAAGGAGTTC CTGGCGGCGC GCTACGCCGA GTTCATCGGC AAGGAGATCC AGACCGCCTA CCTCGAGAGC TACTCCGAGT ACGGCCAGAA CATCTTCGAC CGCTACGTGA CCTACGCCGA CTACTGGATC CAGGACCAGG AGTACCGTGA CACCGACACC GGCGAGGTGT TCGACCGGGG CTCGTTGAAC GCCGAGCTCG AGAAGATCGA GAAGCCGGCG GGCATCGCCA ACCCGAAGGA CTTCCGCAAC GAGATCGTCA ACTTCGTGCT GCGCGCCCGT GCCAACAATG CCGGCAACAA CCCGTTGTGG ACGAGCTACG AGAAGCTGCG CACGGTGATC GAGAAGAAGA TGTTCTCGAA CACCGAGGAA CTGCTGCCGG TGATCAGCTT CAACGCGAAA 
GCCAGTGCCG ACGAGGCGAA GAAGCACGAG GACTTCGTGA ACCGCATGGT GCAGAAGGGC TACACGCCCA AGCAGGTGCG CCTGCTGTGC GAGTGGTACC TGCGTGTGAG AAAGAGTTCA TGA
|
Protein sequence | MDVISSFAAR YERSREEEFT LEEYLDICRR EPVAYATAAE RMLKAIGEPE LVDTRNDPRM SRLFANKVIK RYPAFAEFYG MEDSIEQVVS YFRHAAQGLE EKKQILYLLG PVGGGKSSIA ERLKHLMQEV PFYAIKGSPV NESPLGLFDM AEDGPILEKE YGIPRRYLNR ILSPWAVKRL DEYGGDIRQF RVVKRYPSIL KQVGVAKTEP GDENNQDISS LVGKVDIRKL ETYAQDDPDA YSYSGGLCLA NQGLLEFVEM FKAPIKVLHP LLTATQEGNY KGTEGFGAIP FDGIILAHSN ESEWKTFRNN KNNEAFLDRI YIVKVPYCLR ITEEIKIYDK LIRGSSLSEA KCAPGTLKMM AQFAVLSRLK EPENSSLFSK ALVYDGESLK DTDPKAKSYQ EYRDYAGVDE GMSGISTRFA FKILSKVFNF DSSEVAANPV HLMYVLEQQI EREQFPTETE QKYLGFIKEF LAARYAEFIG KEIQTAYLES YSEYGQNIFD RYVTYADYWI QDQEYRDTDT GEVFDRGSLN AELEKIEKPA GIANPKDFRN EIVNFVLRAR ANNAGNNPLW TSYEKLRTVI EKKMFSNTEE LLPVISFNAK ASADEAKKHE DFVNRMVQKG YTPKQVRLLC EWYLRVRKSS
|
| |
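The length fields in this record can be cross-checked against the coordinates and the sequence. The sketch below is an illustration only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. All names in it (`coordinate_span`, `translate`, `PARTIAL_CODON_TABLE`) are hypothetical, and the codon table covers only the codons found in the first 30 bases of the gene sequence.

```python
# Values copied from the Gene Information table above.
START_BP = 3998171
END_BP = 4000093
GENE_LENGTH_BP = 1923
PROTEIN_LENGTH_AA = 640

# First 30 bases of the gene sequence and first 10 residues of the protein,
# copied from the Sequence table with the spacing removed.
GENE_PREFIX = "ATGGATGTGATCAGCAGCTTTGCAGCGCGC"
PROTEIN_PREFIX = "MDVISSFAAR"

# Partial codon table: only the codons that occur in GENE_PREFIX.
# These assignments are the same in translation table 11 and the standard code.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "GAT": "D", "GTG": "V", "ATC": "I", "AGC": "S",
    "TTT": "F", "GCA": "A", "GCG": "A", "CGC": "R",
}


def coordinate_span(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate complete codons of dna using the partial table above."""
    usable = len(dna) - len(dna) % 3
    return "".join(PARTIAL_CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


span = coordinate_span(START_BP, END_BP)
codons, remainder = divmod(span, 3)
residues = codons - 1  # assumes the last codon is the stop codon

print("span matches gene length:", span == GENE_LENGTH_BP)
print("whole number of codons:", remainder == 0)
print("residues match protein length:", residues == PROTEIN_LENGTH_AA)
print("prefix translation matches:", translate(GENE_PREFIX) == PROTEIN_PREFIX)
```

The gene sequence is listed starting with ATG even though the record gives the strand as "-", so the sketch translates it as written without taking a reverse complement.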