Gene Mpe_A3779 details

Gene Information

Locus tag: Mpe_A3779
Symbol:
ID: 4785948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3998171
End bp: 4000093
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 62%
IMG OID: 640092362
Product: hypothetical protein
Protein accession: YP_001022967
Protein GI: 124268963
COG category: [T] Signal transduction mechanisms
COG ID: [COG2766] Putative Ser protein kinase
TIGRFAM ID:
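
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length (the gene sequence in this record does end in TGA).

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3998171
end_bp = 4000093

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
codon_count, leftover_bases = divmod(gene_length, 3)
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"codons: {codon_count} (leftover bases: {leftover_bases})")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values match the listed Gene Length (1923 bp) and Protein Length (640 aa).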


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00146351
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATGTGA TCAGCAGCTT TGCAGCGCGC TACGAGCGCA GCCGCGAGGA AGAGTTCACG 
CTCGAGGAGT ACCTCGACAT CTGCCGGCGT GAGCCGGTGG CCTATGCCAC GGCGGCCGAG
CGCATGCTCA AGGCCATCGG CGAACCCGAA CTCGTGGACA CCCGCAACGA TCCGCGCATG
TCGCGCCTGT TCGCGAACAA GGTGATCAAG CGCTATCCGG CCTTCGCCGA GTTCTACGGC
ATGGAAGATT CGATCGAACA GGTCGTCTCC TACTTCCGCC ACGCGGCGCA GGGGCTCGAG
GAGAAGAAGC AGATCCTCTA CCTGCTGGGA CCGGTGGGCG GCGGCAAGAG TTCGATCGCC
GAACGGCTGA AGCACCTGAT GCAGGAAGTG CCGTTCTACG CCATCAAGGG CTCGCCGGTG
AACGAATCGC CGCTCGGCCT GTTCGACATG GCCGAGGACG GCCCCATCCT GGAGAAGGAG
TACGGCATCC CGCGGCGCTA TTTGAACCGC ATCCTGTCGC CCTGGGCCGT GAAACGGCTC
GACGAGTACG GCGGCGACAT CCGACAGTTC AGAGTCGTGA AGCGCTATCC GTCGATCCTG
AAGCAGGTCG GCGTTGCCAA GACCGAGCCG GGCGACGAGA ACAACCAGGA CATCAGCTCG
CTGGTCGGCA AGGTCGACAT CCGCAAGCTC GAGACCTATG CACAGGACGA TCCCGACGCC
TACAGCTACT CCGGCGGCCT GTGCCTGGCC AACCAGGGTC TGCTCGAGTT CGTCGAGATG
TTCAAGGCGC CGATCAAGGT GCTGCACCCG CTGCTGACGG CGACCCAGGA AGGCAACTAC
AAGGGGACCG AGGGTTTCGG CGCGATCCCC TTCGACGGCA TCATCCTCGC GCACAGCAAC
GAGAGCGAGT GGAAGACCTT CCGCAACAAC AAGAACAACG AAGCCTTCCT CGACCGCATC
TACATCGTCA AGGTGCCGTA CTGCCTGCGC ATCACTGAAG AGATCAAGAT CTACGACAAG
CTGATTCGAG GCTCCTCGCT CAGCGAGGCG AAGTGCGCGC CCGGCACATT GAAGATGATG
GCGCAGTTCG CGGTGCTGTC GCGCCTGAAG GAGCCGGAGA ACTCTTCGCT GTTCAGCAAG
GCGCTGGTCT ATGACGGTGA GAGCCTCAAG GACACCGATC CCAAGGCCAA GAGCTACCAG
GAGTACCGCG ACTACGCCGG CGTCGACGAG GGCATGAGCG GCATCTCGAC GCGCTTTGCC
TTCAAGATCC TGTCCAAGGT GTTCAACTTC GACTCGTCGG AAGTCGCAGC GAACCCGGTG
CATCTGATGT ACGTGCTGGA GCAGCAGATC GAGCGCGAGC AGTTCCCCAC CGAGACCGAG
CAGAAGTACC TCGGCTTCAT CAAGGAGTTC CTGGCGGCGC GCTACGCCGA GTTCATCGGC
AAGGAGATCC AGACCGCCTA CCTCGAGAGC TACTCCGAGT ACGGCCAGAA CATCTTCGAC
CGCTACGTGA CCTACGCCGA CTACTGGATC CAGGACCAGG AGTACCGTGA CACCGACACC
GGCGAGGTGT TCGACCGGGG CTCGTTGAAC GCCGAGCTCG AGAAGATCGA GAAGCCGGCG
GGCATCGCCA ACCCGAAGGA CTTCCGCAAC GAGATCGTCA ACTTCGTGCT GCGCGCCCGT
GCCAACAATG CCGGCAACAA CCCGTTGTGG ACGAGCTACG AGAAGCTGCG CACGGTGATC
GAGAAGAAGA TGTTCTCGAA CACCGAGGAA CTGCTGCCGG TGATCAGCTT CAACGCGAAA
GCCAGTGCCG ACGAGGCGAA GAAGCACGAG GACTTCGTGA ACCGCATGGT GCAGAAGGGC
TACACGCCCA AGCAGGTGCG CCTGCTGTGC GAGTGGTACC TGCGTGTGAG AAAGAGTTCA
TGA
 
Protein sequence
MDVISSFAAR YERSREEEFT LEEYLDICRR EPVAYATAAE RMLKAIGEPE LVDTRNDPRM 
SRLFANKVIK RYPAFAEFYG MEDSIEQVVS YFRHAAQGLE EKKQILYLLG PVGGGKSSIA
ERLKHLMQEV PFYAIKGSPV NESPLGLFDM AEDGPILEKE YGIPRRYLNR ILSPWAVKRL
DEYGGDIRQF RVVKRYPSIL KQVGVAKTEP GDENNQDISS LVGKVDIRKL ETYAQDDPDA
YSYSGGLCLA NQGLLEFVEM FKAPIKVLHP LLTATQEGNY KGTEGFGAIP FDGIILAHSN
ESEWKTFRNN KNNEAFLDRI YIVKVPYCLR ITEEIKIYDK LIRGSSLSEA KCAPGTLKMM
AQFAVLSRLK EPENSSLFSK ALVYDGESLK DTDPKAKSYQ EYRDYAGVDE GMSGISTRFA
FKILSKVFNF DSSEVAANPV HLMYVLEQQI EREQFPTETE QKYLGFIKEF LAARYAEFIG
KEIQTAYLES YSEYGQNIFD RYVTYADYWI QDQEYRDTDT GEVFDRGSLN AELEKIEKPA
GIANPKDFRN EIVNFVLRAR ANNAGNNPLW TSYEKLRTVI EKKMFSNTEE LLPVISFNAK
ASADEAKKHE DFVNRMVQKG YTPKQVRLLC EWYLRVRKSS