Gene Mpe_A1569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1569 
Symbol 
ID4785619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1692440 
End bp1695994 
Gene Length3555 bp 
Protein Length1184 aa 
Translation table11 
GC content70% 
IMG OID640090137 
Producttranscription-repair coupling factor 
Protein accessionYP_001020766 
Protein GI124266762 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1197] Transcription-repair coupling factor (superfamily II helicase) 
TIGRFAM ID[TIGR00580] transcription-repair coupling factor (mfd) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.878317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATTC TAGAATTCAC CCCGTGGATC GCCCCGCCTG CTCCCGCGGC GGGGCGATCT 
GCTTTCGACT TTCTTCACCG CATGGACCTC CCCGCCCTCG CCCCCGGCAA GCGCTACACC
CTGCCCCGCC CGTTCGGCTC GGCCGATGCA CTGCTGCTCG CGCGCTTCGC CGAGCAGCGC
GCCGCGCAGG GGCAGATCAC CGCCGTCATC AGTGCCGAAC CGGCCGACAC GCAGCGGCTG
CAGGACGAAC TGGCCTTCTT CGCGCCAGGC TTGCGCGTCG CCGTGTTCCC CGACTGGGAG
ACGCTGCCCT ACGACAGCTT CTCGCCGCAC CAGGACCTGA TCTCCGAGCG GCTGGCCACG
CTGTGGCGCA TCCTGCATGC GCGGAGCGAG GGCGCGATCG ACGTGGTGCT GATGCCGGCG
ACCACGGCAT TGCTGCGGCT CGCGCCGCCG TCCTTCCTGG CGGCCTACAC CTTCCACTTC
GTTCAGAAGC AGAAGCTCGA CGAGGCGGCG CTGAAGGCCC AGCTCACGCT GGCCGGCTAC
CAGCACGTGA GCCAGGTGGT GTCGCCCGGC GAGTACGCGG TGCGCGGCGG GCTGATTGAC
CTGTTCCCGA TGGGCTCCCC GGTGCCCTAT CGCGTGGACC TGTTCGACAA CGAGGTCGAC
TCGATCCGCA CCTTCGACCC CGACAGCCAG CGCAGCCTCT ACCCGGTGCC CGAGGTGCGG
CTGCTGCCGG GCCGCGAGTT CCCGATGGAC GAGGCGGCCC GCACCGGTTT CCGCGCCCGC
TGGCGCGAGA AGATGGAAGG CGACCCGAGC CGCTCGCGCC TCTACAAGGA CATGGGCACC
GGCCTGGCCG GTCCCGGCAT CGAGTTCTAC CTGCCGCTGT TCTTCGACGA GACGGCGACG
GTGTTCGACT ACCTCGGCGC CGCGGCCGAA CTGGTGCTGC ACGGCGAGGT CGATGCAGCG
CTGAACAAGT TCTGGGGCGA GACGCGCGAG CGCCACCGCT TCCTGCAGCA CGACCCCGAG
CGGCCGATCC TGCCGCCGCA GGAGATCTAC CTGCCGCCCG ACGCCTTCTT CGCGCGCTGC
AACGACCATG CGCAGCTGTC GCTGCGCGGC AGCGACCCGC TCGAATGGGT GCGCCCGCTG
CCCGATCTGG CCGTCGAGCG CGGCACGCCC GACCCGCTGC GCAAGCTGGA GCAGCACCTC
GCGAAGCAGG GCACCGACAG CACCGGCCCG CGCGTGCTGC TCGTCGCCGA GAGCGAGGGC
CGACGCGAGA GCCTGCTGGA GCTGCTGCGC GACCACCAGC TGGAGCCGCC GACGGTCGCT
TCGCTCGCGG ACTTCGAGGC CGGCGATCAC CGCTACGCGA TCACCGTCGC GCCGCTGGCG
AGCGGCTTCT TCTGGGTCCG TGGGTCGGGC ACGCCCGGCA TCCAGTTCGT CACCGAGACC
GAGCTGTTCG ACGCCGCACC CACGGTACGG CGCCGCCGCA AGCAGGAGCA GACCAGCGAT
GTCGACGCGC TGATTAAGGA CCTGTCGGAG CTCAAGGTCG GCGACCCGGT GGTGCATGCC
AACCACGGCA TTGGCCGCTA CGTGGGGCTG GTCAACATCG ACCTGGGCGA CGGGCCGAGC
GAGTTCCTGC ACCTCGAGTA CGCCGACAAG GCGACGCTCT ACGTGCCGGT CGCGCAGTTG
CAGCTCATCA GCCGCTACAC CGGCGTCAGC GCCGAGGAGG CGCCGCTGCA CCGGCTGGGT
TCCGGCCAGT GGGAGAAGGC CAAGCGCAAG GCGGCCGAGC AGGTGCGCGA TACCGCCGCC
GAGCTGCTGA ACCTGTACGC GCGCCGCGCC GCCCGTGAAG GCCATGCCTT CCGCTTCTCG
CCGCAGGACT ACGAGGCCTT CGCCGCCAGC TTCGGCTTCG AGGAAACGGC CGACCAGCGC
GCCGCGATCC ACGCGGTGAT CCAGGACCTG GTCAGCCCCA AGCCGATGGA CCGTCTGGTG
TGCGGCGACG TCGGCTTCGG CAAGACCGAG GTGGCGCTGC GTGCTGCCTT CGTCGCGGTC
ACCGGCGGCA AGCAGGTGGC GCTGCTGGCG CCGACCACGC TGCTGGCCGA GCAGCACTAC
CAGAACATCG CCGACCGCTT CGCCAAGTGG CCGGTCAAGG TGGCCGAGAT GTCGCGCTTC
CGCTCGGCCA AGGAGATCAA GGCGGCGATG GCCGGACTGG CCGAGGGCAC GATCGACATC
GTGGTCGGCA CACACAAGCT GCTGAGCCAG GACACCAAGT TCGCCAACCT CGGGCTGCTG
ATCATCGACG AGGAGCACCG CTTCGGCGTG CGCCACAAGG AAGCCATGAA GGCGCTGCGC
GCCGAGGTCG ACGTGCTGAC CCTCACCGCC ACGCCCATCC CGCGCACCCT GGGCATGGCG
CTCGAGGGCC TGCGCGACCT GAGCGTGATC GCCACCGCGC CACAGCGCCG GCTCGCCATC
AAGACCTTCG TGCGCGGCGA GTCCAACGGC ACCATCCGCG AGGCGGTGAT GCGCGAGCTG
AAGCGCGGCG GTCAGGTCTA CTTCCTGCAC AACGAGGTGG AGACCATCGA GAACCGCCGC
CGCACGCTGG AGGAACTGCT TCCGGAAGCG CGCATCGCGG TCGCCCATGG CCAGATGCCC
GAGCGCGAGC TGGAGCGGGT GATGCGCGAG TTCGTCGCGC AGAAGCACAA CCTGCTGCTG
TGCTCGACCA TCATCGAGAC GGGCATTGAC GTGCCCACCG CCAACACCAT CGTGATGAGC
CGCGCCGACA AGTTCGGCCT GGCACAGCTG CACCAGCTGC GCGGGCGCGT CGGCCGCTCG
CACCACCAGG CCTATGCCTA CCTGCTGGTG CCCGATGTGG AAGGCCTCAC CAAGCAGGCC
GCGCAGCGCC TGCAGGCGAT CCAGGACATG GAGGAGCTCG GCTCGGGCTT CTACCTCGCG
ATGCACGACC TGGAGATCCG CGGCGCTGGC GAGGTGCTGG GCGAGAACCA GAGCGGCAAC
ATGATGGAGG TCGGCTTCCA GCTCTACAAC GAGATGCTGG CCACCGCAGT GCGCGAGATG
AAGGCCGGCC GCGAGCCTGA TCTGCTGAAC CCGGTGAACG TGGCCACCGA CGTCAACCTG
CACGCGCCGG CGCTGCTGCC CGACGCCTAT TGCGGCGACG TGCATGTGCG CCTGTCGCTC
TACAAGCGGC TGGCCAGCGC CGACCGGCTG GACAAGATCG ACACCATGCT CGAGGAGATC
GTCGACCGCT TCGGCAAGCT GCCGCCGCAG GCGCAGACGC TGTTCGACGT GCACCGCCTG
CGCGTGCAGG CGCGCGCCTA CGGTGTGCTC AAGATCGACG CCGGCCCGCA GGCCATGAGC
ATCGGCTTCC GGCCCGACGC GCCGGTCGAC GCGCTGCGCA TCATCGAGCT GGTGCAGAAG
AACCGCCACA TCAAGCTGGC CGGCAACGAC AAGCTGCGCA TCGAGAAGGC CCTGCCTGAT
CCGATGGCCC GCGCCCAGTT CATCCGCGAC GTGCTGCGCT CGCTCGGCAC GCCCACCCCC
GCCCTTTCCG CATGA
 
Protein sequence
MPILEFTPWI APPAPAAGRS AFDFLHRMDL PALAPGKRYT LPRPFGSADA LLLARFAEQR 
AAQGQITAVI SAEPADTQRL QDELAFFAPG LRVAVFPDWE TLPYDSFSPH QDLISERLAT
LWRILHARSE GAIDVVLMPA TTALLRLAPP SFLAAYTFHF VQKQKLDEAA LKAQLTLAGY
QHVSQVVSPG EYAVRGGLID LFPMGSPVPY RVDLFDNEVD SIRTFDPDSQ RSLYPVPEVR
LLPGREFPMD EAARTGFRAR WREKMEGDPS RSRLYKDMGT GLAGPGIEFY LPLFFDETAT
VFDYLGAAAE LVLHGEVDAA LNKFWGETRE RHRFLQHDPE RPILPPQEIY LPPDAFFARC
NDHAQLSLRG SDPLEWVRPL PDLAVERGTP DPLRKLEQHL AKQGTDSTGP RVLLVAESEG
RRESLLELLR DHQLEPPTVA SLADFEAGDH RYAITVAPLA SGFFWVRGSG TPGIQFVTET
ELFDAAPTVR RRRKQEQTSD VDALIKDLSE LKVGDPVVHA NHGIGRYVGL VNIDLGDGPS
EFLHLEYADK ATLYVPVAQL QLISRYTGVS AEEAPLHRLG SGQWEKAKRK AAEQVRDTAA
ELLNLYARRA AREGHAFRFS PQDYEAFAAS FGFEETADQR AAIHAVIQDL VSPKPMDRLV
CGDVGFGKTE VALRAAFVAV TGGKQVALLA PTTLLAEQHY QNIADRFAKW PVKVAEMSRF
RSAKEIKAAM AGLAEGTIDI VVGTHKLLSQ DTKFANLGLL IIDEEHRFGV RHKEAMKALR
AEVDVLTLTA TPIPRTLGMA LEGLRDLSVI ATAPQRRLAI KTFVRGESNG TIREAVMREL
KRGGQVYFLH NEVETIENRR RTLEELLPEA RIAVAHGQMP ERELERVMRE FVAQKHNLLL
CSTIIETGID VPTANTIVMS RADKFGLAQL HQLRGRVGRS HHQAYAYLLV PDVEGLTKQA
AQRLQAIQDM EELGSGFYLA MHDLEIRGAG EVLGENQSGN MMEVGFQLYN EMLATAVREM
KAGREPDLLN PVNVATDVNL HAPALLPDAY CGDVHVRLSL YKRLASADRL DKIDTMLEEI
VDRFGKLPPQ AQTLFDVHRL RVQARAYGVL KIDAGPQAMS IGFRPDAPVD ALRIIELVQK
NRHIKLAGND KLRIEKALPD PMARAQFIRD VLRSLGTPTP ALSA