Gene Mpe_A1792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1792 
Symbol 
ID4784459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1927268 
End bp1929649 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content70% 
IMG OID640090363 
Productputative transcription accessory protein 
Protein accessionYP_001020986 
Protein GI124266982 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.781108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.584186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACAAGA TCCTGCTGCA AATCGCCGCC GAGCTGAAGG TTCGGCCGGC CCAAGTCAAC 
GCTGCCGTCG AACTGCTCGA CGGCGGGGCC ACGGTGCCCT TCATCGCGCG CTACCGCAAG
GAGGCGACCG ACAACCTCGA TGACACCCAG CTGCGCGACC TGGAAGCGCG CCTGGGGTAC
CTGCGCGAGC TCGAGGAGCG CCGCGCCGCC GTGCTGAAGA GCATCGACGA GCAGGGCAAG
CTCACGCCCG AACTGCGCGC CGCGATCGAC GCCGCGCCGA CCAAGCAGGA GCTGGAAGAC
CTCTACCTGC CCTTCAAGCC GCGGCGTCGC ACCAAGGGCA TGATCGCCCG CGAGGCCGGC
ATCGAGCCGC TGGCCGACCG CCTGTTCGCC GACCCCACGC TCGACCCGCT GGTCGAGGCC
GCCGCCTTCG TGGCCGGCTA TGCGGGCGGC GCCGAGGCGC TGGCGACCGC CGGCTTCGCC
GATGTCCACG CGGTGCTCGA CGGCGTGCGC GACCTGCTGT CCGAACGCTG GGCCGAGGAC
GCCGCACTGG TGCAGTCGCT GCGAGGCTGG CTGTGGGACG AGGGGCTGCT GCGCTCCAAG
CTGATGGATG GCAAGAACGA GCAGGACGCG GAGATCGCGA AGTTCCGCGA CTACTTCGAC
TACGACGAGC CCATCCGCAC CGTGCCCTCG CACCGTGCGC TCGCGGTGTT CCGAGGTCGC
ACGCTGGAGA TCCTGGACGC GAAGCTGGTG CTCGACGAGG AGGCGCTGCC GGGCAAGCCG
ACGCTCGCCG AAGGGCGCAT CGCGGTGCAT CTGGGCTGGA GCCACGCCAA GCGCCCGGCC
GACGAGCTGA TCCGCAAGTG CGTGGCCTGG ACCTGGAAGG TCAAGCTCAG CCTGAGCCTG
GAGCGCGACC TGTTCGCGCG GCTGCGCGAG GATGCCGAGA AGGTGGCGAT CAAGGTCTTC
GCCGAGAACC TGCGCGACCT GCTGCTGGCC GCGCCGGCCG GCAAGCGCGT GGTGATGGGC
CTGGACCCCG GCATCCGCAC CGGCGTGAAG GTGGCGGTCG TCAGCGACAC CGGCAAGGTG
CTCGACACGG CCACGGTGTA CCCGCACGAG CCCCGCAAGG ACTGGGACGG CGCGATCCAC
ACGCTGGGCC GGCTGGCCGC GACCCACGGC GTCAACCTGA TCGCCATCGG CAACGGCACC
GCGAGCCGCG AGACCGACAA GCTGGCGGCC GACCTGATCA AGCGCATCCA GCAACTGGCC
CCTGGCACGC ACATCGAGAA GGTGGTCGTC AGCGAGGCGG GCGCTTCGGT CTACTCGGCC
TCGGAGTTCG CCAGCAAGGA ACTGCCGGAG CTCGACGTGA GCCTGCGCGG AGCGGTCTCG
ATCGCACGGC GGCTGCAGGA CCCGCTGGCC GAGCTGGTGA AGATCGACCC CAAGAGCATC
GGCGTCGGCC AGTACCAGCA CGACGTGAAC CAGAGCGAAC TCGCGCGCAC GCTCGACACG
GTAGTGGAGG ACTGCGTCAA CTCGGTCGGC GTCGATCTGA ACACCGCGTC GGCGCCGCTG
CTGTCGCGCG TATCCGGGCT GTCGGGCGCG GTCGCGGCCG GCATCGTGCG TTGGCGCGAT
ACGCACGGCG CCTTCCGCAA CCGCCGCCAG CTGCGCGAGG TCGCGGGCCT GGGTGAGAAG
ACCTTCGAGC AGGCGGCCGG CTTCCTGCGC ATCCGCGACG GCGACAACCC GCTCGACCTG
TCGGGCGTGC ACCCCGAGAC CTACCCGGTG GTCGAGAAGA TCATCGCCGC CGTCGGCCGA
CCCGTCGGTG ATCTGATCGG CAACAGCGAC GTGATCCGCA AGCTGCGGCC CGAGGCTTAC
GCCGACGAGC GCTTCGGCGC GATCACCGTG AAGGACATCC TCGCCGAGCT GGAGAAGCCT
GGCCGCGACC CACGCCCGGA CTTCAAGGTG GCGCGCTTCA ACGAGGGCGT CGACGACATC
AAGGACCTGC AGGCCGGCAT GACGCTGGAA GGCACGGTCA GCAACGTGGC GCAGTTCGGC
GCCTTCGTGG ACCTGGGCGT CCACCAGGAC GGGCTGGTAC ACGTGAGCCA GCTGGCGAAC
AAGTTCGTCA ACGACGCGCG CGAGATCGTG AAGACCGGCG ACATCGTCAA GGTCAAGGTG
CTCGAGGTCG ATCTGGCGCG CGGCCGCATC TCGCTGACGA TGAAACTCGA CACCAGCGTG
GCGCGCGGAC GCGACGGACG CGGCGAGGGC AGCGAGAACG GCTACCGGCC GGCCGGACGC
GATGAGCGTG CCCGCAGCGG TGCGCCGCGC GGCAGCTCAC CCCAGTCCGC CGCCGGCGGC
GCGATGGCCG CCGCGTTCGC GAAGCTGCAG TCGCGGCGCT GA
 
Protein sequence
MDKILLQIAA ELKVRPAQVN AAVELLDGGA TVPFIARYRK EATDNLDDTQ LRDLEARLGY 
LRELEERRAA VLKSIDEQGK LTPELRAAID AAPTKQELED LYLPFKPRRR TKGMIAREAG
IEPLADRLFA DPTLDPLVEA AAFVAGYAGG AEALATAGFA DVHAVLDGVR DLLSERWAED
AALVQSLRGW LWDEGLLRSK LMDGKNEQDA EIAKFRDYFD YDEPIRTVPS HRALAVFRGR
TLEILDAKLV LDEEALPGKP TLAEGRIAVH LGWSHAKRPA DELIRKCVAW TWKVKLSLSL
ERDLFARLRE DAEKVAIKVF AENLRDLLLA APAGKRVVMG LDPGIRTGVK VAVVSDTGKV
LDTATVYPHE PRKDWDGAIH TLGRLAATHG VNLIAIGNGT ASRETDKLAA DLIKRIQQLA
PGTHIEKVVV SEAGASVYSA SEFASKELPE LDVSLRGAVS IARRLQDPLA ELVKIDPKSI
GVGQYQHDVN QSELARTLDT VVEDCVNSVG VDLNTASAPL LSRVSGLSGA VAAGIVRWRD
THGAFRNRRQ LREVAGLGEK TFEQAAGFLR IRDGDNPLDL SGVHPETYPV VEKIIAAVGR
PVGDLIGNSD VIRKLRPEAY ADERFGAITV KDILAELEKP GRDPRPDFKV ARFNEGVDDI
KDLQAGMTLE GTVSNVAQFG AFVDLGVHQD GLVHVSQLAN KFVNDAREIV KTGDIVKVKV
LEVDLARGRI SLTMKLDTSV ARGRDGRGEG SENGYRPAGR DERARSGAPR GSSPQSAAGG
AMAAAFAKLQ SRR