Gene Mpe_B0361 details


Gene Information

Locus tag: Mpe_B0361
Symbol: traG
ID: 4787989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008826
Strand:
Start bp: 311837
End bp: 313606
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 66%
IMG OID: 640092793
Product: sex pilus assembly and mating pair formation
Protein accession: YP_001023371
Protein GI: 124262901
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
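
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 311837
end_bp = 313606

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1770
print(protein_length_aa)  # 589
```

Under these assumptions the result agrees with the stated gene length (1770 bp) and protein length (589 aa).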


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.398823
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTTG CATCGATGTA CCGCGCCAAG ATCGGCGGCA CGGTGGACAC GTTGGCGATG 
ACCGCGCTGG GCGCGGGAGC CATCCTGTCG TCGCCGGCGT TCGTCGAGTC CGGGGGCAAG
CTGTCGTTCC TCATGGCGGC GGCCGGCGCC GCGCACCTCG GGCGGCGCTT CCTGCAGCCG
TGGGAGGAGG CCAACGCCCT CGCCTCAAAG ATCAACGTCC GCTCGGCCGA CATGCTGTCC
AGCCCGGACG GCCTGCTGCT GGGCTACCGG ACCGACACCG GCGATCCGGT CTATATCCCC
GACGAAGACC TGATGCGGCA CGGCCTGATC GGCGGCCAGT CGGGTGTCGG CAAGACGGTG
CTCGGCAAGC TGTTGATGTA CCAGCAGATC CAGCGCGGCG GCGGCCTGGC GTTCGTCGAC
GGCAAGATGA ACGCCGAAGA CATCGAGACG ATCTACCAGT ATTGCTGCCT GTGCGGCCGC
GAGCAGGATC TGCTCATCCT GAACCCGGGC AATCCGGGCA TGAGCCACTC CTACAACCCG
ATCCTGCGCG GCGACCCGGA CGAGATCGCG GCGCGCGTGC TCTCGCTCAT CCCCTCGACC
GAGAACAACC CCGGCGCCGA CCACTACCGC CAGTCGGCCA ACCAAGGCCT GACCACGCTG
GTCGCAGCGC TGCAGGAGGC CGGCCTGGCC TTCAACTTCA TCGACCTGAC CATCCTGCTG
ATGAACCACA AGGCCATCGA GGAGCTGGAG TCGCGGCTGA AGCGCAGCAA GGGCGGCTCG
GCGGCGACGA AGAACCTCAG CCTGTTCCTC GAGCAGTACA AGGGCGGCGG CAAGCCCGGC
AGCGGCCTGG AGAACATGGT CGACATCAAG CGCATGAAGG AGACCTTCGG CGGCGTCGGC
GGCCGGATGT TCCAGTTCGG CACGGGCAAG TTCGGCGAGG TGCTCAACAC CTACTCGCCG
GAGATCGATC TGTTCGAGGC GATCCGCCAG AACAAGATCA TCTACGTGGC CCTGCCGACC
ATGGGCAAGA ACGAGGCGGC GGGCAACATG GGCAAGATGT TCCTGGGCGA TCTGCGCACG
GCGATCTCCT GGGTGCAGGC GCTGCCGGAA GACCAGAAGC CCCGGATCCC GTTCCTGGCC
TTCATGGACG AGCTCGGCTC GTACGCAGTC GCATCGCTGG CGCGCCCGTT CGAGCAGGCG
CGCTCGGCGC GGATCGCCCT CTTCCCGGCT TTCCAGACGC TGGCCAACCT GGAGGTCGTG
TCCCCCGACT TCGCCCAGAT GGTGCTGGGC AACACCTGGA CCAAGATCTT CTTCAAGCTG
GGCACGCAGG AGACCGCCGA ACCGGTGGCC GACCTGATCG GCAAGGAGAT GCAGATCGCG
AGGTCGATGA CGCACACGAA CAACCGCAGC GAGAACAACC CGCTGCTGGC CGTGGCGCCG
GAGGGCGGGG AAGGCGAAGG CATGGGCGTG TCCGAGGGTG AGCGCGAGGA AGAGCGCCAC
CGCGTCAGCC CCGAAGACCT GAAGGCGCTG GACAAGGGCG AGTGCGTAGT CCTGTACGGC
GGCGATGCCG TCTTCAACAT CCGCGTGCCG ATGATCTACG TCGACCCCGA GGTCGCGCGC
GAGATCGGGC CGCTTCGCAT TCACCACCGG CGCGAGCGCC AGGTCGAAGG CGCCGACTAC
TTCAAGAACA GCGATCGGTA CCTGGGCGGC AACTCCGTTC CGTCCGGGCA CCGCAAGCAG
ACGGCTCAGG AAATGCTGGC CGACGAGTAA
 
Protein sequence
MSFASMYRAK IGGTVDTLAM TALGAGAILS SPAFVESGGK LSFLMAAAGA AHLGRRFLQP 
WEEANALASK INVRSADMLS SPDGLLLGYR TDTGDPVYIP DEDLMRHGLI GGQSGVGKTV
LGKLLMYQQI QRGGGLAFVD GKMNAEDIET IYQYCCLCGR EQDLLILNPG NPGMSHSYNP
ILRGDPDEIA ARVLSLIPST ENNPGADHYR QSANQGLTTL VAALQEAGLA FNFIDLTILL
MNHKAIEELE SRLKRSKGGS AATKNLSLFL EQYKGGGKPG SGLENMVDIK RMKETFGGVG
GRMFQFGTGK FGEVLNTYSP EIDLFEAIRQ NKIIYVALPT MGKNEAAGNM GKMFLGDLRT
AISWVQALPE DQKPRIPFLA FMDELGSYAV ASLARPFEQA RSARIALFPA FQTLANLEVV
SPDFAQMVLG NTWTKIFFKL GTQETAEPVA DLIGKEMQIA RSMTHTNNRS ENNPLLAVAP
EGGEGEGMGV SEGEREEERH RVSPEDLKAL DKGECVVLYG GDAVFNIRVP MIYVDPEVAR
EIGPLRIHHR RERQVEGADY FKNSDRYLGG NSVPSGHRKQ TAQEMLADE
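
The gene and protein sequences above can be spot-checked against each other by translating a few codons. The sketch below is a minimal illustration rather than the database's own method: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons), and it translates only the first and last rows of the gene sequence. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino-acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; spaces and newlines are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First and last rows of the gene sequence, copied from the record.
first_row = "ATGTCGTTTG CATCGATGTA CCGCGCCAAG ATCGGCGGCA CGGTGGACAC GTTGGCGATG"
last_row = "ACGGCTCAGG AAATGCTGGC CGACGAGTAA"

print(translate(first_row))  # MSFASMYRAKIGGTVDTLAM
print(translate(last_row))   # TAQEMLADE*
```

The first row yields the opening 20 residues of the protein sequence, and the last row yields its final nine residues followed by the stop codon (shown as `*`). The last row can be translated on its own because the 1740 bases before it are a multiple of three, so it starts in frame.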