Gene Mpe_B0371 details

Gene Information

Locus tag: Mpe_B0371
Symbol: traC
ID: 4787999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008826
Strand:
Start bp: 320102
End bp: 322531
Gene Length: 2430 bp
Protein Length: 809 aa
Translation table: 11
GC content: 64%
IMG OID: 640092803
Product: F-pilin subunit assembly into extended F pili
Protein accession: YP_001023381
Protein GI: 124262911
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3451] Type IV secretory pathway, VirB4 components
TIGRFAM ID: [TIGR02746] type-IV secretion system protein TraC
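
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(320102, 322531)
print(length_bp)                  # 2430
print(protein_length(length_bp))  # 809
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields in the record.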


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.221651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGGAT TCCTGAGGCA AGAGCGGGAC AACGTCCCAT TCGCCAATCT CACTGTGATG 
GCGCATGACC CAGCCACGCG GTTGTTTCAC ATGACCGACG GCGAGGTCAG CTATCTCGGC
GCCTGCTTCG TTGGTGACCC GCTGACCGGC GCAGACCAGT CGACCGTGGA CAAGTTCAGC
TCCGCCTTCG GAATGCCTTT CCCGGCCGGT ACCTTCGTAC AGATCGGCCT GCTGTCGACG
CCCGACGTCG ACGAGTATCT CGACGCCTAC GTCGGCGCGA AGGCGCAAGA CGGCGTCCTC
GGCGCTCTCG CTGCTCGCCA CCGCGACCTC ATCTCCTCGG GGCGCGAGAA GCCGCTGGTA
TCGCGCTCGG GCATCTACCT GCACCGGCAG CGCCTGGTGA TCACGATCAA GACGCCGCTC
CACAACAATC GACCGACCGA GGCCGACATC TCTTCGGCCA AGGAGTATGC CGACCGCCTG
CAGGAAGCGC TCAAGGCCTC CGGCCTGCAC ATGGAGGCCC TGGACGCGCC GCGCTACGTC
GCGCTCATGC GCCTGATGAC GCACCCTCAC GACCCCATCG ACGACCGCTA CGACGCGCAC
AAGCCGATCC GCGAGCAGAT CTTCTTCCCC GGCGATTCGG TCAGCTACGA CGACAAGTCG
ACGATCAGCT TCCACGACGG CAAGCACTTC GCCAAGCTGC TGAGCGTGAA GCACTTCCCG
AAGCGCGCAA CGCTCGGGCT CATGAACTTC ATCGTCGGAG AGCCGCAGGG GCTGTCGAAC
CAGATCACGG AGCCCTACTG GATCACGCTG ACGCTGCACT ACCCCGACCA GGTCAAGAAG
GCCGACTGGG TGCGCGGCCG CTCGGCCATG ATCAATCACC AGGTCTTCGG CCCGACGGCA
CACATGATCC CGGTGCTCGG CTACAAGAAG GCAGGCATCG ACACCCTGGT GCACGAGATG
GAGGGACGTG GCGCGGTCCT CTGCGACGTC AACTTCTCGA TCTTCCTGAT CTCACGAGAC
AGGAGCCGTC TCAACAAGTC GATCGCCGGC CTGCAGGCCT ACTACTCGTC GCTCGCCTTC
GAGTTCCGCG AGGACAGCCG CATCCTCGAG ACGATGTGGG ACAACCTGCT GCCGCTGAAC
GTCAGTGCGC CGGCGATCGA GAAGACCTTC CGTTTCCACA CCATGGCCGT GTCGCAGGCC
GTCTGCTTCC TGCCGATCAT CGGCGAGTGG CGCGGCAGCG GCGTGGGCCG CTCGGTTCTG
ATGGTTACGC GACGCGGACA GCCGGCCCTG TTCGACCTCT ACGATTCGTC GACCAACTAC
AACGGGGTCG TGTTCGCCGA GTCCGGCGCC GGCAAGTCGT TCTTCACACA GAAGCTGGTC
TCGGACTACC TGGCGGAGGG CGCCAAGGTG TGGGCGATCG ACGTCGGCCA CAGCTACAAG
AAGCTGTGCC GCTCGGTGGG TGGCGAGTTC GTCGAATTCC ACGCCGAGAG CAGGATCTGC
CTCAACCCGT TCACGAACAT CGACGGCAAC CTGGACGAGG AGATGGACAT CCTCAAGGCG
ACCATCGCCA AGATGGCAGC GCCCGAGGAG TCGCTGTCCG ACTACCAGAT GGCCATCCTC
GAGCAGGCGA TCACCAGCGT CTACACGAAG TTCGGCAACA AGGCCAACGT CATCGCGATC
GTCGAGTTCC TGATGGCGCA AACGGACAGC GAGGCGCACC GCCTGGCGAA GCAGCTCTAT
CCGTTCGCAG GCGGCGCCTA CACGCGCTGG TTCGATGGCG ACAACAATCT CGACCTCGAC
AACGCGTTCG TCGTGCTGGA GCTCCAGGAC CTCAAGGGCC GCAAGGCGCT CCAGCAAGTG
GTGCTGCTGC AGCTCATCTC GCGCATCAAC CACGAGATCT ACCGCACGCA CGGCCGCAAG
AAGATCCTGA TCATCGACGA GGCCTGGGAA CTGCTGGACG ACCCGCTTAT GGCCAAGGCT
ATGGAAGCGG CCTATCGCAA GGCACGGAAG CACGACGGCG CTGTTCTGGT CGTGACCCAG
TCGCTGGCCG ACCTGTACAA CTCACCCAAC TCGCGCGCGA TCGTGGCGAA CTCGGCCTGG
CAGTTCATCC TGAAGCAGAA CGGCGAGGCC GTCGACGCGG CGATCGATGG TGGCCAGTTC
AAGATCGAGC CGTACGGCGC GTACATGCTG AAGACCGTGC ACACCGTGAG GGGTGCCTAC
TCGGAGGTCA TGGTCAAGCG CGGCGACAAC AGCTGGGGGA TCCTGCGCCT CGTGGTCGAC
CGGTTCACCC AGGTCATGTT CTCGACCAGC GGCGCCGAGC GTGATCAGAT CCTCGGCGCA
ATCGATCGCG GGGAGGATGT CGTCGAGGCC GTGGATGCGT ACATCGCCAG CGAGAACGAT
CGGGCCGGCA GAGAGCTGGA GGCCGCCTAA
 
Protein sequence
MFGFLRQERD NVPFANLTVM AHDPATRLFH MTDGEVSYLG ACFVGDPLTG ADQSTVDKFS 
SAFGMPFPAG TFVQIGLLST PDVDEYLDAY VGAKAQDGVL GALAARHRDL ISSGREKPLV
SRSGIYLHRQ RLVITIKTPL HNNRPTEADI SSAKEYADRL QEALKASGLH MEALDAPRYV
ALMRLMTHPH DPIDDRYDAH KPIREQIFFP GDSVSYDDKS TISFHDGKHF AKLLSVKHFP
KRATLGLMNF IVGEPQGLSN QITEPYWITL TLHYPDQVKK ADWVRGRSAM INHQVFGPTA
HMIPVLGYKK AGIDTLVHEM EGRGAVLCDV NFSIFLISRD RSRLNKSIAG LQAYYSSLAF
EFREDSRILE TMWDNLLPLN VSAPAIEKTF RFHTMAVSQA VCFLPIIGEW RGSGVGRSVL
MVTRRGQPAL FDLYDSSTNY NGVVFAESGA GKSFFTQKLV SDYLAEGAKV WAIDVGHSYK
KLCRSVGGEF VEFHAESRIC LNPFTNIDGN LDEEMDILKA TIAKMAAPEE SLSDYQMAIL
EQAITSVYTK FGNKANVIAI VEFLMAQTDS EAHRLAKQLY PFAGGAYTRW FDGDNNLDLD
NAFVVLELQD LKGRKALQQV VLLQLISRIN HEIYRTHGRK KILIIDEAWE LLDDPLMAKA
MEAAYRKARK HDGAVLVVTQ SLADLYNSPN SRAIVANSAW QFILKQNGEA VDAAIDGGQF
KIEPYGAYML KTVHTVRGAY SEVMVKRGDN SWGILRLVVD RFTQVMFSTS GAERDQILGA
IDRGEDVVEA VDAYIASEND RAGRELEAA
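
The sequences above are printed in blocks of ten characters, so they need the whitespace removed before any downstream use. The following is a minimal sketch of that cleanup together with a simple GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the exact method the source database used to compute its GC content figure is not stated, so the result may differ slightly from the rounded value in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTTTGGAT TCCTGAGGCA AGAGCGGGAC AACGTCCCAT TCGCCAATCT CACTGTGATG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence block through `clean_sequence` should give a string whose length can be compared against the Gene Length field.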