Gene Mpe_A3762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3762 
SymbolpilU 
ID4785991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3981338 
End bp3982474 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content65% 
IMG OID640092345 
ProductPili biogenesis ATPase 
Protein accessionYP_001022950 
Protein GI124268946 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5008] Tfp pilus assembly protein, ATPase PilU 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0879223 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGCG ATCAAGCGTC GAAATTCGTC AACGACCTGC TGCGCCTGCT GGTCGCCCGC 
AACGGCTCCG ATCTGTTCCT GACGGCCGAC TTTCCTCCGG CGGTCAAGGT CGACGGCCGC
GTCACCAAGG TATCGCCGCA GCCGCTGACG CCGCAACACA CGATGGCCCT GGCGCGTTCG
ATCATGAACG ACAAGCAGGC GGCCGAGTTC GAACGCACCA AGGAGTGCAA TTTCGCGATC
GCGCCGACCG GCATCGGCCG CTTCCGGGTC AATGCCTTCG TGCAGCAGGG CTGCGTGGGC
CTCGTGCTGC GGACGATCCC GCAGACGCTG CCGACGATCG ACTCACTCGG CCTGCCGCAG
GTGCTGAAGG ACGTCGCCTC GACCAAGCGC GGGCTGGTGA TCTTCGTCGG CGCCACCGGC
TCGGGCAAGA GCACCTCGCT GGCCGCGATG GTGGACTACC GCAACGAGAA CTCCTACGGC
CACATCATCA CCATCGAGGA CCCGGTGGAG TTCGTGCACC CGCACAAGAA CTGCATCGTG
ACGCAGCGTG AGGTCGGCAT CGATACCGAC GACTGGGCCC CGGCGCTGAA GAACACGCTG
CGCCAGGCGC CCGACGTGAT CCTGATGGGC GAGATCCGCG ACCGCGAGAC CATGGAACAC
GCGGTGGCCT TCGCCGAGAC CGGCCACCTG TGCATGGCGA CGCTGCACGC CAACAGCGCC
AACCAGGCGC TCGACCGCAT CATCAACTTC TTCCCCGAGG AGCGGCGCGC GCAGCTGCTG
ATGGACCTGT CGCTGAACCT CAAGGCGCTC GTGTCGCAGC GCCTGCTGGC GCGCCAGGAA
GGCCGTGGCC GCGTCGCGGC GATCGAGATC CTGCTGAACA CGCCGTTGAT CTCCGACCTG
ATCTTCAAGG GCGAGGTGGC CGAGATCAAG GAAATCATGA AGAAGAGCCG CGAGCTCGGC
ATGCAGACCT TCGACCAGAG CCTGTTCGAC CTGTACGAGG GCCAGCTGGT CACCTACGAG
GATGCCCTGC GCAACGCCGA TTCGGTCAAC GACCTGCGTC TGCAGATCAA GCTCAACAGC
AACCGGGCGC GCAACTCCGA CCTGGCCTCG GGCACCGAGC ACCTGACCAT CGTCTGA
 
Protein sequence
MERDQASKFV NDLLRLLVAR NGSDLFLTAD FPPAVKVDGR VTKVSPQPLT PQHTMALARS 
IMNDKQAAEF ERTKECNFAI APTGIGRFRV NAFVQQGCVG LVLRTIPQTL PTIDSLGLPQ
VLKDVASTKR GLVIFVGATG SGKSTSLAAM VDYRNENSYG HIITIEDPVE FVHPHKNCIV
TQREVGIDTD DWAPALKNTL RQAPDVILMG EIRDRETMEH AVAFAETGHL CMATLHANSA
NQALDRIINF FPEERRAQLL MDLSLNLKAL VSQRLLARQE GRGRVAAIEI LLNTPLISDL
IFKGEVAEIK EIMKKSRELG MQTFDQSLFD LYEGQLVTYE DALRNADSVN DLRLQIKLNS
NRARNSDLAS GTEHLTIV