Gene Mpe_A2944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2944 
SymbolpilV 
ID4784366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3127975 
End bp3129063 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content70% 
IMG OID640091515 
Productfimbrial biogenesis protein 
Protein accessionYP_001022132 
Protein GI124268128 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4967] Tfp pilus assembly protein PilV 
TIGRFAM ID[TIGR02523] type IV pilus modification protein PilV
[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.104726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.987031 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACGC TCGGAGCACG ACATCAAGCC GCCGGCTTCA CCCTGGTCGA GGTGCTGATC 
GCCGTCGTCG TGGTGTCGAT CGGTGCGCTC GCCCTCGGCA GCCTGCAGGT CGCACTGTCT
CGCCATGCGG ATGCCGCCCG CCAGCGCACC GAAGCCACCC AGCTCGCCAT CTCCAGGCTG
GAGGAGTTGC GCAGCTTCGA GCAGGTGCTC ACCGAAGCCG GCAAGCAGGC CTATGCCGAT
CTCCTCAGCG GCAGCGACCA ACCCCTCATC GACAGCAACA CGCGCTTCGA GCGCCAATGG
CAAATACAAG GCACGGCCGA CGATCCCTAT CGCCGCGTCG ACGTGCAAGT CACATGGGCC
GACCGCAGCG GCGACACGCG CCAGACCTTC GTCAGGCTGG GCTCGCTGAT CGCACGCGCG
GAACCTGCCG ACGGCGGCAG CCTGGGCCTG CCCCGGGTCG ACGTCGCCGC ACTGCTGCGT
CCGAAAGGCC GGGCGCTCGA CATCCCGATC GAGGCCGAGC GACTGACCGG CGCGAACCGC
GGCCGCAGCG TTCTGCGCTG GCAGGGTGCC AGCGGCGGCT TCCTGGTGTT CGACGACAGC
TCCGGCGCGG TGATCGCGCA ATGCACGACG GCGCCGGACG ACCACACCGA CATCGCGGCC
ACCTGTACCC CACTGCAGGC ACTGCTGCTG CGCGGCGATC TGTCGGGCTC CTGGGCCGCC
GCCGTGACTG GGCTCTCCTT CAGCGCCACG CAGCATCTGC TGGCGGTGCC CGACTGTCAT
GTGGCCGACG CCGTCAGCCA CAACGACGGT CGGCCCATCG CCGGCGTGCG CTCCTATGCC
TGCCTGATGC GTGCGGGCGA TCACGACACC GATCCGGGCA CGCCACGCGC CTGGTCGGGA
CAGTCCCGCA TCGCGCCCGA GCCCGCCGGC ACGCAGGTCG TGTGCCGCTA CACCACTGCG
CCCTCGACGA CGCGCAACGA GGAGCATCCG GCCCTCTACA GTCTCGTGAC TCGATCCCTT
CACCACCAGA ACTTCCTGCT GCTGGACACC GGGGCCTGCC CGGCCGGGAC CGCGCCGCAC
CAGCCGTGA
 
Protein sequence
MPTLGARHQA AGFTLVEVLI AVVVVSIGAL ALGSLQVALS RHADAARQRT EATQLAISRL 
EELRSFEQVL TEAGKQAYAD LLSGSDQPLI DSNTRFERQW QIQGTADDPY RRVDVQVTWA
DRSGDTRQTF VRLGSLIARA EPADGGSLGL PRVDVAALLR PKGRALDIPI EAERLTGANR
GRSVLRWQGA SGGFLVFDDS SGAVIAQCTT APDDHTDIAA TCTPLQALLL RGDLSGSWAA
AVTGLSFSAT QHLLAVPDCH VADAVSHNDG RPIAGVRSYA CLMRAGDHDT DPGTPRAWSG
QSRIAPEPAG TQVVCRYTTA PSTTRNEEHP ALYSLVTRSL HHQNFLLLDT GACPAGTAPH
QP