Gene Mpe_A3232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3232 
Symbol 
ID4786511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3434768 
End bp3436597 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content72% 
IMG OID640091805 
Producthypothetical protein 
Protein accessionYP_001022420 
Protein GI124268416 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0129967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCTT TCCGATTCGC TGTCCTGGCC CTCGCGGCCT CCAGCGTTCT TGCCGCGACG 
GCGTGGGCCC AGCCCGCTCC GGCGGAAGCG CCGGCATCCG GGCCGGGCGC TGCGGCACCG
GTGGAAAATT CGGTCCTCGA CGCACCGCTG TTCTACCAGT TGCTGATCGG CGAGATGGAG
CTCAGCCAGG GCGATGCCGG TACCAGCTAC CAGGTCCTGC TGGATGCGGC CCGCAAGACC
CGCGACGAGC GCCTGTTCCG GCGCGCCACC GAAGTCGCTT TGCAGGCACA GGCCGGCGAG
CAGGCGCTCG ATGCGGCACG CGCCTGGCGC CAGGCCGTGC CCGGCTCGAT CGACGCCCAC
CGCTTCGAGG TGCAGTTGCT GGTCGCGTTG AATCGCACCG CCGAAACCGT GCAGCCGCTG
CGGGCCACGC TGGCGCTGGT GCCGCCGGCC CAGCGGCCGG CGGCCATCGG GTCCTTGCCT
GGCTATTTCT CGCGCACCAC CGATCGCAAG GCGACCACGG CGGTGCTCGA ACAGGTGCTG
CTGCCCTATG CAGAGCAGCG CTCGGCGGGC GAGCCGCAAG CCACCGCGGT GGCCGCCTGG
GTGGCGCTGG GCCGCAGCCG GCTGGCCGCC GGCGATGCGG CGCGCGCGCT CGACGCCGCG
CGCCGTGGTC AAGCGCTGAG TCCGCGGTCG GAGGCCGTGG CGCTGCTCGC CATTGAACTG
ATGCCGACCG AGCAGCGGGC CGAGTCGCTG GTCACGGCCT TTCTCGAGGC CCAGCCACCG
GCTCCGGCCG CCACGCGCAG CGCGGTGCGG CTGGTCTATG CCCGCACGCT GGCTGTCGCG
CAGCGCTATG CGGACGCCGC GCCGCAGCTC GAGGCGGTGA CGCGCGACGC GCCGCAGTTT
GTCGACGCCT GGCTGACGCT GGGCGCCTTG CGGCTGGAAC TGAAGCAGCC GGCCGAGGCC
GAAGCGGCGC TGCGCGAGTA CCTCGCGCGC CTCGAGGCCG GCAGCGAGGC GGCATCGACC
GAAGCGGCCG ATGCCACGCA GGACGAGGAC GCCGCCACGC CCATGCAGCG TCTGACGCAG
GCCTACCTGC TGTTGGCCCA GGCCGCCGAG CAGCGGCGCG ACTTCAAGGC CGCCGAGGGC
TGGCTGGCCA AGGTCGACAG CTCCCAGGCA CTGACGGTGC AATCGCGGCG CGCATCGTTG
CTGGCGCAGC AGGGCAAGCT CGCCGAGGCG CGCAGCCTGA TCCGCGCGCT GCCGGAACGG
CAGCCGGAGG ACGCGCGCTC CAAGCTACTG GTGGAGGCGC AACTGCTGCG CGACCAGAAG
CAGTGGTCGG AGGCACTGGG CGTCCTGACC GAGGCCAACG ACCGCTACCG CGACGACACC
GACCTGCTGT ACGAGCAGGC GATGATGGCC GAGAAGCTCG ACCGCATGAC CGAGATGGAG
CAGTTGCTGC GTCGCGTGAT CGCGCTGAAG CCGCAGCAGC CGCAGGCCTA CAACGCGCTG
GGCTACTCGC TCGCTGATCG CAACCAGCGC CTGCCGGAAG CGCGCCAGCT GATCATCAAG
GCCCTGGAGC TGTCGCCGGG CGATCCGTTC CTGATCGACA GCCTGGGCTG GGTCGAGTAC
CGGCTCGGCA ACCATGACGA GGCGATCCGC TGGTTGCGGC AGGCACACGG CGCACGACCG
GACACCGAGA TCGCCGCGCA CCTGGGTGAG GTGCTCTGGG TCAGTGGCCG GCGCGACGAG
GCTCGTCGGA TCTGGGCCGA GGCCCGGGCC CGCGACGCCA CGAACGACGT GCTGAAGGAA
ACCCTGGCGC GGCTCAAGGT CGATCTGTGA
 
Protein sequence
MAPFRFAVLA LAASSVLAAT AWAQPAPAEA PASGPGAAAP VENSVLDAPL FYQLLIGEME 
LSQGDAGTSY QVLLDAARKT RDERLFRRAT EVALQAQAGE QALDAARAWR QAVPGSIDAH
RFEVQLLVAL NRTAETVQPL RATLALVPPA QRPAAIGSLP GYFSRTTDRK ATTAVLEQVL
LPYAEQRSAG EPQATAVAAW VALGRSRLAA GDAARALDAA RRGQALSPRS EAVALLAIEL
MPTEQRAESL VTAFLEAQPP APAATRSAVR LVYARTLAVA QRYADAAPQL EAVTRDAPQF
VDAWLTLGAL RLELKQPAEA EAALREYLAR LEAGSEAAST EAADATQDED AATPMQRLTQ
AYLLLAQAAE QRRDFKAAEG WLAKVDSSQA LTVQSRRASL LAQQGKLAEA RSLIRALPER
QPEDARSKLL VEAQLLRDQK QWSEALGVLT EANDRYRDDT DLLYEQAMMA EKLDRMTEME
QLLRRVIALK PQQPQAYNAL GYSLADRNQR LPEARQLIIK ALELSPGDPF LIDSLGWVEY
RLGNHDEAIR WLRQAHGARP DTEIAAHLGE VLWVSGRRDE ARRIWAEARA RDATNDVLKE
TLARLKVDL