Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3232 |
Symbol | |
ID | 4786511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3434768 |
End bp | 3436597 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640091805 |
Product | hypothetical protein |
Protein accession | YP_001022420 |
Protein GI | 124268416 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0129967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCTT TCCGATTCGC TGTCCTGGCC CTCGCGGCCT CCAGCGTTCT TGCCGCGACG GCGTGGGCCC AGCCCGCTCC GGCGGAAGCG CCGGCATCCG GGCCGGGCGC TGCGGCACCG GTGGAAAATT CGGTCCTCGA CGCACCGCTG TTCTACCAGT TGCTGATCGG CGAGATGGAG CTCAGCCAGG GCGATGCCGG TACCAGCTAC CAGGTCCTGC TGGATGCGGC CCGCAAGACC CGCGACGAGC GCCTGTTCCG GCGCGCCACC GAAGTCGCTT TGCAGGCACA GGCCGGCGAG CAGGCGCTCG ATGCGGCACG CGCCTGGCGC CAGGCCGTGC CCGGCTCGAT CGACGCCCAC CGCTTCGAGG TGCAGTTGCT GGTCGCGTTG AATCGCACCG CCGAAACCGT GCAGCCGCTG CGGGCCACGC TGGCGCTGGT GCCGCCGGCC CAGCGGCCGG CGGCCATCGG GTCCTTGCCT GGCTATTTCT CGCGCACCAC CGATCGCAAG GCGACCACGG CGGTGCTCGA ACAGGTGCTG CTGCCCTATG CAGAGCAGCG CTCGGCGGGC GAGCCGCAAG CCACCGCGGT GGCCGCCTGG GTGGCGCTGG GCCGCAGCCG GCTGGCCGCC GGCGATGCGG CGCGCGCGCT CGACGCCGCG CGCCGTGGTC AAGCGCTGAG TCCGCGGTCG GAGGCCGTGG CGCTGCTCGC CATTGAACTG ATGCCGACCG AGCAGCGGGC CGAGTCGCTG GTCACGGCCT TTCTCGAGGC CCAGCCACCG GCTCCGGCCG CCACGCGCAG CGCGGTGCGG CTGGTCTATG CCCGCACGCT GGCTGTCGCG CAGCGCTATG CGGACGCCGC GCCGCAGCTC GAGGCGGTGA CGCGCGACGC GCCGCAGTTT GTCGACGCCT GGCTGACGCT GGGCGCCTTG CGGCTGGAAC TGAAGCAGCC GGCCGAGGCC GAAGCGGCGC TGCGCGAGTA CCTCGCGCGC CTCGAGGCCG GCAGCGAGGC GGCATCGACC GAAGCGGCCG ATGCCACGCA GGACGAGGAC GCCGCCACGC CCATGCAGCG TCTGACGCAG GCCTACCTGC TGTTGGCCCA GGCCGCCGAG CAGCGGCGCG ACTTCAAGGC CGCCGAGGGC TGGCTGGCCA AGGTCGACAG CTCCCAGGCA CTGACGGTGC AATCGCGGCG CGCATCGTTG CTGGCGCAGC AGGGCAAGCT CGCCGAGGCG CGCAGCCTGA TCCGCGCGCT GCCGGAACGG CAGCCGGAGG ACGCGCGCTC CAAGCTACTG GTGGAGGCGC AACTGCTGCG CGACCAGAAG CAGTGGTCGG AGGCACTGGG CGTCCTGACC GAGGCCAACG ACCGCTACCG CGACGACACC GACCTGCTGT ACGAGCAGGC GATGATGGCC GAGAAGCTCG ACCGCATGAC CGAGATGGAG CAGTTGCTGC GTCGCGTGAT CGCGCTGAAG CCGCAGCAGC CGCAGGCCTA CAACGCGCTG GGCTACTCGC TCGCTGATCG CAACCAGCGC CTGCCGGAAG CGCGCCAGCT GATCATCAAG GCCCTGGAGC TGTCGCCGGG CGATCCGTTC CTGATCGACA GCCTGGGCTG GGTCGAGTAC CGGCTCGGCA ACCATGACGA GGCGATCCGC TGGTTGCGGC AGGCACACGG CGCACGACCG GACACCGAGA TCGCCGCGCA CCTGGGTGAG GTGCTCTGGG TCAGTGGCCG GCGCGACGAG GCTCGTCGGA TCTGGGCCGA GGCCCGGGCC CGCGACGCCA CGAACGACGT GCTGAAGGAA ACCCTGGCGC GGCTCAAGGT CGATCTGTGA
|
Protein sequence | MAPFRFAVLA LAASSVLAAT AWAQPAPAEA PASGPGAAAP VENSVLDAPL FYQLLIGEME LSQGDAGTSY QVLLDAARKT RDERLFRRAT EVALQAQAGE QALDAARAWR QAVPGSIDAH RFEVQLLVAL NRTAETVQPL RATLALVPPA QRPAAIGSLP GYFSRTTDRK ATTAVLEQVL LPYAEQRSAG EPQATAVAAW VALGRSRLAA GDAARALDAA RRGQALSPRS EAVALLAIEL MPTEQRAESL VTAFLEAQPP APAATRSAVR LVYARTLAVA QRYADAAPQL EAVTRDAPQF VDAWLTLGAL RLELKQPAEA EAALREYLAR LEAGSEAAST EAADATQDED AATPMQRLTQ AYLLLAQAAE QRRDFKAAEG WLAKVDSSQA LTVQSRRASL LAQQGKLAEA RSLIRALPER QPEDARSKLL VEAQLLRDQK QWSEALGVLT EANDRYRDDT DLLYEQAMMA EKLDRMTEME QLLRRVIALK PQQPQAYNAL GYSLADRNQR LPEARQLIIK ALELSPGDPF LIDSLGWVEY RLGNHDEAIR WLRQAHGARP DTEIAAHLGE VLWVSGRRDE ARRIWAEARA RDATNDVLKE TLARLKVDL
|
| |