Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1948 |
Symbol | |
ID | 4786709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2084266 |
End bp | 2086179 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640090518 |
Product | hypothetical protein |
Protein accession | YP_001021141 |
Protein GI | 124267137 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3170] Tfp pilus assembly protein FimV |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.461201 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.145014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTGATCGGTG TGAGCCTGTG CGCGTTGGCA CAGGGTGTCG TGGCTGTCGG TTTCGGAAAA TTGCCGGACT CCACGACGCT CGGGCAGCCG CTGGAACTGA AGATTCCGCT GCGGGTCGAC GCGGGCGAGG ATCTGCGTCC CGAATGCCTC AATGGCGAAG TCCATTTCTC CGACAACCTG CAGCAGCCCG GCACCACCTC GCTGCGGCTC GAACCCGCCG AACCCAGCGC AACCGAGCGC GTGCTGCGGC TTGCCACCAC CACGCGCGTC GACGAGCCGG TCGTGACGGT GCAGATCACG GTCGGATGTG CCTCCAAGGT CACACGCCGC TTCACGCTGT TCGCGGACCC GCCGCTGATC GCCCCCGCCG AAGTGGCCGC CCCAGTTGCG GCCGTTGCAC CGTCGGCACC ACCTGCGCCG GCGGCCTCCG CAGCGAGCCG CCCCGAAGCC GCGCCCCCGG ATACCGCGCC GATGCCGCGT GCCGCCACAC CGCGCACGCC GGCGCGCAGC CGTGCCACCA CGGCCGCACC ACGCCGTGCG CCCGCCACTG CCGCTGCCCC AATCGCTGTG CCGCGCCCCG CGCCGCAGCG TGCGGCCCGG CCCGCGGCGA CCGGCAACAA CGCCCGTTTG AAGCTCGCCC CTCTCGAGGC GGGCGCCCTG GTCGCGGCGA CGCCAACGCC CGCCGAGGTG GCCGCCAGCG CGGCCGCCGC CGAGCAGGCG GCCAGCGCCG CCAGTGCGGC AGCCCTGGCC GAGGCCGCCC GGGTCAGAGT CGAGGAACTC GAGGCGGCGA TGAACAAGCT GCGCCTCGAG ACCGCAACCA CCCAGCAGGC CATTGCCGGC CTGCAGGCGC GGTTGCAGCG CGCGGAAAGC GATCGCTACA GCAACCCATT GGTCTACGGT CTGGCGGCGG CCCTCGCGGT TCTGCTCGGC GTGGTGATCT GGCTCTGGCG GCAACGCAAC CAGGAGCGCC AGTCGCACGC CTGGCTGGTG CAGGCGGCCA CGCCGGCGGA AGCGCCGCGG AGCGGTGGCG TGGGAGCGCC GGCCGCAACG GCAGAGGCGG CCGCCTTCGC GCCGGTGCCG ACGCTCAAGG CCTGGGTCAG CAAGCCCAAC GAGATTGACG GTTTCGACGA CACGCTGGCC GCGACGCGGG CCGGCAGCAT GTCGGTGACC GGCGCGCTGT CGGAGCCTGC TCCGCTGAAT GCGGTGCGGC CGCTCAGCGC TGCGCAGAAG CGCGAGGTTT CGGTCGAGGA GCTGATCGAC CTCGAGCAAC AGGCCGAGTT CTTCATCGTC CTGGGTCAGG ACGCTGCGGC GATCGATCTG CTGATGGGGC ATCTGCGCAG CACGTCAGGT ACCAGTCCGC TGCCCTATCT GAAGCTGCTG GAAATCTACA AGCGCCGCGG TGACCGTAGC GACTACGAAC GGCTGCGCGA GCGCTTCAAC AGCCGCTTCA ATGCCTACGC CCCGGCCTGG GAGACCGACC TTCTGGGCGG ACGCACGCTG GAAGACTACC CGGCCGTCAT CGAGCAGCTC CAGGCGCTGT GGTCGACGCC GTCGCGAGCG ATGGACGTGT TGCAGGTCAG CCTGCTGCGC CCCGACGATG GCAATGCCGA TCCCGAGGGC AGCGACAGCT TCGACCTGCC GGCCTATCGC GAGCTGATGC TGCTGTATTC GGTCGCACGC GATCGATCGG AGCTCGAGGC CGGCGGGGCC GTCGACCTGC TCCTGCCGAT CGGCGCAGGC GAACCCACGG ACGATGCGCC GGCCAGCGCG GTCTTCGAGC GTCTGCTGGC GACGACCTCG CTCGAGGCCC AGCCGGAGGT GCAGAAGCCG CTGGCGGTCG ATCTGTCGCT CGAGGACCTC GAGCCCAAGC CGGCCGAAGG CGGCACAGGC CAGGACGGCG GTTCGCGCGG CTGA
|
Protein sequence | MIGVSLCALA QGVVAVGFGK LPDSTTLGQP LELKIPLRVD AGEDLRPECL NGEVHFSDNL QQPGTTSLRL EPAEPSATER VLRLATTTRV DEPVVTVQIT VGCASKVTRR FTLFADPPLI APAEVAAPVA AVAPSAPPAP AASAASRPEA APPDTAPMPR AATPRTPARS RATTAAPRRA PATAAAPIAV PRPAPQRAAR PAATGNNARL KLAPLEAGAL VAATPTPAEV AASAAAAEQA ASAASAAALA EAARVRVEEL EAAMNKLRLE TATTQQAIAG LQARLQRAES DRYSNPLVYG LAAALAVLLG VVIWLWRQRN QERQSHAWLV QAATPAEAPR SGGVGAPAAT AEAAAFAPVP TLKAWVSKPN EIDGFDDTLA ATRAGSMSVT GALSEPAPLN AVRPLSAAQK REVSVEELID LEQQAEFFIV LGQDAAAIDL LMGHLRSTSG TSPLPYLKLL EIYKRRGDRS DYERLRERFN SRFNAYAPAW ETDLLGGRTL EDYPAVIEQL QALWSTPSRA MDVLQVSLLR PDDGNADPEG SDSFDLPAYR ELMLLYSVAR DRSELEAGGA VDLLLPIGAG EPTDDAPASA VFERLLATTS LEAQPEVQKP LAVDLSLEDL EPKPAEGGTG QDGGSRG
|
| |