Gene Information |
Locus tag | Mpe_A1768 |
Symbol | |
ID | 4784227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1899993 |
End bp | 1901516 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640090339 |
Product | hypothetical protein |
Protein accession | YP_001020962 |
Protein GI | 124266958 |
COG category | [S] Function unknown |
COG ID | [COG0397] Uncharacterized conserved protein |
TIGRFAM ID | |
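
The coordinate and length fields above agree with each other, which a short sketch can confirm. It assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Mpe_A1768.
length_bp = gene_length(1899993, 1901516)
print(length_bp)                  # 1524
print(protein_length(length_bp))  # 507
```

Both results match the Gene Length (1524 bp) and Protein Length (507 aa) fields in the record.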
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0126151 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGATG TTTCCCTGCC GGAGCTCGAC CGGCCCGACG CGACGCCCCT CGGCCTGCAC TGGCGCAACC GCTATGCCGC ACTGGGCCCC GTCTTCCACA CGCGACTCGC AGCGCAGGCG CTGCCGCAGC CGCACTGGGT GGCCACCAGC GACAGCGCCG CCCGCCTGCT CGGCTGGCCC GGCGACTGGG CCGAGCGCGC CGACTGGCAG GCGCTGGAGG TGTTGTCCGG CGGACGCACC TGGCCGGGCA GCGAGCCGCT GGCCACCGTC TACAGCGGCC ACCAGTTCGG CGTCTGGGCC GGTCAGCTGG GCGATGGCCG GGCCCTGCTG CTGGGCGAGA TCGACACGCC GAACGGACCG ATGGAGCTGC AGCTCAAGGG CGCCGGCCGC ACGCCCTATT CGCGCATGGG CGACGGCCGC GCGGTGCTGC GTTCGTCGAT CCGCGAGTTC CTGTGTTCCG AAGCGATGCA CTTCCTGGGC ATTCCGACGA CGCGCGCGCT GGCGGTCGTC GGTTCACCGC TGCCGGTGCG GCGCGAAACC GTCGAGACCG CCGCGGTCGT GACGCGGGTG GCGCCGAGCT TCGTGCGCTT CGGCCACTTC GAGCACTTCG CCCATCATGG CTTGCCCGAA GCGCTGCGCA CGCTGGCCGA CTTCGTGATC GACCAGCACC ACCCCGCCTG CCGCGAGGCG GCCAATCCCT ATGCCGCGCT GCTGGAGACG GTGGCGCGCC GTACCGCCAC GCTGCTGGCC GACTGGCAGG CCGTGGGCTT CTGCCACGGC GTGATGAACA CCGACAACCT GTCGATCCTC GGCCTGACGA TTGACTACGG TCCGTTCGGC TTCCTCGACG GTTTCGACCC CGGCCATGTC TGCAACCACT CGGACCACCA GGGCCGTTAT GCCTACTCGC GCCAGCCGAG CGTGGCGTTC TGGAACCTGC ATGCGCTGGC GCAGGCGATG CTGCCGCTGA TCGCGATGGG CGGCGAGGTG ACCGAGGCCA CCGGCGACCT GGCGCTGGAG GCGATCGAGC CCTACAAGCA CACGTTCTCC GAGGCCATGG CGGCGCGGCT TCGCGCCAAG CTCGGCCTCG CCGGCGAACG CGACGAGGAC GTGGCGCTGG CCGACGACTG GCTCCAGCTG ATGGCCACCG AGCGCGCCGA CCACACCATC ACCTGGCGCC GGCTGGCGCA GTGGTCGCCG GCCGAGCCGC AGGCGGTGCG CGACCTCTTC CTCGACCGCC CGGCCTTCGA TGCCTGGGCC GACCGCTATG CACGTCGCCT CGCGCTCGAC GGCCGCGCGG AGGCCGAGCG CCGCTTGCAA ATGGACCGCG CGAACCCCAA GTACGTGCTG CGCAACCACT TGTGCGAGAA CGCGATCCGT GCGGCACAGG GTGGCGATTT CGGCGAGACA CAGCGCCTGC TGAAAGTTCT GGAGCGGCCG TTCGACGAAC AGCCGGAGCA CTCGGCCTAT GCCGAGTTTC CGCCCGATTG GGCGCAAACC CTGGAGGTGT CCTGTTCATC ATGA
|
Protein sequence | MQDVSLPELD RPDATPLGLH WRNRYAALGP VFHTRLAAQA LPQPHWVATS DSAARLLGWP GDWAERADWQ ALEVLSGGRT WPGSEPLATV YSGHQFGVWA GQLGDGRALL LGEIDTPNGP MELQLKGAGR TPYSRMGDGR AVLRSSIREF LCSEAMHFLG IPTTRALAVV GSPLPVRRET VETAAVVTRV APSFVRFGHF EHFAHHGLPE ALRTLADFVI DQHHPACREA ANPYAALLET VARRTATLLA DWQAVGFCHG VMNTDNLSIL GLTIDYGPFG FLDGFDPGHV CNHSDHQGRY AYSRQPSVAF WNLHALAQAM LPLIAMGGEV TEATGDLALE AIEPYKHTFS EAMAARLRAK LGLAGERDED VALADDWLQL MATERADHTI TWRRLAQWSP AEPQAVRDLF LDRPAFDAWA DRYARRLALD GRAEAERRLQ MDRANPKYVL RNHLCENAIR AAQGGDFGET QRLLKVLERP FDEQPEHSAY AEFPPDWAQT LEVSCSS