Gene Information |
Locus tag | Mpe_A2271 |
Symbol | |
ID | 4785110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2431570 |
End bp | 2432511 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640090839 |
Product | hypothetical protein |
Protein accession | YP_001021462 |
Protein GI | 124267458 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
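
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked in a few lines. This is a minimal sketch: it assumes 1-based inclusive coordinates (the convention that reproduces the listed 942 bp) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2431570, 2432511)
print(length_bp)                  # 942
print(protein_length(length_bp))  # 313
```

Both values match the Gene Length and Protein Length fields listed above.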
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.160712 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGCGC CTGATTTCAG TGCGGTGATG GCTGAGCGCG AGAAGTCTGT GCTGAGGTCC GACACCTTCC AGTCCGCGTC GAGCAACGGC AAGGTCCTGG TGGCGGGGAC CGCGAGCGGG GCGCTGGTCA GCTCGGCCGA TGCGGGGCGC AGCTGGAGCC GCCAGCGCCT GGCCTCGCCG GCCTCCGTCA TTGCCCTGAC CGCCTGCGCC GACGGCAGCT TCATCGGCCT GGACTTCTAC CGCAAAGTCT GGATCGGCGA CGCTGCCGGC CAGCAATGGA GCGCGCGACC CCTGCCCGGC AAGATCAACC CCATGGCCAT CGCCTGCGCC CCCGACGGAC GCCTGTGGGT GGCCGGCAGC CACACCACGC TCCTCTCCAG CGCCGACCGC GGCCAGACCT GGGAAGCCCG CGACTTCGAC GAAGACGCGC TCCTCACCAC CCTCCAGTTC ATCGACGACC AGCACGCCGT CGTCACCGGA GAATTCGGCA CCGTCCTCAC CAGCGCCGAC GCCGGCAAGA CCTGGGTCAA GCAAGCCCCC ATCCCCGGCG ACTTCTACCC CTACGCCACC GTCTTCACCG ACCGCTCGAA CGGCTGGACC AGCGGCCTGG GCGGCGTCAT CTGGCACACC GCCGACGGCG GCAAGACCTG GCGCGCCCAG GACAACCGCG CCGCCGCCCC CATGTACACC CTGCTGCGCC AGGGCGACGA ACTCTACGGC CTCGGCGGAG GCGGCCTCAT GGTCGTCAAG CGAGGCGACA CCTGGGAACG CTTCGACCAC GGCCTCGTCC CGCCCGCCTA CCTCGCAGCC GGCGCCGTGC TCGACGCCCG CTCCATGCTC GTGGCCGGCT CCGCCGGCGC CCTGCACGTC GTCACCGCGC CCGGCCAGAT CGCGCTCGCC GCCCACGACC CCCACACCCC CAGCCAAGGA GCCGCCCGAT GA
|
Protein sequence | MQAPDFSAVM AEREKSVLRS DTFQSASSNG KVLVAGTASG ALVSSADAGR SWSRQRLASP ASVIALTACA DGSFIGLDFY RKVWIGDAAG QQWSARPLPG KINPMAIACA PDGRLWVAGS HTTLLSSADR GQTWEARDFD EDALLTTLQF IDDQHAVVTG EFGTVLTSAD AGKTWVKQAP IPGDFYPYAT VFTDRSNGWT SGLGGVIWHT ADGGKTWRAQ DNRAAAPMYT LLRQGDELYG LGGGGLMVVK RGDTWERFDH GLVPPAYLAA GAVLDARSML VAGSAGALHV VTAPGQIALA AHDPHTPSQG AAR
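
The relationship between the gene and protein sequences above can be illustrated with a small translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand of the replicon) and uses the standard codon assignments, which translation table 11 shares for elongation codons. Table 11 also allows alternative start codons, which this sketch does not handle because the gene starts with ATG. The helper names are hypothetical.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final blocks of the gene sequence listed above.
print(translate("ATGCAGGCGC CTGATTTCAG TGCGGTGATG"))  # MQAPDFSAVM
print(translate("GCCGCCCGAT GA"))                     # AAR
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the complete protein and the GC content field to be checked the same way.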