Gene Information |
Locus tag | Mpe_A2229 |
Symbol | |
ID | 4785361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2383606 |
End bp | 2384655 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640090797 |
Product | hypothetical protein |
Protein accession | YP_001021420 |
Protein GI | 124267416 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
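The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mpe_A2229.
length = gene_length(2383606, 2384655)
print(length, protein_length(length))  # prints "1050 349"
```

Under these assumptions the result agrees with the listed Gene Length (1050 bp) and Protein Length (349 aa).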
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAATC CGGCGTCCGC CGACCACGAA GGAGACAAGA TGCAGACAGT CAAGAAGCTC TTCCTGCGCA CCGCGCTCGC GACGGCCGCG CTGGCCGGCC TAGGCCATGC GCCGCTGGCT GCCGCCTGGG AACCCGTCAA GCCGATCGAG TTCGTCGTGC CGGCGGGCAC CGGTGGCGGT GCCGACCAGA TGGCGCGCTT CATCCAGGGC GTGGCGGCGA AGAACAACCT GACCAAGCAG CCGATCGTCG TGGTCAATCG TTCGGGCGGC GCTGGCGCGG AGGGCTTCCT CGCCGTGAAG GAAGCGAAGG GCGATCCGCA CAAGATCATC ATCACGCTGT CGAACCTGTT CACCACGCCG CTGGCCACCG GCGTGCCATT CAACTGGCGC GACCTGACGC CGGTGCAGAT GCTGGCGCTC GATCAGTTCG TGCTGTGGGT CAACGAGGAG TCGCCTTACA AGACGGCCAA GGCCTACTTC GACGCGGTGA AGGCCGCGCC GCCCGGCAGC GTGAAGATGG CCGGCACCGG CTCCAAGCAG GAAGACCAGA TCATCACCGT GCTGCTGGAA AAGGCCGCCG GCAAGAAGAT CACCTACATC CCCTTCAAGG GCGGCGGCGA CGTGGCGGTG CAACTGGTCG GCAAGCACGT CGACTCCACC GTCAACAACC CGATCGAGGC CGAGTCGCAC TGGCGCGCCG GCAAGCTGCG GGCGCTGTGC GTGTTCGACA AGCAGCCGAT GCCGTACAAG ACCAAGCTCA CAGCCACCCA GTCGTGGGCC GATGTGCCGA CCTGCCCGGC GGCGGGCCTG CCGGTCGAGT ACGTGATGCT GCGCGGCATC TTCATGCCGC CTGGCGTGTC GCAGGAGCAG GTGGCCTACT ACCTCGACCT GTTCAAGAAG CTGCGCGCGC TGCCCGAGTG GCAGGAGTTC ATGGCCAAGG GCGCCTTCAA CCAGACGGCA CTCACCGGCT CCGAATTCTT CGACTGGCTC GGCAAGACCG AGCAGATGCA CCGCGTCCTC ATGAAGGAAG CGGGCTTCAT CGCGCAATAA
|
Protein sequence | MINPASADHE GDKMQTVKKL FLRTALATAA LAGLGHAPLA AAWEPVKPIE FVVPAGTGGG ADQMARFIQG VAAKNNLTKQ PIVVVNRSGG AGAEGFLAVK EAKGDPHKII ITLSNLFTTP LATGVPFNWR DLTPVQMLAL DQFVLWVNEE SPYKTAKAYF DAVKAAPPGS VKMAGTGSKQ EDQIITVLLE KAAGKKITYI PFKGGGDVAV QLVGKHVDST VNNPIEAESH WRAGKLRALC VFDKQPMPYK TKLTATQSWA DVPTCPAAGL PVEYVMLRGI FMPPGVSQEQ VAYYLDLFKK LRALPEWQEF MAKGAFNQTA LTGSEFFDWL GKTEQMHRVL MKEAGFIAQ
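The relationship between the two sequences above can be sketched in code. This example assumes the gene sequence is displayed in coding orientation (it starts with ATG and ends with TAA even though the gene lies on the minus strand), so no reverse complement is applied. The codon assignments are those of NCBI translation table 11, which for ordinary codons match the standard code. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
prefix = "ATGATCAATC CGGCGTCCGC CGACCACGAA"
print(translate(prefix))  # prints "MINPASADHE"
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` can be compared with the GC content field; neither result is asserted here.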