Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1814 |
Symbol | |
ID | 4786813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1954932 |
End bp | 1955993 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640090385 |
Product | hypothetical protein |
Protein accession | YP_001021008 |
Protein GI | 124267004 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTTC GCGAAATCCC CCCCGCACCG CAACCGGTAG GCTATCAGCG GCAACCGCGG CCGGACGATG CCCGCTGGAT GCAGTGGATC GCCGAGAACC GGCTGCGCGA CTGCACAGCG GCTTCGATGA CAGACACCAT GGTCGCCGCG GGCCTGGACC GGCAGGCCGC TCAGCAGGCA ATCGCCGCGA TGGAGGTCCA CCCGGTGTTC GTGGCCGCAC GCCGCCACCA ACAGTTGCAG CGCAAGTTCG CTTCGGTGAT GGCCAATCAA CAGCGCATCT GGGAGATGGG TTGGGCCTAC GACGGCGTGG AGAAACGCAG CCACGTTTCG CCGGCCGAGT TCTTCGAGCG TTACGTGGTG GGTTCCCGGC CGCTCGTGCT GACCGACGTG GCGGGCGATT GGCCCGCTCT GCATCGCTGG TCGCCGGCCG ACCTGCGCGA ACGGTTCGGT CACCTCGATG TCGAGATCCA GGCGGAACGG GCCGTCAACC CGAAGTACGA GCAGGACAAG CTCAAGCACC GCCACAACGT CCGGCTCGGC GATTTCGTCG ATCGCGTGCT GGCGGGCGGT GCCACCAACG ACTACTATCT GACCGCCAAC AACGAGATTT TGCGCCGACC GGAGTTCGCG CCACTGCTGG CTGACATTGG AACGCTGCCG CTATTCTGCG ACCCGGCACA GTTGGCGCAA CGCTCCTCGT TCTGGTTCGG CCCGGCCGGC ACCGTCACTC CCCTGCATCA CGACACCCTG ATGTTGCTGC ACACCCAGGT GGTCGGACGC AAGCGCTGGC GCTTCATCTC GCCACTCGAG ACGCCGCGTC TATACAACCA CGACGGGGTG TTCAGCGCAA TCGACTTGGA TCATCCTGAC CTTGACCGTT ACCCGGCCTT CCGCGACGTC AAGGTGCTCG AGGTGGTGTT GGAACCGGGT GACACCGTCT TCCTCCCGCT GGGCTGGTGG CACCAGGTCG CCTCGCTGGA AGTCAGTCTG TCGTTCTCGT TCTCAAATTT CGTGTTCCCT AACACTTACA GCTATGAGAA TCCGTCAATC TCAGACTGGT GA
|
Protein sequence | MALREIPPAP QPVGYQRQPR PDDARWMQWI AENRLRDCTA ASMTDTMVAA GLDRQAAQQA IAAMEVHPVF VAARRHQQLQ RKFASVMANQ QRIWEMGWAY DGVEKRSHVS PAEFFERYVV GSRPLVLTDV AGDWPALHRW SPADLRERFG HLDVEIQAER AVNPKYEQDK LKHRHNVRLG DFVDRVLAGG ATNDYYLTAN NEILRRPEFA PLLADIGTLP LFCDPAQLAQ RSSFWFGPAG TVTPLHHDTL MLLHTQVVGR KRWRFISPLE TPRLYNHDGV FSAIDLDHPD LDRYPAFRDV KVLEVVLEPG DTVFLPLGWW HQVASLEVSL SFSFSNFVFP NTYSYENPSI SDW
|
| |