Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1952 |
Symbol | |
ID | 4784738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2088678 |
End bp | 2089769 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640090522 |
Product | hypothetical protein |
Protein accession | YP_001021145 |
Protein GI | 124267141 |
COG category | [R] General function prediction only |
COG ID | [COG3173] Predicted aminoglycoside phosphotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.126845 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCCC AGTCCGGAAC GAAACCCGTC ACCTCGCAGC ACGCATTCGA CGTTGGCGCA CTGCAGTCCT ATCTCGAATC CGCGCTCGAC GGCTTCCGCG GTCCGCTCAG CGTCGAGCAA TTCAAGGGCG GCCAGTCGAA CCCGACCTTC AAGCTGCTCA CCCCGCAGCG CAGCTACGTG ATGCGCAGCA AGCCGGGGCC GGTCGCGAAG CTGCTGCCGT CGGCACATGC GATCGAACGC GAGTTCACGG TGATGCGAGC ACTCGCCGGA AGCGAAGTAC CAGTGGCCCG GATGCACCTG CTGTGCGAGG ACGAATCGGT GATCGGCCGG GCCTTCTACG TGATGGAGTA CGTGGAGGGC CGCGTGCTGT GGAGCCAGGC ACTGCCCGAC ATGACCACCG TGCAGCGTGG TGAGATCTAC GACGAAATGA ATCGCGTGAT CGCGGCGCTG CACCGTGTGG ACTACGCCGC TTGCGGTCTG GCCGGCTACG GCAAGCCCGG CAACTACTTC GAGCGTCAGA TCGGACGCTG GAGCCGCCAG TACCAGGCCT CTCTGGGTCC CGGCGGACCG GAGCCGATCG ACGCAATGGA GCGCCTGATC GACTGGCTGC CGGATCACAT CCCGGCCAGC GCACGCGATG AATCCCAGAC CCGCATCGTT CACGGCGACT ACCGGCTCGA CAACCTGATC TTCCACCCGA GCGAGCCGCG CATCGTCGCG GTGCTCGACT GGGAACTGTC GACGCTGGGA CACCCTCTGG CCGACTTCAG CTACCACTGC ATGAGCTGGC ACATCTCGCC CGGCACCTTC CGCGGCATCG GCGGGCTCGA CGTCGCGGCG CTCGGCATCC CGACCGAAGC CGAGTACATG CAGCGCTATT GCGAACGCAC CGGCCGCGGC GGCACCGACG CGCTGGTGAC AGACTGGAAC TTCTACCTGG CATACAACCT GTTCCGCATG GCCGGCATCC TGCAGGGCAT CGCCAAGCGT GTGCTGGACG GCACCGCCGC GAGCGAGCAG GCCCGACAGG CCGCTGCCGG CGCTCGACCG CTAGCCGAAC TGGGCTGGGC GATCGCACAA CGGCAGCGCT GA
|
Protein sequence | MEAQSGTKPV TSQHAFDVGA LQSYLESALD GFRGPLSVEQ FKGGQSNPTF KLLTPQRSYV MRSKPGPVAK LLPSAHAIER EFTVMRALAG SEVPVARMHL LCEDESVIGR AFYVMEYVEG RVLWSQALPD MTTVQRGEIY DEMNRVIAAL HRVDYAACGL AGYGKPGNYF ERQIGRWSRQ YQASLGPGGP EPIDAMERLI DWLPDHIPAS ARDESQTRIV HGDYRLDNLI FHPSEPRIVA VLDWELSTLG HPLADFSYHC MSWHISPGTF RGIGGLDVAA LGIPTEAEYM QRYCERTGRG GTDALVTDWN FYLAYNLFRM AGILQGIAKR VLDGTAASEQ ARQAAAGARP LAELGWAIAQ RQR
|
| |