Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3778 |
Symbol | |
ID | 4785947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3996789 |
End bp | 3998099 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640092361 |
Product | hypothetical protein |
Protein accession | YP_001022966 |
Protein GI | 124268962 |
COG category | [S] Function unknown |
COG ID | [COG2718] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00112787 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGCTTC AGCAGATCAT CGATCGCAGG CTCGCAGGCA AGAACAAGTC GATCGGCAAC CGCGAGCGTT TCCTGCGCCG ACACAAGGCG CAGATTCGAG AGGCCGTGCG CAAGGCGGTC AGCGGCCGCG GCATCCGCGA CATCGAGCAA GGCGAGGACA TCACGCTGCC CAAGCGCGAC GTCTCCGAGC CGCTGTTCGG GCACGGTTCC GGCGGCAAGC GCGAGATGGT GCATCCGGGC AACAAGGAGT ACGTGCGCGG CGACCGCATC AAGCGACCCG AGGGCGGCGG CGGGCAGGGC GGCGGGTCGC AGGCCAGCGA CTCGGGCGAG GGCGAGGACG ACTTCGTCTT CCACCTGAGC AAGGAAGAGT TCATGCAGGT CTTCTTCGAC GACCTGGCAC TGCCCAACCT CGTGAAGACG CAGCTGGCCG AAACACCAGA GTTCAAGAAC CAGCGAGCCG GCTTCACCAG CGACGGCACG CAGAGCAATC TGCACGTGGT GCGTTCGATG CGCGGCGCGA TCGGCCGACG CATCGCGCTC GGCGCCGATG CGCGACGAGA GCTGCGGCAC CTGGAGGCCC AGCTCGCGAG CCTGAAGCAG CGTTCGCGGC TCGACACCGG CGGCGTCGAC CTCGACCCGG GCCATGCCTT GCGGCAGCGC GAGATCAAGG ACCTGGAGGG CCGCATCGAG CTGCTGCGCC AGAAGGTCGC ACGCATCCCC TTCCTCGACC CGATCGACTT GCGCTTTCGC AGCCGCGTCA AGGTGCCGGT TCCCACCACC AAGGCGGTGA TGTTCTGCCT GATGGACGTG TCGGGCTCGA TGGACGAAGC GCGCAAGGAC CTGTCGAAGC GCTTCTTCAT CCTGCTCTAC CTGTTCCTGA CGCGCCACTA CGAGAAGATC GACGTGGTCT TCATCCGCCA TCACACGCAG GCGCAGGAGG TCGACGAAGA AGGCTTCTTC CACTCGACCG AAACCGGCGG TACCGTGGTC TCCAGCGCCC TGGTGTTGAT GGAGGAGATC GTCCGCGCAC GCTACCCGAC CAGCGAGTGG AACATCTACG GTGCGCAGGC CAGCGACGGC GACAACTGGC ATCACGACAG CGGCCGCTGC CGCGAGATCG TGTCCACGCA ATTGCTGCCG CTGTGCCGCT ACTTCGCCTA CGTGCAGGTG GCCGAACCCG AGCAGAACCT GTGGGAAGAG TACGCACAGG TCGCCCGGAC GAACCCGCAC TTCGCGATGC GCAAGGTGCT CGAGCCTTCG CAGATCTACC CCGTCTTTCG TGATCTGTTC AAGAAGGAGG GCGCGACATG A
|
Protein sequence | MALQQIIDRR LAGKNKSIGN RERFLRRHKA QIREAVRKAV SGRGIRDIEQ GEDITLPKRD VSEPLFGHGS GGKREMVHPG NKEYVRGDRI KRPEGGGGQG GGSQASDSGE GEDDFVFHLS KEEFMQVFFD DLALPNLVKT QLAETPEFKN QRAGFTSDGT QSNLHVVRSM RGAIGRRIAL GADARRELRH LEAQLASLKQ RSRLDTGGVD LDPGHALRQR EIKDLEGRIE LLRQKVARIP FLDPIDLRFR SRVKVPVPTT KAVMFCLMDV SGSMDEARKD LSKRFFILLY LFLTRHYEKI DVVFIRHHTQ AQEVDEEGFF HSTETGGTVV SSALVLMEEI VRARYPTSEW NIYGAQASDG DNWHHDSGRC REIVSTQLLP LCRYFAYVQV AEPEQNLWEE YAQVARTNPH FAMRKVLEPS QIYPVFRDLF KKEGAT
|
| |