Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0176 |
Symbol | |
ID | 4784139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 190794 |
End bp | 191924 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640088724 |
Product | hypothetical protein |
Protein accession | YP_001019373 |
Protein GI | 124265369 |
COG category | [S] Function unknown |
COG ID | [COG3268] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.668245 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACA AGAAGCCCGT CGTCGTCTAC GGCGCCTCCG GCTACACCGG CCGCCTGATC TGCGAATACC TTCGCGAGTA CGGCATCCCG TTCATCGCCG CGGGTCGCAG CGCGGACAAG CTGCAGACGG CGATGCAGTC CAACGTGCCG GGCATCGAGA CCGCGAGCTA CGAGATCGCC GAGGTGCCGC ACAGCGTCGC GGCGCTGACC GAGCTGTTCC GCGGCGCGTC GGTGGTGCTG AACACCGTCG GACCGTTCGC GAAGTTCGGC CCGGAGGTCG TCGAGGCCTG CCTGGCCGCC CGCTGTCACT ACACCGACAC CACCGGCGAG CAGGACTGGC TGATCACGCT CGACGAACAG TACGGCGCCC AGTTCGCCGC CGCCGGCCTG CTGCTGTCGC CCGGCCTGGC GCACATGTAC ACCACCGGCG AGATCGCCGC GCAGCTGTGC CTCGAGACGC CGGGGCTCGA CACGCTCGAC ATCGCCGTGT TCTGGGGCGG CAGCCCGACG ATCGCGTCGA CGCAGACCAT CCTGGTCAAT GCTGCGACGT CCAAGGCCTA CTACCTCGAC CAGAACAGGT ACGTCGAGTG GCAGCCCGAC GCGGGTCTCT ACAACGTGAC CATCCCGGGC CAGCACGAGG CCGCGCTGGC GCTGCCCTGG GGCGGCACCT CGCATCCGGT GTGGTTCAAG CGCGACCCGC GTGTGGCCAC CGTCAAGGTG CTGGGCGGCG TGTTCAACAA GCCGCTGATG CAGGGCGTGC CGCTGATCGT CGCGGCGGCG CTGAAGGCGA CCGAGGGCAT GAACCCCGAG GAGCGCTACG CGGCGCTGGC CCAGACGGCC GCCGGCGTGA TGAACACCAT GCCACCACGC GAGAACCCGC GTCTCAACAA GTCGGTCGAC TCGGTCCACG CGTCCGGCCC GCTGGCGCGC GCGCACTGCG TGATCTTCGG CAACTGCAAC TACAAGCAGA CCGGGTTGCT GCAGGCGTTT GCGGCGGCCT CGCTGCTGCA GCAGGCGCCC CGGCGCGTCG GCTTCGCCTC CGGCTGCCAG GCCTTCGGAC ATCACGAACT GCTCGGCGCC CTGCGCAGCT TCGGCCTGGT GCAGGCGCCG ATCCTGACCG TCCACCGCTA G
|
Protein sequence | MSNKKPVVVY GASGYTGRLI CEYLREYGIP FIAAGRSADK LQTAMQSNVP GIETASYEIA EVPHSVAALT ELFRGASVVL NTVGPFAKFG PEVVEACLAA RCHYTDTTGE QDWLITLDEQ YGAQFAAAGL LLSPGLAHMY TTGEIAAQLC LETPGLDTLD IAVFWGGSPT IASTQTILVN AATSKAYYLD QNRYVEWQPD AGLYNVTIPG QHEAALALPW GGTSHPVWFK RDPRVATVKV LGGVFNKPLM QGVPLIVAAA LKATEGMNPE ERYAALAQTA AGVMNTMPPR ENPRLNKSVD SVHASGPLAR AHCVIFGNCN YKQTGLLQAF AAASLLQQAP RRVGFASGCQ AFGHHELLGA LRSFGLVQAP ILTVHR
|
| |