Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1204 |
Symbol | |
ID | 4787051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1299697 |
End bp | 1301586 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640089770 |
Product | hypothetical protein |
Protein accession | YP_001020402 |
Protein GI | 124266398 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.290549 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATCA GCATCGCAGT CCGTCTGAAC CGCTACGGTG CGATATACAC CGGCCGTAAG CTGTTCCGCA AGCTCGACAC GCAGTTGGGC GCTCCTGGAT GGGTTGCTGA AGTGCTCAGT GCGCGTGCCA GACGCGCCGA GTCGACACCT GAACACACCA GCGTGGCGGA CTCCGTTTGG CTGGCGCTTT CCGCAGCGGC GGAGAACGCC GAGTGGCTCG CGGCTGCGTC GATACCTGTC GGTCGAGTAG GACTCACAGA CATTGCTCGA GCTCTGGCCG CGCTCGAGGA CTCTATTGCC GAGACCAGCT GGCCAGAAGC TCAGAAGTTC AGTCGTTCAA ATGAGACTAG GCGCTGGCTG CTAGAGCACA AGCTCTTTCC GCCAAACATC GACCCAGAAC GACTGAGCCG CCACTTCGTC AGCCGGTTCG ATCGCGTCGC GCCTCAGAAT CGAAAGTTGT TCAGCGAACT ACCGGACGGC GCGAACACTG TTCATCGCTA CCCTGTGGGT GCACTGCCCC ACGAGTCATT AGCTGACCTC AAGGCCCAAA TACGGCAGAC GCTCGACACC GATCTCGCCA GAGTCGTCGA GGGAGCAGTC AAGGATCTTG ACGCATTCGC CACTCTGCAG TTGACGATCG CAGACCTTGC GCGGAGCGAG TGCCACGGAA GCGAGCTCGA GCAACTCAAG TTGTTCATTG AGTCCAACGC GCACCTGCGG CATAAGTTCG TGCCTGAGAT AGTGCGCGTG GCTAGCCCCC AAACGGTCTT GACCGCATAC GCCCAGGTCA TCGAAAAGTG GCGGCGAGAG GCACGTGTAC CAATACTTCC AACCGTGTAT GGCGGCGAAG CGATGTGCGC CCTGGCCCGT GACTACGGCG TGAAGATCGA TCGCAACAAA GCATATCGAC TCCTGACCCC AACGGTCCTG ACTCAGACGG AAATGCTTGC CTGCGCGCTC ATTCTTCAAT GCGCGTCACG ATGGAACTTC ACGACTGTCG TAGCCCTCAC CACGAAGGGA ATAGTTCCTA ACGGCAATGG GTTCATCGTG ACTTCACTGA AGGGACGAAC GAATCAAACT GCTCCCGATC TGGTCGTATC ACCTCGAGAT CACGAAGTCC TGCGGGCCTT GCGCACACTC AAAGAGAATC TTGGCGAAGT CAAAGCGCTC GGTTGGGTCG CGAGCGGTGA GGACCGCCTC TTCTTCAACA CGCACGTAGC TAGGCGCGGC GTAGTCCGTC CCTATGCCAA CTGGCACTAC GTCCTGTCGG GATTCATCTC TCGACATGAC TTGCCTCAGT TCTCACTGGA CCAGGTTCGA GTTCAAGCGA TCAATGCCTT TGATCTCGAG AGCGCGAGTA TCGAGGCGAC GCAACGGAAG GCCGGACATG CTACGTCAAC CACAACGGCG CGTTACCTGG ACCAACCCAT CCTTCGGGCC ATTAACTCGT CAATAAATTT GGCCTACCAG CGCGAGCTAG AACGCTCTGT GCAGTTCGCC ATCGAAGGTC AACCATGCCC GACGGGCAGG CTTTTCTCAC CGGTGGGCGA TGGAAGTTCG TGCGCTGACC CTGCAACACC ACCGAGGCTC GATATGCTCG TCGACGGGCT TTGCGAAGCA CACGAATGCC ACCTTGGCGC CGGGTGCCCC AACAGAAGAA TCGTCATCGA CACCGATGCA CTCAGGGACC TCACGTGCAC GCACCGGTTC TACAGTCGTC ACTGGAAGGC GCTCCTCGAT GAGAACGCCG AAGCATTCGA GAAGCACCAC CTTCCTACGA TGCTGTTCAC ATTCGGCCTT CGAGAAATTG TCGCGCAGGG ACCTTATCGA AGGTACCTGG CACTGGCCGA AGGGCCTGTC GATCCACCAG CATTCCCGCC ACTGAGCTAG
|
Protein sequence | MDISIAVRLN RYGAIYTGRK LFRKLDTQLG APGWVAEVLS ARARRAESTP EHTSVADSVW LALSAAAENA EWLAAASIPV GRVGLTDIAR ALAALEDSIA ETSWPEAQKF SRSNETRRWL LEHKLFPPNI DPERLSRHFV SRFDRVAPQN RKLFSELPDG ANTVHRYPVG ALPHESLADL KAQIRQTLDT DLARVVEGAV KDLDAFATLQ LTIADLARSE CHGSELEQLK LFIESNAHLR HKFVPEIVRV ASPQTVLTAY AQVIEKWRRE ARVPILPTVY GGEAMCALAR DYGVKIDRNK AYRLLTPTVL TQTEMLACAL ILQCASRWNF TTVVALTTKG IVPNGNGFIV TSLKGRTNQT APDLVVSPRD HEVLRALRTL KENLGEVKAL GWVASGEDRL FFNTHVARRG VVRPYANWHY VLSGFISRHD LPQFSLDQVR VQAINAFDLE SASIEATQRK AGHATSTTTA RYLDQPILRA INSSINLAYQ RELERSVQFA IEGQPCPTGR LFSPVGDGSS CADPATPPRL DMLVDGLCEA HECHLGAGCP NRRIVIDTDA LRDLTCTHRF YSRHWKALLD ENAEAFEKHH LPTMLFTFGL REIVAQGPYR RYLALAEGPV DPPAFPPLS
|
| |