Gene Information |
Locus tag | Mpe_A1188 |
Symbol | |
ID | 4785589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1283549 |
End bp | 1286617 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640089753 |
Product | hypothetical protein |
Protein accession | YP_001020386 |
Protein GI | 124266382 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
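The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Mpe_A1188 record above.
length = gene_length_bp(1283549, 1286617)
print(length, protein_length_aa(length))
```

With the values in this record, the sketch reproduces the listed Gene Length (3069 bp) and Protein Length (1022 aa).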
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTACC CGATGCCACC TGTGATCAGT CCGCCGATCA AGACGGAGAT CAGCGTGGTC AAAGGTGTCG TCCACCACGC TGACGGCAGC TTTCCTTCGA TCCGGGCGAC CGCTGATTCA AGGCCAGTCA GCGACTTCGT TAACGCAGCT TCATCTGGTT GCGATCCCTT GCCGTGGCCT GTCGCTGCGG GCGCTGGTGG AGGACTGACC TGGGTCGTTA TCACTGGCGC CACCGCCGCG GAGTCAACGG CTTTCGGTGG GGGCGTCTGC TGCGCGCTGG CCAGCGTTGT CACGGTGGCC AAGGCAAACG CGATGAGCCA TGTCTTCATG TTCCCTCCGG AGCGCGTTGA TCGCGAATTG TGCCGCCATC GGAAGGCAGC CACCGCTCGT GCGGCCGACG TGAAGAAGGG CGCTAACATG ATGCTGCCGT GCTGGAATGT TGCGCACGGG CAGACCGCCT GGAGCGACGC AGCGGTCGAG TTAAGCGGTG CACTTGGAGA GGCCTCGATG CGCAACGGCG CGGGGAAGCG GGTCGGCGAC GACCTCTACG TCCATCTCGA GTTCGTTGAC GATCTCGCGG ATTCGTCGCA ACGCGCGATC ATTCGGAATG CGCTTCAGCG GCTCGACGAT GGAGGGCGAG CGTTGGCCAA CGTCGCGAAG ATCAACACCC GCTCACAGCG CGTTTCACTT CTCGCCTACC CGGAGTTCGA TGACGAGCCG TTTCCAGCCC TTGCTTTGAG CTGGTCCCCA GGGAGGAGCG GCAACAGCGA GCCAGTGCTA CGGTCCTATG TCGAATCGCT GAACCCACCG ATCCTGCACA GGAAGGAGCT GCTGGTGTCG GATCGGCATC CTGCGCGGGA ACGTTGGTGC GCGATGACCT CACAGGCCGA AGCGCTCGGC TTGTTCGACG ATCCGCTGTC AATCGGCTTC CGAATGAACT GGGAGCGACA CATTGCGGCA AAGGGATATC AGCTGCTCGG TGGCCAGCTC GCGCCGTTGG GCAACGCCAT CGACGATGTT GATGGGCGGG CGAACCGGGT AGGGCAATGT GTTCAGCGTC ACCTGACCGC GCTCGGGCGG TCGGCGCTCT CGGCACCGGT CCAGACTTTG CTGCGGTTGG GTTTGCTAAG CCGCGAGATG TCATTCTTCG ACTATGGCTG CGGACGAGGC GACGACCTGT CAACGCTCAG TGGCGAAGGT TTCGCTGCCC AAGGCTGGGA TCCTCACTAC GCGCCGAACA GGCCGCTCTC CACTGCCGAA GTGGTCAACG TCGGCTTCGT GATCAACGTC ATCGAAGATC CCGCGGAGCG CGTCGACGTG CTCCACCGCG CTTTCTCCCT CGCCCAGCGC GTGATGTCCG TGGCAGTGAT GCTGTACGGA CCGGAGAACG CTGGAAAGCC ATTCGGCGAT GGCTTCATGA CGTCGAGGGG CACGTTCCAG AAGTACTTTC AGCAGGCGGA ACTGAAAGAC TACCTCGAGC AGGCGCTGCA CCAGGAGGCA ATCCTTGTTG GGCCAGGCAT GGCATTGGTC TTCAAGGACA AGGATTGGGA GCAGCGATAC CTCGCCGGCC GATACAGACG GCGCGACGTG ACGGAGCGCC TGCTTGCTGT CCGGCCGAGG CCGCCCAAGC CAGTCAGGGA GCAACCTGTC CGTGAGATCG CCGCGCCACG GACGCCTGAA CCGCCACATC CACTGCTCAC GGAGCTGTGG CGTGCAACCC TTGACCTGGG CCGATACCCA GAAGAGGCTG AGATTGACAG GCTGCCGGAA CTCATCGACG TCTTCGGCAG CTTGGGCCGA 
GCCATCAGGA AGATGGTCCG CAGCTTCGAT GGGGCTATGC TGGCGAAGGC GCAGGCGGCG CGCGCCGACG ACTTGCTGCT CTACTTCGCG ATCCAGCAGT TCAGCAAGCG CCCTCGGTAT CGTCAGCTGG AGGTCCGCCT ACAGCGAGAC GTCAAGGCGT TCTTTGGTGA CTATGCGAGT GCCCAGGCGG CCGGTATGGA ACTGCTCACT CGTGCGGCTG ATGGAGATGC GCTGCTCACG GCTTGCCGAG AAGCGGTGAC TAGCGGGCTC GGCTGGCTGG ACGCCGACAA GCTGCAGCTT CATGTCTCGC TGGTCGAGCG GCTCCCGATT GTCTTGCGCG CCTTCGTTTC GTGCGGCCTC CTGGTCTACG GTGACCTGGG CAAAGTGGAT CTCGTCAAGG TGCACTGCGG CTCAGGGAAA CTGACGCTGA TGCAGTTCGA AAACTTCGAT GCACAGCCAC TGCCGCTGAT GACCAGGCGC ATCAAGGTGA ACGTACGCCG CGCCGACTAC GACCTCTTCG TGTACGGGGC CGAATACCCC AAGCCGCCTC TCTATCTCAA GGGTCGCTAC ATGCACGAAG AAATGGCGCA CTACGAAGAG CAGGAAGCCT TCGATCGGGC ACTGGAGGAA GCTGGTGTCC TGGGCAGCGC CGAACATGGA CCGACCTATG AGCAGCTGGA GAAGAGCTTG GCAAGGCGCA GGCTGGAGGT CAAAGGCTTT TCCCTTCGCC GCAGCACGAC CATCCCATCG CTCGACGAAG CGTGTGGCGC GACGCTTTCC TACCGCAGCT TCATCGAGTG CGGCGAAACC CAGGCGCGGC TGCGGCTGCC CAACACCCCG CTGAATCCGG AGAGCTATAA CGCCTTGTAC GACTTGGCCG TGAAGCTGCT CGATCCGATC GTGGAGTACT TCGGCGCGAT CCGCTTGACC TATGGCTTCT GTTCCCCTGG TCTGGGCGCG CACATCAAGA GGCGGGTGGC ACCCGACTTG GACCAGCATG CGGCACATGA ACTGAACCGG CGCGGCCAGC CGCTTTGCGA GCGCGGTGGC GCCGCATGCG ACTTCATCGT CGACGATGAG AACATGGAGG AGGTCGCCGA TTGGATTGTC GAGAACCTCC CTTTCGATCG GCTCTACTAC TACGGGAAGG ACAGGCCGAT CCACCTCAGC TATTCGCCTA CCGAGTTGGG AGAGGCGATC GAGTTGCGGG CTGGACCTTC CGGCCGCCTG GTGCCCCGGC GCTACAAGTC GGCAGGAACT CCCAAGTGA
|
Protein sequence | MKYPMPPVIS PPIKTEISVV KGVVHHADGS FPSIRATADS RPVSDFVNAA SSGCDPLPWP VAAGAGGGLT WVVITGATAA ESTAFGGGVC CALASVVTVA KANAMSHVFM FPPERVDREL CRHRKAATAR AADVKKGANM MLPCWNVAHG QTAWSDAAVE LSGALGEASM RNGAGKRVGD DLYVHLEFVD DLADSSQRAI IRNALQRLDD GGRALANVAK INTRSQRVSL LAYPEFDDEP FPALALSWSP GRSGNSEPVL RSYVESLNPP ILHRKELLVS DRHPARERWC AMTSQAEALG LFDDPLSIGF RMNWERHIAA KGYQLLGGQL APLGNAIDDV DGRANRVGQC VQRHLTALGR SALSAPVQTL LRLGLLSREM SFFDYGCGRG DDLSTLSGEG FAAQGWDPHY APNRPLSTAE VVNVGFVINV IEDPAERVDV LHRAFSLAQR VMSVAVMLYG PENAGKPFGD GFMTSRGTFQ KYFQQAELKD YLEQALHQEA ILVGPGMALV FKDKDWEQRY LAGRYRRRDV TERLLAVRPR PPKPVREQPV REIAAPRTPE PPHPLLTELW RATLDLGRYP EEAEIDRLPE LIDVFGSLGR AIRKMVRSFD GAMLAKAQAA RADDLLLYFA IQQFSKRPRY RQLEVRLQRD VKAFFGDYAS AQAAGMELLT RAADGDALLT ACREAVTSGL GWLDADKLQL HVSLVERLPI VLRAFVSCGL LVYGDLGKVD LVKVHCGSGK LTLMQFENFD AQPLPLMTRR IKVNVRRADY DLFVYGAEYP KPPLYLKGRY MHEEMAHYEE QEAFDRALEE AGVLGSAEHG PTYEQLEKSL ARRRLEVKGF SLRRSTTIPS LDEACGATLS YRSFIECGET QARLRLPNTP LNPESYNALY DLAVKLLDPI VEYFGAIRLT YGFCSPGLGA HIKRRVAPDL DQHAAHELNR RGQPLCERGG AACDFIVDDE NMEEVADWIV ENLPFDRLYY YGKDRPIHLS YSPTELGEAI ELRAGPSGRL VPRRYKSAGT PK
|
| |
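The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). The helper `translate` is a hypothetical name, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table built in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence listed above.
prefix = "ATGAAGTACC CGATGCCACC TGTGATCAGT"
print(translate(prefix))  # MKYPMPPVIS
```

The output matches the first ten residues of the protein sequence in the record. Applying the same function to the full gene sequence would be the natural way to check the remaining residues.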