Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3600 |
Symbol | |
ID | 4786126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3810948 |
End bp | 3812405 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640092182 |
Product | hypothetical protein |
Protein accession | YP_001022788 |
Protein GI | 124268784 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.229476 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.563302 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGAGC ACATCTGCGG TCCCGGCTGC AACCACGGCC CTGCCACGCC CGAGGCCGAG GCAGAAGTCC GCGAGGAGTT CAAGGTCGCG CGCCGCAGCT TCCTGCGCGA CGCGCTGGCG GTCGGCGGCG CCACCGTGTC GGCGGCCTCT CTGAACGTGG CGATGACGCC CAGCGCCTTC GCGCAGAGCG CCGCGAAGCC GGGCTCGGGG CTCACCTCGC ACTACTACAT CCCGGCCTCG GCCGGCACCG TGCTGTGGGG CTACTTCAGC AAATCGGCCA AGCCGGTGGT GGAAGTGGAG TCGGGCGATT TCGTGACCAT CGAGACCCTG ACCCACCACG CCAACGACGA CGCCGAGCGC ATGATCAAGG GCGACCCCGG CGCCGAGAGC GTGTTCCACT GGGACGCCAA GAAGAAGGGC GTGAACCGCC GTGGCGCCGG CGCGATGGAC GCCAAGGTGG GCGCCGGTGG CGGCGAAGGC GTGCACATCT GCACCGGGCC GGTGCGCATC AAGGGCGCGG AGCCGGGCGA CATCCTGGAA GTGCGCATCG TCGACGTGGC CACGCGGCCC AGCGCCAACC CGGCCTACAA GGGCCGCGCC TTCGGCAGCA ACGCCGCCGC GTGGTGGGGC TTCCACTACG GCGACACGAT CACCGAGCCG AAGAAGCGCG AGGTGATCAC CATCTACGAG GTCGACGCCA CCGGCGAGCG CAACTGGGCG CGCGCGGTCT ACAACTTCAA GTGGACGCCG CAGACCGACC CCTTCGGCGT GGTGCACCCG ACGATCGACT ACCCCGGCGT GCCGGTCGAC CACCGCACCA TCACCAAGAA CGAGAATGTG CTGAAGAACA TCCGCATCCC GGTGCGGCCG CACTTCGGGA CCATGGGCGT GGCGCCGGTC GAGGCCGAGA TGGTGAACTC CATCCCGCCC AACTACACCG GCGGCAACAT CGACAACTGG CGCATCGGCA AGGGCGCCAC CGTCTACTAC CCGGTGGCCG TGCCCGGCGC CATGTTCTCG GTGGGCGACC CGCACGCGTC GCAGGGCGAC TCCGAGCTCT GCGGCACCGC CATCGAGTGC TCGCTGACCG GCACCTTCCA GCTCATCCTG CACAAGAAGG CCAGCCTGCC CGGCACGCCG CTGGCCGAGC TGAAGTACCC GCTGCTCGAG ACGCAGGACG AGTGGCTGCT GCACGGCTTC AGCTACGCCA ACTACCTGGC CGAGCTGGGC CCCAATGCGC AGAACGACAT CTACAGCAAG TCATCGGTCG ACAAGGCGCT GCGCGACGCG TATCACAAGA TGCGCCATTT CCTGATGACC ACGCAGGGCC TGGGCGAGGA CGAGGCGATC TCGCTGATGT CGATCGCGGT CGACTTCGGC ATCACCCAGG TAGTGGACGG CAACTGGGGC GTGCACGCGG TGGTGAAGAA GAGCATCTTC CCCGCGCGCG GCGGCTGA
|
Protein sequence | MYEHICGPGC NHGPATPEAE AEVREEFKVA RRSFLRDALA VGGATVSAAS LNVAMTPSAF AQSAAKPGSG LTSHYYIPAS AGTVLWGYFS KSAKPVVEVE SGDFVTIETL THHANDDAER MIKGDPGAES VFHWDAKKKG VNRRGAGAMD AKVGAGGGEG VHICTGPVRI KGAEPGDILE VRIVDVATRP SANPAYKGRA FGSNAAAWWG FHYGDTITEP KKREVITIYE VDATGERNWA RAVYNFKWTP QTDPFGVVHP TIDYPGVPVD HRTITKNENV LKNIRIPVRP HFGTMGVAPV EAEMVNSIPP NYTGGNIDNW RIGKGATVYY PVAVPGAMFS VGDPHASQGD SELCGTAIEC SLTGTFQLIL HKKASLPGTP LAELKYPLLE TQDEWLLHGF SYANYLAELG PNAQNDIYSK SSVDKALRDA YHKMRHFLMT TQGLGEDEAI SLMSIAVDFG ITQVVDGNWG VHAVVKKSIF PARGG
|
| |