Gene Information |
Locus tag | Mpe_B0174 |
Symbol | |
ID | 4787777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | - |
Start bp | 150723 |
End bp | 153062 |
Gene Length | 2340 bp |
Protein Length | 779 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640092582 |
Product | hypothetical protein |
Protein accession | YP_001023187 |
Protein GI | 124262717 |
COG category | [S] Function unknown |
COG ID | [COG1479] Uncharacterized conserved protein |
TIGRFAM ID | |
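The coordinates and lengths listed above can be cross-checked against one another. The sketch below assumes the Start and End positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 150723
end_bp = 153062

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows of the table.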
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00791399 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00018824 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGGCTCAGC CGTTTACCCA CAGCGCCGAA GCACTGCATG GCTTCTTTTC TCAGATGGGC AGGGGCTTCT ACATCCCCTA CTACCAGCGG AACTACTCAT GGGATGACGA AAACGCGAAG AAGCTCATCA CGGACATCTT CGCCGCCGTT CGCCGCACGC TCTCTAAGCC CACGCATGCG CTGTTCCTCG GCACGGTCAT TCTTCATGAC GAAAAGAATC CGAAGCTGGG AACCCACTTC GACACGCCGA ATCTGTTGAC CAAAGTTTCC AACGTGGTCG ACGGCCAGCA GCGCATCACG TCCCTAGCCA TCCTCGCATG CGTGTTTGCT GAGTACGCGG GACGGTTGGC GAGCCAGCTG ACCACGGTCG GCGGCGCCCA TGCCGAGTTC GCAAGCTTGG CGAACGAACT GACCGACGAG GTCCGTCTGA TTCGCGAGTT CTTTTCCGTG GAGACGACCA AGGGGGGGGC CCAGCCGAGC CGCAAGCCCA TCGTCATTCG CGCCGGTGAC GTGACCAGCA ACCCCGTCTC GGACCAATGG ACGTTGGCCG GCGCCATCGA CGCGTTCTAT CGGTCCAACA CTTCATCGCT ACTGGCGAAT GCGATCAATG GGGTCAAGCT GGCGGACTTG GCGCTGGATG ATCGTTTGAA GTCGGTCGTC GAAGAGTTCA TCGATACCAT TGATTCGCAG CTTGAGACTG CGGACCAGGT GCTGGCCTCG CGCCTACTTG TCGCGAACGA TATGGAAGGC AGCGCGCTTG AACGCTTCAC CGGATACCCC CCGGACCTTA CGAAGATCGG AGCCTTGGGA GCACCAGAGC AGAAGTGCTT CTATTCGGCG GTGCTGTTGC TCGCGGCTTC GATCTATCTG CGCAAAGGGT GTCATGTTGT CGTGATCGAA TGCTTGGACG AGAGTCTGGC GTTCGACATG TTCCAGTCGC TCAATGCCAC CGGCACTCCG CTAACCGCAT TTGAGGTGTT CAAGCCGGCC GTCGTGAAGG CCTGGGGAGC CAACTACTCA ACGGGCATCA AGCCGCAGGT GGATCGCATC GAGCGCGTTT TCGATGAAGA GAGCACCGCT GATCGGAAGG AGGACCTCAC CGATCGGGTG ATTGTGTCGA CCGCGTTGGT GTATGACGGG ACCGAGCTGA GCAAGCGCTT CAGCGAAGAA CGGGACTGGT TGACGGCCAC GCTGCCAGCG CCGCCGAGCG CCTTGGCTGC GACGTTCGTT CAGGCGTTAG CTGACCAAGC GGAGTATTTC GATGTCGTCG TCAAGCCACG GCGATCCCCG AAGAACTCGA CCAACCTGGG CTTGGTGACG CACCTCCAGC AGCTCGGCAT GCCGCTCGAC GATGCGGACC TGGCGGCCCT ATGCATCTAC TTCGTTCGCG ATGCACAGCA TGCCATGGCG CACTCTGTCC TCAGCCTGTT CTACGCCAAG CTTCTTCGGG CCAAGGGCGA CCCGGTGGCA TTGGCTGGCG CATCGGCAGA GTTTCTGTCA GCCGCGAAGG CGGTCGCGGC GTTCTTCACC CTTTGGATGG GAGCGTCGCA AGGGCGATTC CCTGATTCGG ACTACCGCAA GCTCTTCGAC CAGCATGCGC CGAACATGTC CCTGAAGGGA GGCGCTGCCA ACCAGACAGC CGCCTACCTC AAGGGTGCCC TCCGCTTGGC CCTCGCTGGC CAGGGGATCT ACGATGCGAC CAGCAGCATC GCAGCTCGCC CGCTCTGGGT GAACGCTGCG AAGGGTTCGC CGTGGTATCA GCGAAAGACC GTATGCAAGT TCGGGTTGTT CGTCTCAGCG 
CACGACGTTG GGGTCGATCT CGCCGCTGGA CGAGAGGGCT TCTACGTCGA TGGCAAGCCC GGTTCGGTGC CAATGCTGAC CTCAAAGGCT TGGCACGCTT CAGCCTTCGA GGTGATCGAG CACGTGGCGA CACGTGACCA GCCCAAGAAG TTGACGTTTC CTGCGATGTT CGATCCGGCG ATCTACCCGG GCAACACGTC CATCGTCGAC AAGATTGGCA ACCTGACCCT ACTGTCAATG CCGGTGAACT CGTCGGTGTA CTCGGAGTGG CCGGACAAGG CCTTCTACTA CTGGCACCTG ACCACGCCAA CCAGTACCGC CGCAGGCCCG AGTTCCGGTG CCCTCATGGC GAGCCTGGGG ATCGCTTCGC TACCACCGAG CCTCAGCACG CTGAGTGCAG GGACTCAACA CGTGCCGCAC TTGGCGCCGT TGGCCCTGCG CGGAACTGTC GGCGCGAAGT GGGATGCGTC GTTCATCGAG CAGCGGTCTC AGCACCTGTG TGAGCGCATC TTTGACAAGC TCGACGGCTG GCTGCGCTAG
|
Protein sequence | MAQPFTHSAE ALHGFFSQMG RGFYIPYYQR NYSWDDENAK KLITDIFAAV RRTLSKPTHA LFLGTVILHD EKNPKLGTHF DTPNLLTKVS NVVDGQQRIT SLAILACVFA EYAGRLASQL TTVGGAHAEF ASLANELTDE VRLIREFFSV ETTKGGAQPS RKPIVIRAGD VTSNPVSDQW TLAGAIDAFY RSNTSSLLAN AINGVKLADL ALDDRLKSVV EEFIDTIDSQ LETADQVLAS RLLVANDMEG SALERFTGYP PDLTKIGALG APEQKCFYSA VLLLAASIYL RKGCHVVVIE CLDESLAFDM FQSLNATGTP LTAFEVFKPA VVKAWGANYS TGIKPQVDRI ERVFDEESTA DRKEDLTDRV IVSTALVYDG TELSKRFSEE RDWLTATLPA PPSALAATFV QALADQAEYF DVVVKPRRSP KNSTNLGLVT HLQQLGMPLD DADLAALCIY FVRDAQHAMA HSVLSLFYAK LLRAKGDPVA LAGASAEFLS AAKAVAAFFT LWMGASQGRF PDSDYRKLFD QHAPNMSLKG GAANQTAAYL KGALRLALAG QGIYDATSSI AARPLWVNAA KGSPWYQRKT VCKFGLFVSA HDVGVDLAAG REGFYVDGKP GSVPMLTSKA WHASAFEVIE HVATRDQPKK LTFPAMFDPA IYPGNTSIVD KIGNLTLLSM PVNSSVYSEW PDKAFYYWHL TTPTSTAAGP SSGALMASLG IASLPPSLST LSAGTQHVPH LAPLALRGTV GAKWDASFIE QRSQHLCERI FDKLDGWLR
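The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted alternative start codons and an initiating codon is read as methionine. The sketch below checks the first and last codons of the listed sequence; the codon sets follow NCBI translation table 11, and only the first and last 10-base blocks of the sequence are copied in, on the assumption that the record shows the complete 2340 bp coding sequence.

```python
# Start and stop codons for NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}

# First and last 10-base blocks of the gene sequence shown above.
first_block = "TTGGCTCAGC"
last_block = "GCTGCGCTAG"

first_codon = first_block[:3]
last_codon = last_block[-3:]

starts_ok = first_codon in TABLE_11_STARTS
stops_ok = last_codon in TABLE_11_STOPS

print(f"First codon {first_codon} is a table 11 start codon: {starts_ok}")
print(f"Last codon {last_codon} is a stop codon: {stops_ok}")
```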