Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_B0120 |
Symbol | |
ID | 4787723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 109369 |
End bp | 111312 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640092529 |
Product | hypothetical protein |
Protein accession | YP_001023134 |
Protein GI | 124262664 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0210] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000210723 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCAAGAC GGCAGCAAGG GCGAGTGAGC CCGGCGCAAC GAATCGAGAT CCAGATCATG ACGACCCTTC AACTCACCCC CGAGCAGAGC CTGATCGTCC ACAGCAGCGT GGACGTGCTC CTGGTTCAAG CCTTCGCCGG CACGGGCAAG ACCTCGACCC TGATCGAGTA CGCCCGGGCC AACCCGCATC TGCGCATCCT CTACCTGGCG TTCTCCAAGG CGATCCAGCT GGAGGCGGCC AAGCGCTTCC CGACCAACGT CACCTGCAAG ACCACCCACG GTCTGGCCTG GCGGCGTTCG GGCGCCCTCT ACAAGGAGGC GGGCAAGCTC GGCTTCATCA ACGTGCGCGA CCTGATGCAA CTCTTCAGCG TCGAGGCTCG TGAGGCCCGG GTCGTTCTCA CCACCCTGGA GCGCTTCATC CAGTCGGCCG ACGCCAAGGT CCAAGACGCC CACGTGCCCT CGGACATCAT GAACACGGCT CACCGTGCAA GGGCGGGTCA ACTGGCGGCG CGTGCCTGGG ACGTGATGCG CGACCTGCAG AACTCGGCCC TGAAGATGAC CCACGACGGC TACTTCAAGT TGTTCCAGCT GAGCAACCCG GACCTGTCGA TGGAGTGCGA CGTGATCCTC GGCGACGAGT GGCAGGACAC CAACCCGGTC ACCCACGCCC TTGTCTCGGC ACAACAGTGC CGCAAGGTCT TCGTCGGCGA CTCGCACCAG TCGATCTTCG CCTTCCGCGG CGCCGTCAAC GCGATGAAAC AAGTCAAGGC GGAACAAGTC CTGCGCCTGA CCCACTCGTT CCGCTTCGGC TCGGGCATCG CCCAGCTCGC CACCAAGATC CTCGCCGGCC TCAAGGGCGA GAAGCACCCG CTCGTCGGCC GCGGCCGCAA CGAGTCGGTC TTCACCGTCG ACCGCACCCG CCCCTACACC GTCATCGCCC GCTCCAACGG CACCCTCTTC GCCGAAACCG TGGCGCTGCT GGGCCGCGTC AAGTTCCACC TGGTCGGCCT CGAAGAAGAC CGCGACCGCA ACCTGACCTA CGCCCCGTTC GAGAAGCTGG TCGACGTCCA CTACGTCCTC ACGCACCAGG GCCACCTGGC GCGCGACGCG TTCATCCGCG CCTTCAAGTC GCCCGCCCAG CTCAAGGACT ACGCCGGCGC CGCCGACGAC AAAGAACTGC TGATGCTGGT CAAGATCGCC GAAGACTACG GCCGCAACGT GCCGGCCCTG GTCTCGCGCA TCAAGGCCGA AGTCCTGCGC GACGGCCACG ACGCCCACGT CACCCTCACC ACCGCCCACC GCTCCAAGGG CCTGGAATGG GATCAGGTCG TGCTCTGCAG CGACTTCGAG GAATTCATCG GCGACAAGGG CGACCTGCGC CGCGCCACGA CGCAAGAGCT CGTGCAGGAA GCCAACCTCC TGTACGTGGC CGCCACCCGC GCCCTGCGCG CCCTGGAGAA GAACCGCCAG ATCCTCGAGA TCGAGCAGCT GCTGGCCAAG GCCAACTTCG AGCTGCCGAA GGAACAAGCC CTGCCGGCCC CGGCCGCCGC ACCCGCGAAG GCCCCGGCCG CCGCCGCGGC GCCGGTCGCC CCGCGCGCCG AAGGCCTGGC TGCCACCATG CACTCGGCGG AAGAAGCTGC GCGCGCCGCT CAACCCCCGG TGCAACTGGC CGGCGAACTG CCCAAGCTGC CGCACGACTT CCACAAGCGC GCCCTGCTCG AGCAGGTGCA GCACGCGATC CTCGTGGTCG GCCTGCTGGA CCTGGGCGAA CTCGCCGCCC TGCTGGCGCG CACCCGCGAA GACACCGCCC GCATCCTCGG CAACCTGATC GCCAAGGGCC ACATCTCGGC TCGCCTGTTC GCCCACGAGC CGGCGATCGC CGCCTCGGCC ACCGCCGCCC AGACCCAGGC CGCCGCACCG CAAGCGGCCA TCGACTTCCT CTGA
|
Protein sequence | MARRQQGRVS PAQRIEIQIM TTLQLTPEQS LIVHSSVDVL LVQAFAGTGK TSTLIEYARA NPHLRILYLA FSKAIQLEAA KRFPTNVTCK TTHGLAWRRS GALYKEAGKL GFINVRDLMQ LFSVEAREAR VVLTTLERFI QSADAKVQDA HVPSDIMNTA HRARAGQLAA RAWDVMRDLQ NSALKMTHDG YFKLFQLSNP DLSMECDVIL GDEWQDTNPV THALVSAQQC RKVFVGDSHQ SIFAFRGAVN AMKQVKAEQV LRLTHSFRFG SGIAQLATKI LAGLKGEKHP LVGRGRNESV FTVDRTRPYT VIARSNGTLF AETVALLGRV KFHLVGLEED RDRNLTYAPF EKLVDVHYVL THQGHLARDA FIRAFKSPAQ LKDYAGAADD KELLMLVKIA EDYGRNVPAL VSRIKAEVLR DGHDAHVTLT TAHRSKGLEW DQVVLCSDFE EFIGDKGDLR RATTQELVQE ANLLYVAATR ALRALEKNRQ ILEIEQLLAK ANFELPKEQA LPAPAAAPAK APAAAAAPVA PRAEGLAATM HSAEEAARAA QPPVQLAGEL PKLPHDFHKR ALLEQVQHAI LVVGLLDLGE LAALLARTRE DTARILGNLI AKGHISARLF AHEPAIAASA TAAQTQAAAP QAAIDFL
|
| |