Gene Information |
Locus tag | Mpe_B0391 |
Symbol | |
ID | 4788019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 345204 |
End bp | 347009 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640092823 |
Product | hypothetical protein |
Protein accession | YP_001023401 |
Protein GI | 124262931 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4547] Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5,6-dimethylbenzimidazole phosphoribosyltransferase) |
TIGRFAM ID | |
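The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The record states neither convention, but both are consistent with its values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 345204
end_bp = 347009
gene_length_bp = 1806
protein_length_aa = 601

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
codons = span // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the span is 1806 bp, or 602 codons, which leaves 601 residues once the stop codon is excluded.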
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.951822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0515178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAGG GAGCCAAGAC GGTGTCCATG CGGCACCTCT ACCTCTACAC CCACCTGCTT CTCGACGAGG CGGGCCGCCA GGGCCGCAAG GTCCGGCTGG AGTGGGCCAA CATCGAGACC GCGGCCATCG GCCCGGACCC GAAGAACCCC GCGGGTCTGC TGATGAAGCT GCCCCACATG GCTTCGGTCG GTACGGAAGG CGATGCAACG CTCCTGCGCG CGCTCGTCTC ACACGAAGTG CTCTGCCACG GCCACCACAC CGACTTCAGC GTCATGCCTG ACAAGGGCAT CGGCGGCGTA CTGGAGAACG TGCTGGAGGA TCCGCGCGGC GAGCTGCTCG CGCTGGCCCG GTATCCCGGG TCCAAGAAGG TGATCCGCGA GGGGATCGAG GTCCTGGTCG ACCGCGGCGT GTTCGCCGGC CCCAAGGCCG ACGAGCAACT GCACCCGGCC GAGATCCTCA CGTCCTGGCT GGTGACCGAG CTGCGCTCGG AACTGCTTGG GCAATCCTGC TTGGAAGCGT TCTCGCGCGA CTATCGAAAG CTGGCCATCC AGACTTTCGG CAGCCGCCTG ACCGCCGCGG TGAAGTCCGA GGCCCTCAAG GCCACGGCTG CACCCACAAC GGCAGATGTC CAGGTGCACT CGCGCAAGAT CCTCGAGCTG CTGAAGATGG CCAAGGAGAA CCCGCCGCCG CAACAGCAAC CGCAAAGCGG CCAGTCCGGC CAGCCTGACA AGGCTGATGC AGGGCAGGGC GCAGGCAACT CGAGCGACCC TTCGCAGGGT GGCAACCCGG GCGACAAGCC TGGGAAGGCT GACGGCGCGC CCCAACCGGG CAAGAAGGGC AGGGCCGGGA AGGGCGACCA ACCTGGCGAT CAGCCCGGAG ATCCTGCAAA GGGCGGGTTC GAGCCGAACC AGGGCGACCT GGCCAAGGCG ATCGAGCAAG TCCTCAAGGC ATCGAAGGAA GACGCCGGCA CGTACGGCAA GGGCCTCGAA GACCACCTGG TCGAAGGCAG CGAAGCCATG GCGACTGGCG GCGGCATGAG CCACACCCAC GAGATGGGAG TGGCATCCAG GCCGCAGCGC GACTCGGAAG AGAACCGCAC TGCGATGCGC GCCGGCGCGC GGGCGATCAC GGCCGCGCTG GGGCTCAAAA TCGAGGAACT CCTCGAATCG CGAGCACTGG TGCACCGTCG CCGCTCGACC GAAGGCCGCA TCCGGCCTGG CCGGGTCTGG CGCTTGGTGC AAGGCGACAC GGAGATCTTC CAGAAGCGCA GCCTGCAGGA GGAGCTCGAC ACGTGCGTGA TGGTCCTGGA CGACGAGTCC GGATCGATGA ACGAGCCGTT CGGCGACATG CGCCGCGAGG ACGCTGCCTC ACGCGTTTGC GTGGGCGCTG GCGAGGTGCT CAACAACGCC GAGGTGCCGT TTGCACTGGT GGGCTACAAC ACGTCGCTCC ATCAGTACAA GGGCTTCGAC GACAGCTGGG CTGAAACGCT CAAGGACTTC GGTCCCCACA GCGCGAGCAG CACCAACACG CACCTCGCGG TCGTTTGGGC ACTTCGGGAG CTGATTAACC GCAAGGAGCG TCGCAAGATC CTGAAGGTCG TGACCGACGG AGATCCTGGC GATCAAACGG TGCTGGCCGC CGCGATCGAG GAGGCCAAGG CCTTCGGAGT CGAGGTTCGC TTCGTGCTCA TCTCTTCGCG AGAAGAGTAC AAGTACCGCA GCATGGGCGT CCCCTACGGC GTCGCCAACG ACGCCCCGGA GCTCGCCAAC GCGGTGTTCG CAAGCCTCGA GGCGGCATTC 
GCCTGA
|
Protein sequence | MQQGAKTVSM RHLYLYTHLL LDEAGRQGRK VRLEWANIET AAIGPDPKNP AGLLMKLPHM ASVGTEGDAT LLRALVSHEV LCHGHHTDFS VMPDKGIGGV LENVLEDPRG ELLALARYPG SKKVIREGIE VLVDRGVFAG PKADEQLHPA EILTSWLVTE LRSELLGQSC LEAFSRDYRK LAIQTFGSRL TAAVKSEALK ATAAPTTADV QVHSRKILEL LKMAKENPPP QQQPQSGQSG QPDKADAGQG AGNSSDPSQG GNPGDKPGKA DGAPQPGKKG RAGKGDQPGD QPGDPAKGGF EPNQGDLAKA IEQVLKASKE DAGTYGKGLE DHLVEGSEAM ATGGGMSHTH EMGVASRPQR DSEENRTAMR AGARAITAAL GLKIEELLES RALVHRRRST EGRIRPGRVW RLVQGDTEIF QKRSLQEELD TCVMVLDDES GSMNEPFGDM RREDAASRVC VGAGEVLNNA EVPFALVGYN TSLHQYKGFD DSWAETLKDF GPHSASSTNT HLAVVWALRE LINRKERRKI LKVVTDGDPG DQTVLAAAIE EAKAFGVEVR FVLISSREEY KYRSMGVPYG VANDAPELAN AVFASLEAAF A
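The sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch, the code below shows one way to strip that grouping, compute a GC fraction, and translate codons. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping (assumption stated in the lead-in).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two displayed blocks and the last two codons of the gene sequence above.
prefix = clean_sequence("ATGCAACAGG GAGCCAAGAC")
print(translate(prefix))      # MQQGAK
print(gc_fraction(prefix))    # 0.55 for this 20-base prefix only
print(translate("GCCTGA"))    # A (translation stops at the TGA stop codon)
```

Running the same helpers on the full gene sequence would allow the reported GC content and protein sequence to be checked. The prefix result agrees with the first six residues of the protein sequence shown above.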
|
| |