Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3727 |
Symbol | |
ID | 4786016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3941529 |
End bp | 3942566 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640092310 |
Product | thiamine biosynthesis protein |
Protein accession | YP_001022915 |
Protein GI | 124268911 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.351343 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACAC GCCTGTCCTT CGCCGACCTG CCCCTGCCGG GCTACCGCCA CGACGGTGGG GTCGCGCCGC GCCTGCTGGC CGCCGCCGTG CAGGAGCTGG GCGGGCCGAC GATGGGCACC CGCTGGAGCG TGAAGTACTG GCATGCGCCG GCCACGCCCG GCCCGGCCCG CCGCGAGGTC CGCGAGGCGA TCGAGATCGC GCTGGACCTG GTGGTGCGCC AGATGAGCAC CTGGGAGGAC GACTCCGACC TGAGCCGCTA CAACCGCGCC GCGCCGGGCC GCTGGCAGAA ACTGCCCGAG CCCCTCTTCA GCGTGCTGCA GCACGCGCTC GAACTGGCGC GCGCCACCGG TGGGGCCTAC GACCCGACCG TCGGCCCCGC GGTCAACCTC TGGGGCTTCG GCCCCGACCC GGCGCGCCGC GATGCGCCGA CGGAAGGCGA TCTGGAGATG GCGCGCCGCC GCATCGGCTG GCAGCGCGTG CAGCTCGACG TCGAGCAGCG CCGTGCACGC CAGGATGGCG GCACCTACGT CGACCTGTCG TCGATCGCCA AGGGCTATGC GGTCGACGCC GTCGCGCGTG CGCTGCAGCG GCTGGGTTGC GGCAACGCGC TCGTCGAGGT CGGCGGCGAG CTGCTCGGCA TGGGCCGCCG GCCCGATGGG CAGCCGTGGC GGGTGGCGGT CCGGCTGCCC GGACTGGAAC AGGGCGATGC CGGTCCGGTG CTCGCACTCA AGGGGCTGGC GGTCGCGACC TCCGGCGACG ACTTCCGCTG CTTCGAGACC GACGACGGCG AGCGCCATTC CCACACCATC GACCCGCGCA CCGGCCGGCC GGTGCGGCAC GCGCTGGCGT CGGTGACGGT CGTGCACGCG CAATGCATGC AGGCCGACGC GCTGGCCACG GCGCTGACGG TGCTCGGGCC CGATGAGGGC TGGACCTACG CCGAGCGGGA GCGGCTGGCC GTGCTGTTCA TCCGCCGTGC TGCGGATGGC GGCCACGAGG CCCGCCCGAC GGCCGGGTTC GAAGCACTGC TGGCATGA
|
Protein sequence | MTTRLSFADL PLPGYRHDGG VAPRLLAAAV QELGGPTMGT RWSVKYWHAP ATPGPARREV REAIEIALDL VVRQMSTWED DSDLSRYNRA APGRWQKLPE PLFSVLQHAL ELARATGGAY DPTVGPAVNL WGFGPDPARR DAPTEGDLEM ARRRIGWQRV QLDVEQRRAR QDGGTYVDLS SIAKGYAVDA VARALQRLGC GNALVEVGGE LLGMGRRPDG QPWRVAVRLP GLEQGDAGPV LALKGLAVAT SGDDFRCFET DDGERHSHTI DPRTGRPVRH ALASVTVVHA QCMQADALAT ALTVLGPDEG WTYAERERLA VLFIRRAADG GHEARPTAGF EALLA
|
| |