Gene Information |
Locus tag | Mpe_A1216 |
Symbol | |
ID | 4787063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1310998 |
End bp | 1312008 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640089781 |
Product | thiamine biosynthesis lipoprotein |
Protein accession | YP_001020413 |
Protein GI | 124266409 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
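
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 1310998
END_BP = 1312008


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(f"Gene length: {n} bp")
    print(f"Protein length: {protein_length(n)} aa")
```

Under these assumptions the results agree with the reported Gene Length (1011 bp) and Protein Length (336 aa).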
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCATGT CCAGCCTGTC GAGTGTCGGC CGCCGTGCGG CGGGCGCGGT ACGCATGCCG GCCGCGGCCT GGGTTCAGGG CGGTTGGATG CGGCGCGAGG AGGCCATCAT GGGCACCTCG ATCAGCGTCG AGCTGTGGAG CGAAGACCCG TCCGCCGGCA ACGCCGCGAT GGATCTGGTG ATCGGCGAGA TGCACCGCAT CGACCGCGGC ATGAGCCCGC ACAAGCCGGA CTCCGAGCTG TCGCGCATCA ACCGAGAGGC GTCGGTTCGG CCGGTACCGC TCAGCGAAGA GATGTTCGCG CTGCTGGCGC GCTCGCTGGA GTTCTCGCGC CGCTCCGAAG GTGCCTTCGA CATCACCTTC GCCGGCGCCG GCCGGCTGTA CGACTACCGC GAGCGCATCC GGCCGACCGA TGCCGCGCTG GCACAGGCCT GTGCGGCCGT CGGCCACCAG TACCTGGAGC TCGACGCCGC CGCGCGCAGC GTGCGCTTCG CCCGCGACGG CCTGCGCATC GACCTGGGCG GCTTCGCGAA GGGGCATGCG GTGGACAACG CCGCCGCGAT CCTTGCGCGC CGCGGCATCC GCCATGCCTT CATCAGCGCC GGCGGCGACA GCCGCGTCAT CGGCGACCGC CGCGGCCGGC CCTGGACCAT CGGTGTGCGC GATCCGCGGC GGCCTGGCGA GATCATCGCG CTGCTTCCGC TCGAGGACGC GGCGGTCTCC ACCTCCGGGG ACTACGAGCG CTACTTCGAC ACGCCCGACG GCGCACGCTG CCATCACATC CTCGATCCGA GGACCGGCAA ATCCCCGGAC AGCGTGCGCA GCGTGACCAT CATCGCGCCG GACGGGCTGA CCAGCGAAGC GCTCTCGAAG TGCCTGTTCG TGATGGGCGT CGAGCGCGGC CTGCGCTTCG TCGAATCGCA CGCCGGTGTC GACGCCGTGG TGGTCGACGC GGCGGGGGCG CTGCACTACT CGTCCGGACT GCTCGCCGCC GGCGCGCAGC CGCGGCAGTG A
Protein sequence | MSMSSLSSVG RRAAGAVRMP AAAWVQGGWM RREEAIMGTS ISVELWSEDP SAGNAAMDLV IGEMHRIDRG MSPHKPDSEL SRINREASVR PVPLSEEMFA LLARSLEFSR RSEGAFDITF AGAGRLYDYR ERIRPTDAAL AQACAAVGHQ YLELDAAARS VRFARDGLRI DLGGFAKGHA VDNAAAILAR RGIRHAFISA GGDSRVIGDR RGRPWTIGVR DPRRPGEIIA LLPLEDAAVS TSGDYERYFD TPDGARCHHI LDPRTGKSPD SVRSVTIIAP DGLTSEALSK CLFVMGVERG LRFVESHAGV DAVVVDAAGA LHYSSGLLAA GAQPRQ
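
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions, not the database's own pipeline: it uses only the Python standard library, the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene begins with ATG).

```python
# Codon table in NCBI order (first, second, third base each cycling T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base groups and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as grouped in the record.
    prefix = "ATGTCCATGT CCAGCCTGTC GAGTGTCGGC"
    print(translate(prefix))  # MSMSSLSSVG, the first 10 residues of the protein
    print(f"GC fraction of prefix: {gc_content(prefix):.2f}")
```

Passing the full gene sequence to `translate` and `gc_content` would allow the Protein sequence and GC content fields to be checked in the same way.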