Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1607 |
Symbol | |
ID | 4787231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1734791 |
End bp | 1735672 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640090175 |
Product | thiamine biosynthesis protein ThiG |
Protein accession | YP_001020804 |
Protein GI | 124266800 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2022] Uncharacterized enzyme of thiazole biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0153075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCAAG CAACCCCGGC CGTCCCCCCG CCAGCCGACA TGGCCAACGA CGATCCCTGG CAGGTGGGCG GCGTGCGCCT GCACAGCCGT TTCCTGCTCG GCACCGCCGG CTACCCTTCT CCGGCGGTGC TGCAGGGCGC GCTGCGTGCG GCGCGCACGC AGGTGGTCAC CGTGGGCCTC AAGCGCACGC TCGCGGCGGC CGGCGACAAC GGCTACATCG CCACCATCCG CCAGACCCTG CGCGACACCG GCGCGCGCCT GCTGCCCAAC ACCGCCGGCT GCCGCACCGC GCGCGAGGCC GTGCAACTGG CGCACATGGC GCGCGAGCTC TACGACACGC CCTGGCTCAA GCTCGAGGTG GTGGGCGACG AGCACACGCT GCAGCCCGAT CCGTTCGAGC TGCTGACGGC GGCCTCGCAG CTGGTGCGCG ACGGCTTCAC CGTGTTTCCC TACTGCACCG ACGATCTCGT CAGCTGCCGT CGCCTGCTCG ATGCCGGCTG CCCGCTGCTG ATGCCCTGGG GCGCGCCGAT CGGCTCGGGC CAGGGCCTGC TGAATCCGTT CGCGCTGCGC ACGCTGCGCG AACGCCTGCC GGACACCACA CTGATCATCG ACGCCGGACT CGGCGCGCCC TCGCATGCCG CGCAGGCGCT GGAGCTGGGC TTCGACGCGG TCCTGCTCAA CTCCGCCGTC GCGCACGCGC GCGAACCGGT GGCGATGGCG CGTGCCTTCC GCCTGGCCGT CGAAGCCGGC CGCGCCGCCT GGCGGGCCGG CGTCATGGCG CGCCAGGACT TCGCCGTCGC CAGCACGCCG GTCAGCGGCC AGCCCTTCAC GCTGCCGGAC CCGCCGGCCG GCTCGGCGGC GCTGCGGGCG GACGCGTCAT GA
|
Protein sequence | MLQATPAVPP PADMANDDPW QVGGVRLHSR FLLGTAGYPS PAVLQGALRA ARTQVVTVGL KRTLAAAGDN GYIATIRQTL RDTGARLLPN TAGCRTAREA VQLAHMAREL YDTPWLKLEV VGDEHTLQPD PFELLTAASQ LVRDGFTVFP YCTDDLVSCR RLLDAGCPLL MPWGAPIGSG QGLLNPFALR TLRERLPDTT LIIDAGLGAP SHAAQALELG FDAVLLNSAV AHAREPVAMA RAFRLAVEAG RAAWRAGVMA RQDFAVASTP VSGQPFTLPD PPAGSAALRA DAS
|
| |