Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1610 |
Symbol | |
ID | 4787234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1737035 |
End bp | 1738921 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640090178 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001020807 |
Protein GI | 124266803 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.403186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCC CCGACAAGCT CGCCACCCTG CTCTCGCTGA CACGCGAGCC CTTCCCCGCC TCGCGCAAGT CCTATCTGCA AGGCTCGCGC AGCGACCTGC GCGTACCGAT GCGCGAAGTG ACGCTGACCA ACGGCGAGAC CGTCTCGCTG TACGACACCT CGGGTCCGTA CACCGAACCC GGCGTCGCGA TCGACGTGAG GCGCGGCCTG CCCAGCGTGC GCACGCCCTG GCTCGACGAG CGTGCCGACA CCGAGGTCTA TGCCGGCCGG CTGCACCAGG CGCTGGACGA CGGCGCGAAG CACGAGGACC GCGAGGCCGA GCGCATCGAA CAGTTGCGCC TCGACGCCGC AGCGCTGCAG CGGCCTCCGC GCCGTGCCAG GGCCGGCGCC AACGTCACGC AGATGCACTA TGCGCGCCGC GGCATCGTCA CGCCCGAGAT GGAGTACGTG GCGCTGCGCG AGAACGGCAA GCGCGAGTGG ATGCGGGAGT ACCTCGGCGA CGCCCCGCGC GAACAGCGCC TGCGCGGCAA CCCGATGGGT GCGCAGATCC CGGCCATCGT GACGCCTGAG TTCGTGCGCG ACGAGGTGGC GCGCGGCCGC GCCATCATCC CGGCCAACAT CAACCACCCC GAAGTGGAGC CGATGGCGAT CGGCCGCAAC TTCCTGGTGA AGATCAACGC CAACATCGGC AACTCGGCCG TCACGTCGAG CATCGAGGAA GAGGTGGAGA AGCTGGTGTG GGCGATCCGC TGGGGCGCCG ACAACGTGAT GGACCTCTCC ACCGGTCGCA ACATCCACAC CACGCGCGAC TGGATCCTGC GCAACTCGCC GGTGCCGATC GGCACCGTGC CGATCTACCA GGCGCTCGAG AAGGTGGGCG GCGTGGCCGA GGACCTCACC TGGGCCATCT TCCGCGACAC GCTGATCGAG CAGGCCGAGC AAGGCGTCGA CTACTTCACC ATCCACGCCG GCGTACGCCT GCCCTTCATC CACCTCACCG CGGACCGCCG CACCGGCATC GTCTCGCGTG GCGGCTCGAT CATGGCCAAG TGGTGCATCT CGCACCACCG CGAGAGCTTC ATCTACGAGC ACTTCGAGGA CATCTGCGAC ATCATGAAGG CCTACGACGT GAGCTTCTCG CTCGGCGACG GCTTGCGCCC CGGCTCGGCC GCCGACGCCA ACGACGAGGC GCAGTTCGCC GAACTGCGCA CGCTGGGCGA GCTGACCCAG GTGGCCTGGA AGCACGACGT GCAGACCATG ATCGAAGGCC CGGGCCACGT GCCGATGCAC ATGATCCAGG CCAACATGGA CGAGCAGCTC AAGCACTGCC ACGAGGCGCC GTTCTACACG CTGGGGCCGC TGACGATCGA CATCGCGCCC GGCTACGACC ACATCGCCAG CGCCATCGGC GCGGCCATGA TCGGCTGGTT CGGCACCGCG ATGCTGTGCT ACGTGACGCC CAAGGAACAC CTGGGCCTGC CCGACCGCGA GGACGTGAAG CAGGGCATCG TCGCCTACAA GATCGCCGCG CACGCGGCTG ATGTCGCCAA GGGTCACCCC GGCGCTCGCG CCCGCGACGA CGCGCTGTCC AAGGCGCGCT TCGAGTTCCG CTGGATGGAC CAGTTCAACC TTTCGCTGGA CCCCGACACC GCACGCGACT TCCACGACGA GACGCTGCCC AAGGACGCCA GCAAGGTGGC GCACTTCTGC TCGATGTGCG GGCCCAAGTT CTGCTCGATG AAGATCACCC AGGAGGTGCG CGACTACGCC GCGCAGCGCG GCGTCAGCGA GGCGCAGGCG CTGGGCGCCG GCATGGCCGA GAAGTCGAGC CAGTTCCGGC AGGCCGGCGG CGAGATCTAC ATCCCGCTCG CCGTCGACAA GGGCTGA
|
Protein sequence | MNAPDKLATL LSLTREPFPA SRKSYLQGSR SDLRVPMREV TLTNGETVSL YDTSGPYTEP GVAIDVRRGL PSVRTPWLDE RADTEVYAGR LHQALDDGAK HEDREAERIE QLRLDAAALQ RPPRRARAGA NVTQMHYARR GIVTPEMEYV ALRENGKREW MREYLGDAPR EQRLRGNPMG AQIPAIVTPE FVRDEVARGR AIIPANINHP EVEPMAIGRN FLVKINANIG NSAVTSSIEE EVEKLVWAIR WGADNVMDLS TGRNIHTTRD WILRNSPVPI GTVPIYQALE KVGGVAEDLT WAIFRDTLIE QAEQGVDYFT IHAGVRLPFI HLTADRRTGI VSRGGSIMAK WCISHHRESF IYEHFEDICD IMKAYDVSFS LGDGLRPGSA ADANDEAQFA ELRTLGELTQ VAWKHDVQTM IEGPGHVPMH MIQANMDEQL KHCHEAPFYT LGPLTIDIAP GYDHIASAIG AAMIGWFGTA MLCYVTPKEH LGLPDREDVK QGIVAYKIAA HAADVAKGHP GARARDDALS KARFEFRWMD QFNLSLDPDT ARDFHDETLP KDASKVAHFC SMCGPKFCSM KITQEVRDYA AQRGVSEAQA LGAGMAEKSS QFRQAGGEIY IPLAVDKG
|
| |