Gene Mpe_A1610 details


Gene Information

Locus tag: Mpe_A1610
Symbol:
ID: 4787234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1737035
End bp: 1738921
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 68%
IMG OID: 640090178
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001020807
Protein GI: 124266803
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
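
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The Python sketch below assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1737035
end_bp = 1738921

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # → 1887
print(gene_length % 3)  # → 0 (whole number of codons)
print(protein_length)   # → 628
```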


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.403186
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCCC CCGACAAGCT CGCCACCCTG CTCTCGCTGA CACGCGAGCC CTTCCCCGCC 
TCGCGCAAGT CCTATCTGCA AGGCTCGCGC AGCGACCTGC GCGTACCGAT GCGCGAAGTG
ACGCTGACCA ACGGCGAGAC CGTCTCGCTG TACGACACCT CGGGTCCGTA CACCGAACCC
GGCGTCGCGA TCGACGTGAG GCGCGGCCTG CCCAGCGTGC GCACGCCCTG GCTCGACGAG
CGTGCCGACA CCGAGGTCTA TGCCGGCCGG CTGCACCAGG CGCTGGACGA CGGCGCGAAG
CACGAGGACC GCGAGGCCGA GCGCATCGAA CAGTTGCGCC TCGACGCCGC AGCGCTGCAG
CGGCCTCCGC GCCGTGCCAG GGCCGGCGCC AACGTCACGC AGATGCACTA TGCGCGCCGC
GGCATCGTCA CGCCCGAGAT GGAGTACGTG GCGCTGCGCG AGAACGGCAA GCGCGAGTGG
ATGCGGGAGT ACCTCGGCGA CGCCCCGCGC GAACAGCGCC TGCGCGGCAA CCCGATGGGT
GCGCAGATCC CGGCCATCGT GACGCCTGAG TTCGTGCGCG ACGAGGTGGC GCGCGGCCGC
GCCATCATCC CGGCCAACAT CAACCACCCC GAAGTGGAGC CGATGGCGAT CGGCCGCAAC
TTCCTGGTGA AGATCAACGC CAACATCGGC AACTCGGCCG TCACGTCGAG CATCGAGGAA
GAGGTGGAGA AGCTGGTGTG GGCGATCCGC TGGGGCGCCG ACAACGTGAT GGACCTCTCC
ACCGGTCGCA ACATCCACAC CACGCGCGAC TGGATCCTGC GCAACTCGCC GGTGCCGATC
GGCACCGTGC CGATCTACCA GGCGCTCGAG AAGGTGGGCG GCGTGGCCGA GGACCTCACC
TGGGCCATCT TCCGCGACAC GCTGATCGAG CAGGCCGAGC AAGGCGTCGA CTACTTCACC
ATCCACGCCG GCGTACGCCT GCCCTTCATC CACCTCACCG CGGACCGCCG CACCGGCATC
GTCTCGCGTG GCGGCTCGAT CATGGCCAAG TGGTGCATCT CGCACCACCG CGAGAGCTTC
ATCTACGAGC ACTTCGAGGA CATCTGCGAC ATCATGAAGG CCTACGACGT GAGCTTCTCG
CTCGGCGACG GCTTGCGCCC CGGCTCGGCC GCCGACGCCA ACGACGAGGC GCAGTTCGCC
GAACTGCGCA CGCTGGGCGA GCTGACCCAG GTGGCCTGGA AGCACGACGT GCAGACCATG
ATCGAAGGCC CGGGCCACGT GCCGATGCAC ATGATCCAGG CCAACATGGA CGAGCAGCTC
AAGCACTGCC ACGAGGCGCC GTTCTACACG CTGGGGCCGC TGACGATCGA CATCGCGCCC
GGCTACGACC ACATCGCCAG CGCCATCGGC GCGGCCATGA TCGGCTGGTT CGGCACCGCG
ATGCTGTGCT ACGTGACGCC CAAGGAACAC CTGGGCCTGC CCGACCGCGA GGACGTGAAG
CAGGGCATCG TCGCCTACAA GATCGCCGCG CACGCGGCTG ATGTCGCCAA GGGTCACCCC
GGCGCTCGCG CCCGCGACGA CGCGCTGTCC AAGGCGCGCT TCGAGTTCCG CTGGATGGAC
CAGTTCAACC TTTCGCTGGA CCCCGACACC GCACGCGACT TCCACGACGA GACGCTGCCC
AAGGACGCCA GCAAGGTGGC GCACTTCTGC TCGATGTGCG GGCCCAAGTT CTGCTCGATG
AAGATCACCC AGGAGGTGCG CGACTACGCC GCGCAGCGCG GCGTCAGCGA GGCGCAGGCG
CTGGGCGCCG GCATGGCCGA GAAGTCGAGC CAGTTCCGGC AGGCCGGCGG CGAGATCTAC
ATCCCGCTCG CCGTCGACAA GGGCTGA
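
The GC content reported under Gene Information can be recomputed from the gene sequence. This is a minimal sketch, assuming the space-separated 10-base blocks shown above are pasted in as plain text. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq_text: str) -> float:
    """Return the fraction of G and C bases, ignoring all whitespace."""
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGAACGCCC CCGACAAGCT CGCCACCCTG CTCTCGCTGA CACGCGAGCC CTTCCCCGCC"
print(f"GC fraction of sample: {gc_content(first_line):.2f}")
```

Passing the full 1887 bp sequence instead of the sample line gives a value that can be compared with the reported 68%.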
 
Protein sequence
MNAPDKLATL LSLTREPFPA SRKSYLQGSR SDLRVPMREV TLTNGETVSL YDTSGPYTEP 
GVAIDVRRGL PSVRTPWLDE RADTEVYAGR LHQALDDGAK HEDREAERIE QLRLDAAALQ
RPPRRARAGA NVTQMHYARR GIVTPEMEYV ALRENGKREW MREYLGDAPR EQRLRGNPMG
AQIPAIVTPE FVRDEVARGR AIIPANINHP EVEPMAIGRN FLVKINANIG NSAVTSSIEE
EVEKLVWAIR WGADNVMDLS TGRNIHTTRD WILRNSPVPI GTVPIYQALE KVGGVAEDLT
WAIFRDTLIE QAEQGVDYFT IHAGVRLPFI HLTADRRTGI VSRGGSIMAK WCISHHRESF
IYEHFEDICD IMKAYDVSFS LGDGLRPGSA ADANDEAQFA ELRTLGELTQ VAWKHDVQTM
IEGPGHVPMH MIQANMDEQL KHCHEAPFYT LGPLTIDIAP GYDHIASAIG AAMIGWFGTA
MLCYVTPKEH LGLPDREDVK QGIVAYKIAA HAADVAKGHP GARARDDALS KARFEFRWMD
QFNLSLDPDT ARDFHDETLP KDASKVAHFC SMCGPKFCSM KITQEVRDYA AQRGVSEAQA
LGAGMAEKSS QFRQAGGEIY IPLAVDKG
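
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes that the codon assignments of translation table 11 match the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard codon assignments in NCBI order (first base T, C, A, G; then
# second base; then third base). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq_text: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = "".join(seq_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence listed under "Gene sequence".
first_line = "ATGAACGCCC CCGACAAGCT CGCCACCCTG CTCTCGCTGA CACGCGAGCC CTTCCCCGCC"
last_line = "ATCCCGCTCG CCGTCGACAA GGGCTGA"

print(translate(first_line))  # → MNAPDKLATLLSLTREPFPA
print(translate(last_line))   # → IPLAVDKG
```

The two outputs match the first 20 and last 8 residues of the protein sequence above. The last line can be translated on its own because the 31 full lines before it hold 1860 bases, a multiple of three.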