Gene Mpe_A3727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3727 
Symbol 
ID4786016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3941529 
End bp3942566 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content74% 
IMG OID640092310 
Productthiamine biosynthesis protein 
Protein accessionYP_001022915 
Protein GI124268911 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.351343 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAC GCCTGTCCTT CGCCGACCTG CCCCTGCCGG GCTACCGCCA CGACGGTGGG 
GTCGCGCCGC GCCTGCTGGC CGCCGCCGTG CAGGAGCTGG GCGGGCCGAC GATGGGCACC
CGCTGGAGCG TGAAGTACTG GCATGCGCCG GCCACGCCCG GCCCGGCCCG CCGCGAGGTC
CGCGAGGCGA TCGAGATCGC GCTGGACCTG GTGGTGCGCC AGATGAGCAC CTGGGAGGAC
GACTCCGACC TGAGCCGCTA CAACCGCGCC GCGCCGGGCC GCTGGCAGAA ACTGCCCGAG
CCCCTCTTCA GCGTGCTGCA GCACGCGCTC GAACTGGCGC GCGCCACCGG TGGGGCCTAC
GACCCGACCG TCGGCCCCGC GGTCAACCTC TGGGGCTTCG GCCCCGACCC GGCGCGCCGC
GATGCGCCGA CGGAAGGCGA TCTGGAGATG GCGCGCCGCC GCATCGGCTG GCAGCGCGTG
CAGCTCGACG TCGAGCAGCG CCGTGCACGC CAGGATGGCG GCACCTACGT CGACCTGTCG
TCGATCGCCA AGGGCTATGC GGTCGACGCC GTCGCGCGTG CGCTGCAGCG GCTGGGTTGC
GGCAACGCGC TCGTCGAGGT CGGCGGCGAG CTGCTCGGCA TGGGCCGCCG GCCCGATGGG
CAGCCGTGGC GGGTGGCGGT CCGGCTGCCC GGACTGGAAC AGGGCGATGC CGGTCCGGTG
CTCGCACTCA AGGGGCTGGC GGTCGCGACC TCCGGCGACG ACTTCCGCTG CTTCGAGACC
GACGACGGCG AGCGCCATTC CCACACCATC GACCCGCGCA CCGGCCGGCC GGTGCGGCAC
GCGCTGGCGT CGGTGACGGT CGTGCACGCG CAATGCATGC AGGCCGACGC GCTGGCCACG
GCGCTGACGG TGCTCGGGCC CGATGAGGGC TGGACCTACG CCGAGCGGGA GCGGCTGGCC
GTGCTGTTCA TCCGCCGTGC TGCGGATGGC GGCCACGAGG CCCGCCCGAC GGCCGGGTTC
GAAGCACTGC TGGCATGA
 
Protein sequence
MTTRLSFADL PLPGYRHDGG VAPRLLAAAV QELGGPTMGT RWSVKYWHAP ATPGPARREV 
REAIEIALDL VVRQMSTWED DSDLSRYNRA APGRWQKLPE PLFSVLQHAL ELARATGGAY
DPTVGPAVNL WGFGPDPARR DAPTEGDLEM ARRRIGWQRV QLDVEQRRAR QDGGTYVDLS
SIAKGYAVDA VARALQRLGC GNALVEVGGE LLGMGRRPDG QPWRVAVRLP GLEQGDAGPV
LALKGLAVAT SGDDFRCFET DDGERHSHTI DPRTGRPVRH ALASVTVVHA QCMQADALAT
ALTVLGPDEG WTYAERERLA VLFIRRAADG GHEARPTAGF EALLA