Gene Mpe_A1961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1961 
Symbol 
ID4784747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2099599 
End bp2100669 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content70% 
IMG OID640090531 
Producthypothetical protein 
Protein accessionYP_001021154 
Protein GI124267150 
COG category[S] Function unknown 
COG ID[COG4394] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.12391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAGC CGCTGCGATG GGACCTGTTC TGCCACGTGA TCGACAACTT CGGCGATGTC 
GGCGTGTCGT GGCGGCTCGC TGCCGACCTG GCGCGGCGCG GTCAGCAGGT CCGGCTGTGG
ATCGACGACC CGTCGGCACT CGCCTGGATG GCCCCGGCCG GGCAACCCGG CGTCGAGGTG
CTGCGCTGGG CCGAGCCGCT GCCGGACCGC GAGCCGGGCG ACGTGGTCGT CGAGACCTTC
GGGTGCCAGC TCCCTGCGAC CTTCGTGCAG CGCATGCGGC GCCCCTGCCC GCCGGTGTGG
ATCAACTTCG AGTACCTGAG CGCCGAGTCC TACGTGGAGC GCAGCCATGG GCTGCCCTCG
CCGCAGCTCG AAGGTCCCGG CCAGGGGCTG AGCAAGTGGT TCCTCTACCC CGGCTTCACG
CCGCGCACCG CCGGCCTGAT CCGCGAACCT GAGTTGCTGC CGCGCCGTGC GGCCTTCGAC
GCCACGGCCT GGCTGGCATC CCACGGCGTG CAGCGCCAGC AGGGTGAACG CGTGGTCAGC
CTGTTCTGCT ACGAAAACGC GGCCTTGCCA ACCTGGCTGG ACAGCCTGGC GGAGGTGCCT
ACGGTCCTCC TGGTGACGCC GGACCGGGCG GCACGGCAGG TGCGGTCCGC GCTCGGCGAC
GGCGGTCGGA CGGGTGCGCT GCGAACCGTG ATGTTGCCCT ATCTGCCACA GGACGAGTTC
GATCACCTGC TGTGGGCCAG CGACCTCAAC TTCGTGCGCG GAGAGGATTC GTTCTCGCGC
GCGCAATGGG CGGGCGTGCC CTTCATCTGG CAGATCTACC CGCAGGTGGA CGACTTCCAT
GCCGTCAAGC TGGACGCCTT CCTGGACCGT TACCTCGACG CGGCCGCACC CGCCCAGGGT
GTGCAGATCC GCGCGCTATG GCACGGGTGG AACGGTTTGT CCGGCGTTAC CCGGCCGACC
TGGCCCGAAG GGCAGGATTG GCAGCGCCTT GCCCGCGACT GGAGCGCGCA CCTGGCCGCC
CTGCCCGACG CCACCGACTG TCTGCTTCGC TTCGCGGCGG ACAGAGGCTA A
 
Protein sequence
MTQPLRWDLF CHVIDNFGDV GVSWRLAADL ARRGQQVRLW IDDPSALAWM APAGQPGVEV 
LRWAEPLPDR EPGDVVVETF GCQLPATFVQ RMRRPCPPVW INFEYLSAES YVERSHGLPS
PQLEGPGQGL SKWFLYPGFT PRTAGLIREP ELLPRRAAFD ATAWLASHGV QRQQGERVVS
LFCYENAALP TWLDSLAEVP TVLLVTPDRA ARQVRSALGD GGRTGALRTV MLPYLPQDEF
DHLLWASDLN FVRGEDSFSR AQWAGVPFIW QIYPQVDDFH AVKLDAFLDR YLDAAAPAQG
VQIRALWHGW NGLSGVTRPT WPEGQDWQRL ARDWSAHLAA LPDATDCLLR FAADRG