Gene Mpe_A1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1722 
Symbol 
ID4785275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1847124 
End bp1848122 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content68% 
IMG OID640090293 
Producthypothetical protein 
Protein accessionYP_001020917 
Protein GI124266913 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0741572 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCTC TCCCGGAAAT CGACCCCACG TCTTCGCTGC AGTTCATCGT CAATGCGGCG 
GCGGGCAGCA GCGACGCGGA GGCGAAGCGC GAAATCGTCG AAGCCGCGCT GCGCGCGGGT
GGACGGCGGG GTGACTTGCT CTTCTGCAGC CCCGCCGAGT TGATTGGCGT GTCGCACCAG
GCGGCGACGA GGGCGATCGC CACCCGCACG GCCGTGGTCG CCGTCGGTGG CGACGGCACG
CTCAACACCG TGGCACAGGC TGCACACGCT GCGGGCTGCG CCATGGGCGT GGTGCCACAG
GGCACCTTCA ACTACTTTGC CCGCACGCAC GGCATACCCG CAGACCCGGC CGATGCCGTC
CGCCAATTGC TGCTTTCGGT GCCTGCGCCG GTTCAAGTGG CCGGCATCAA CGACCGCGTG
TTCCTGGTCA ACGCCAGTCT CGGGCTCTAT CCTGAACTGC TGGAAGACCG TGAAGCCTAC
AAGGCCCGCT TCGGTCGCAG CCGCTGGGTG GCGTTCGTGG CAGCCTGTGC GACTTTGCTT
CGTGCGCAGC GCCGCTTGCG ATTGCACATC GAGATGGGTG GCAAGGTGCG CGACATGCAG
ACCTTGACGC TCTTCGTGGG CAACAACCGC CTGCAGCTGC AGCAGTTCGG CGCCGAGCCC
GATGACACCC TGGCCGGCAC GCCAGGCGAC GGCAGCATGG CCGCGCTCGT GCTGCGGCCT
ATCGGAACGC TGTCGATGAT CGGCCTGATG CTGCATGGCG CCATGGGCAG GCTGGGTGAA
GCCGCAGGCG TCGAGCGCTT CGAGTTCGAG CACCTGGTGG TGCGGCCTAC GCTGCCGCAG
GGCCGCAGCG GGGTGAAGGT GGCCTTCGAT GGCGAAGTGA CGATGATGCG CGCACCGCTG
GACTTCCGGG TGCTGGCCAA ACCACTGTAC CTGCTGATGC CACAGCGCGA CGCTGCCGTT
GTCGACGCCC GATCCAGCGC CGAAGGGGCA GCGCCTTGA
 
Protein sequence
MAALPEIDPT SSLQFIVNAA AGSSDAEAKR EIVEAALRAG GRRGDLLFCS PAELIGVSHQ 
AATRAIATRT AVVAVGGDGT LNTVAQAAHA AGCAMGVVPQ GTFNYFARTH GIPADPADAV
RQLLLSVPAP VQVAGINDRV FLVNASLGLY PELLEDREAY KARFGRSRWV AFVAACATLL
RAQRRLRLHI EMGGKVRDMQ TLTLFVGNNR LQLQQFGAEP DDTLAGTPGD GSMAALVLRP
IGTLSMIGLM LHGAMGRLGE AAGVERFEFE HLVVRPTLPQ GRSGVKVAFD GEVTMMRAPL
DFRVLAKPLY LLMPQRDAAV VDARSSAEGA AP