Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1722 |
Symbol | |
ID | 4785275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1847124 |
End bp | 1848122 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640090293 |
Product | hypothetical protein |
Protein accession | YP_001020917 |
Protein GI | 124266913 |
COG category | [I] Lipid transport and metabolism [R] General function prediction only |
COG ID | [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0741572 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCTC TCCCGGAAAT CGACCCCACG TCTTCGCTGC AGTTCATCGT CAATGCGGCG GCGGGCAGCA GCGACGCGGA GGCGAAGCGC GAAATCGTCG AAGCCGCGCT GCGCGCGGGT GGACGGCGGG GTGACTTGCT CTTCTGCAGC CCCGCCGAGT TGATTGGCGT GTCGCACCAG GCGGCGACGA GGGCGATCGC CACCCGCACG GCCGTGGTCG CCGTCGGTGG CGACGGCACG CTCAACACCG TGGCACAGGC TGCACACGCT GCGGGCTGCG CCATGGGCGT GGTGCCACAG GGCACCTTCA ACTACTTTGC CCGCACGCAC GGCATACCCG CAGACCCGGC CGATGCCGTC CGCCAATTGC TGCTTTCGGT GCCTGCGCCG GTTCAAGTGG CCGGCATCAA CGACCGCGTG TTCCTGGTCA ACGCCAGTCT CGGGCTCTAT CCTGAACTGC TGGAAGACCG TGAAGCCTAC AAGGCCCGCT TCGGTCGCAG CCGCTGGGTG GCGTTCGTGG CAGCCTGTGC GACTTTGCTT CGTGCGCAGC GCCGCTTGCG ATTGCACATC GAGATGGGTG GCAAGGTGCG CGACATGCAG ACCTTGACGC TCTTCGTGGG CAACAACCGC CTGCAGCTGC AGCAGTTCGG CGCCGAGCCC GATGACACCC TGGCCGGCAC GCCAGGCGAC GGCAGCATGG CCGCGCTCGT GCTGCGGCCT ATCGGAACGC TGTCGATGAT CGGCCTGATG CTGCATGGCG CCATGGGCAG GCTGGGTGAA GCCGCAGGCG TCGAGCGCTT CGAGTTCGAG CACCTGGTGG TGCGGCCTAC GCTGCCGCAG GGCCGCAGCG GGGTGAAGGT GGCCTTCGAT GGCGAAGTGA CGATGATGCG CGCACCGCTG GACTTCCGGG TGCTGGCCAA ACCACTGTAC CTGCTGATGC CACAGCGCGA CGCTGCCGTT GTCGACGCCC GATCCAGCGC CGAAGGGGCA GCGCCTTGA
|
Protein sequence | MAALPEIDPT SSLQFIVNAA AGSSDAEAKR EIVEAALRAG GRRGDLLFCS PAELIGVSHQ AATRAIATRT AVVAVGGDGT LNTVAQAAHA AGCAMGVVPQ GTFNYFARTH GIPADPADAV RQLLLSVPAP VQVAGINDRV FLVNASLGLY PELLEDREAY KARFGRSRWV AFVAACATLL RAQRRLRLHI EMGGKVRDMQ TLTLFVGNNR LQLQQFGAEP DDTLAGTPGD GSMAALVLRP IGTLSMIGLM LHGAMGRLGE AAGVERFEFE HLVVRPTLPQ GRSGVKVAFD GEVTMMRAPL DFRVLAKPLY LLMPQRDAAV VDARSSAEGA AP
|
| |