Gene GM21_1361 details

Gene Information

Locus tag: GM21_1361
Symbol:
ID: 8136689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1604872
End bp: 1605837
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 66%
IMG OID: 644868975
Product: diacylglycerol kinase catalytic region
Protein accession: YP_003021178
Protein GI: 253699989
COG category: [I] Lipid transport and metabolism; [R] General function prediction only
COG ID: [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase
TIGRFAM ID:
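
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the gene length includes an untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1604872
end_bp = 1605837

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 966 321
```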


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 8.75823e-18
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCCACTGT CAAACGACAC GAACCAGCGC GCGCCCAGCC CCTTCTTCAT AGTCATGAAC 
GCCGGCTCCG GAGACAAGGA CGCGGACGAG AGGGAGGGCG CTATCCGCAG CGTCCTTGCC
GCGGGGAAGC GTCCTTACCG CCTGTGGCGC GTAAGCGACA TCCGGCGTCT GCCGGAGGCG
GCGCGGGAGG CGGTGCAACT TGCACGGCAG CAGCAGGGGA CCGTCGTCGC CGCGGGGGGG
GACGGGACCA TCAACTCGGT GGTCCAGGAG GTGCTGCCTT CGGGGTGCCC CTTCGGCGTG
CTCCCCCAGG GTACCTTCAA CTATTTCAGC CGCGCCCACG GGATACCGGT CGACCCGGAA
GAGGCCTGCT CCTTGTTATT GCAAGGGGTA CTGCGGCCGG TGCAGGTCGG GCTGGTGAAC
CAGCGCCCCT TCCTGGTGAA CGCAAGCCTC GGCCTTTACC CCAAGCTGCT GGAAAAACGC
GAGGTTCATA AGAAGAGGTT CGGACGCAGC AGGCTCGTTG CTGCGCTATC GGGCCTTGCG
ACCTTGCTGG CTCCGCCACC CCGGCTGGTG TTGGCCCTTG ACGATGGCGC GGGAAGCGAC
GTGATGCACA TCTGCACCCT GGTAGTCGAC AACAACCTGC TGCAGCTGCG GCAGATGGGG
CTGCCTGAGG CGCGCGCCGT GCAGCACGGG GAACTGGCGG CGATCGTGGT CAAGGCTCAA
GGGGGGGCGC AACTGATCTC GGCCCTGGCG TTGGCCGTAC TGGGAAAGCT GGGGGAGGCT
GATGCCGTTG ATTCCTTTTC CTTTAAGAGG CTCACGGTGA AACCGCTGCA CCGAGGCAGG
CGCATCAAGG TGGCTACGGA CGGCGAGGTC ACCTGGATGG TCCCTCCGCT GGTGTTCCAG
GCGGCACCCG ACTCGCTGCT GCTCATCGTC CCCCCTCCGG GAACGGTGGG GGACGATGAG
GCGTGA
 
Protein sequence
MPLSNDTNQR APSPFFIVMN AGSGDKDADE REGAIRSVLA AGKRPYRLWR VSDIRRLPEA 
AREAVQLARQ QQGTVVAAGG DGTINSVVQE VLPSGCPFGV LPQGTFNYFS RAHGIPVDPE
EACSLLLQGV LRPVQVGLVN QRPFLVNASL GLYPKLLEKR EVHKKRFGRS RLVAALSGLA
TLLAPPPRLV LALDDGAGSD VMHICTLVVD NNLLQLRQMG LPEARAVQHG ELAAIVVKAQ
GGAQLISALA LAVLGKLGEA DAVDSFSFKR LTVKPLHRGR RIKVATDGEV TWMVPPLVFQ
AAPDSLLLIV PPPGTVGDDE A
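
The relationship between the gene sequence and the protein sequence can be checked with a minimal sketch like the one below. It is not the tool that produced this record. It assumes that translation table 11 shares the codon-to-amino-acid assignments of the standard genetic code and that the first codon is a valid start codon read as M (here TTG). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code assignments,
# assumed here to apply to translation table 11 as well).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is a start codon and is read as M.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


def gc_content(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "TTGCCACTGT CAAACGACAC GAACCAGCGC GCGCCCAGCC CCTTCTTCAT AGTCATGAAC"
print(translate(first_line))  # MPLSNDTNQRAPSPFFIVMN
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give the fraction that the record reports as a rounded percentage.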