Gene Phep_4164 details

Gene Information

Locus tag: Phep_4164
Symbol:
ID: 8255299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 5037970
End bp: 5038848
Gene Length: 879 bp
Protein Length: 292 aa
Translation table: 11
GC content: 42%
IMG OID: 644937829
Product: diacylglycerol kinase catalytic region
Protein accession: YP_003094417
Protein GI: 255534045
COG category: [I] Lipid transport and metabolism; [R] General function prediction only
COG ID: [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase
TIGRFAM ID: [TIGR00147] lipid kinase, YegS/Rv2252/BmrU family
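
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are inclusive and that the coding sequence ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5037970
end_bp = 5038848
gene_length_bp = 879
protein_length_aa = 292

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_length_bp // 3
expected_protein = codons - 1

consistent = (
    span == gene_length_bp
    and gene_length_bp % 3 == 0
    and expected_protein == protein_length_aa
)
print(span, codons, expected_protein, consistent)
```

Under these assumptions, the span of 879 bp corresponds to 293 codons, one of which is the stop codon, which agrees with the listed protein length of 292 aa.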


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00574485
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000041756
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGCATAAAA AGACTTCCAA ATTAAAATTG CTGTTTATTG TGAACCCTGG CTCGGGAAGT 
GGAGAGATAA ACTTCAGCGA GGTCATTGGC AATTATTTTG CCGAAAAAAC ACAGGATTTT
GAGATTTACA AACTTACAAA AAACTGTTCT CTAACTAAGA TCAAGGGTGT TATTCAGCAA
TCGATGGCCG ATAGGGTAAT TGCCGTAGGT GGGGATGGCA CCTTAAAACT GGTTGCAGAG
TGCGTACTGG AAACCAACAT ACCAATAGGC ATTATTCCGG CTGGTTCTGC CAATGGCATG
GCCCGCGAAC TAAACATCCC CTCCAGAATA GAAGAAGCGC TTGATATCGC CATAAATGCC
CCGGCTAAAA AGATACATGC CGTTATTGTA AACGGCGAGC TCTGTATCCA TCTGGCCGAC
ATTGGCTTTA ATGCTTACCT GGTAAAGAAA TTTGATGCCC TGCCGCAACG CGGAATGCTT
GCCTATGCTA AAGCAGCCTG GACAGCCCTC TGGAATCATT ATAAAATGGA AGTTGAATTT
AAGATCAAAG ATAAAACCAT TCACTCAAAG GCTGCCATGG TGGTTATAGC CAATGCCACT
ATGTATGGTA CGGGAGTTAA GATCAATCCT GATGGGCAAC TGGATGATGA CTTTTTTGAG
GTCATCCTTG TTAAAGAATA CTCCTTCATG GAAATACTTA AACTAAAGTT TACCAACCTG
CCTTTTAATC CAAAAAACAT CGAGTCCTTC CAAACTACCA ATCTCAGTAT TAAAACCCGG
CATAAGGCCC ATTTCCAGGT CGACGGAGAA TATATAGGAA AACTGAACAA CATTAAAGCG
CACATCGTTA AAGATGCCAT CCACATCATT GCACCATAA
 
Protein sequence
MHKKTSKLKL LFIVNPGSGS GEINFSEVIG NYFAEKTQDF EIYKLTKNCS LTKIKGVIQQ 
SMADRVIAVG GDGTLKLVAE CVLETNIPIG IIPAGSANGM ARELNIPSRI EEALDIAINA
PAKKIHAVIV NGELCIHLAD IGFNAYLVKK FDALPQRGML AYAKAAWTAL WNHYKMEVEF
KIKDKTIHSK AAMVVIANAT MYGTGVKINP DGQLDDDFFE VILVKEYSFM EILKLKFTNL
PFNPKNIESF QTTNLSIKTR HKAHFQVDGE YIGKLNNIKA HIVKDAIHII AP
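
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch rather than a full validation: it builds the codon table from the standard NCBI layout, on the assumption that translation table 11 assigns the same amino acids to sense codons as the standard code and differs only in which start codons are permitted. The `translate` function and the variable names are illustrative, and only the first 30 and last 9 bases of the gene sequence are used.

```python
# Codon table built from the standard NCBI layout (bases ordered T, C, A, G).
# Assumption: table 11 matches the standard code for all sense codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_30 = "ATGCATAAAA AGACTTCCAA ATTAAAATTG"  # start of the gene sequence
last_9 = "GCACCATAA"                           # end of the gene sequence
print(translate(first_30))  # MHKKTSKLKL
print(translate(last_9))    # AP
```

The two outputs match the first ten and last two residues of the protein sequence listed above; passing the whole gene sequence to the same function would extend the check to all 292 residues.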