Gene Arth_0139 details

Gene Information

Locus tag: Arth_0139
Symbol:
ID: 4447402
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 141220
End bp: 142314
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 66%
IMG OID: 639687934
Product: diacylglycerol kinase, catalytic region
Protein accession: YP_829640
Protein GI: 116668707
COG category: [I] Lipid transport and metabolism
              [R] General function prediction only
COG ID: [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase
TIGRFAM ID: [TIGR00147] lipid kinase, YegS/Rv2252/BmrU family
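
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 141220
END_BP = 142314


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is not counted."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1095
print(protein_length(length_bp))  # 364
```

Both results agree with the Gene Length (1095 bp) and Protein Length (364 aa) fields.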


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGATT GGCTGCTCTA CCTCATCCTT GCTGCGGCCC TGGCTTTCGC GGTCTCCAGC 
TGGTGGGGCG TGCGGCGGCT GAAGGCGCTG CATACGCGCA GCGCCGTCCA GGAGGACACG
CACCATCCGG GCATGGCGCA GCAGAAGGTG GCCGTGGTCA TGAACCCGGT CAAGGCGAAA
TCGTCGGAAG CCCGTGCACT CATCCAGCGC GCCTGCCTGT CCGCCGGCTG GGAAGCCCCC
CTCTTCTTCG ATACAACTGC CGAGGACCCC GGGTACGCGC AAGCCGAGGC GGCAGTCGCA
AGCGGGGCCG ACGTCGTCCT GGTGGGCGGC GGGGACGGCA CCGTGCGTGT AGTGGCTGAG
AAGCTCGCCC GCACGAACGT GCCCATGGGC CTGGTTCCGC TGGGCACGGG AAACCTGCTG
GCCAGGAACA TCCACCTGGA CGTCAACGAC CTCCACGGCA GCATCCAGAC AGCGCTCTTT
GGGCACCAGC GGCACATCGA CACGGCCCGC ATGGGTATCA GGAACTCCCG GACGGGTGCC
TCGTCAGAGC ACGCATTCCT GGTGATTGCC GGCATGGGCA TGGATGCCGA AGTCGTCGGA
GACACCAACG ACGGGCTGAA AAAGGCGGTG GGCTGGCTCG CCTACACGGA GGCAGGAGTC
CGGCATCTGC CGGGGCGGCG CAAGAAGGTG TCCATCGCCC TGGACGACCA GCCGGAACAG
TCCCGGAAGA TCCGCAGTGT GCTGTTCGCC AACTGCGGCC TCATTCCGGG CGGCATCGAC
TTCATTCCGC AGGCAATGAT CGACGACGGA ATGCTGGACG TGGTGGTGAT GAGCCCCCGC
AGCGCCATCG GGTGGATCGC GATGTACACA AAGGTCATGT TCAAGCACAA AGGGAACCTG
CCGGTGATGA GCTATTACCG TTCCGGCAAG ATCGTCATCA AGTGCGCCGA GCCGGTGGCC
ACCCAGGTCG ACGGCGATCC GTGCGGCGAG GCGACCGACG TTACGGTTCA GGTGGAGCCG
CGGTCCCTGC TGGTCCGGGT TCCGGAACGC AAGGGCGGCG AAACGCCCGC AAGGGAAGCG
TCGGCCCCGC ATTAG
 
Protein sequence
MNDWLLYLIL AAALAFAVSS WWGVRRLKAL HTRSAVQEDT HHPGMAQQKV AVVMNPVKAK 
SSEARALIQR ACLSAGWEAP LFFDTTAEDP GYAQAEAAVA SGADVVLVGG GDGTVRVVAE
KLARTNVPMG LVPLGTGNLL ARNIHLDVND LHGSIQTALF GHQRHIDTAR MGIRNSRTGA
SSEHAFLVIA GMGMDAEVVG DTNDGLKKAV GWLAYTEAGV RHLPGRRKKV SIALDDQPEQ
SRKIRSVLFA NCGLIPGGID FIPQAMIDDG MLDVVVMSPR SAIGWIAMYT KVMFKHKGNL
PVMSYYRSGK IVIKCAEPVA TQVDGDPCGE ATDVTVQVEP RSLLVRVPER KGGETPAREA
SAPH
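
The gene and protein sequences above are related by translation table 11, which uses the standard codon-to-amino-acid assignments (it differs from the standard code only in which start codons are permitted). The following is a minimal sketch of how the space-separated 10-base blocks could be parsed, translated, and scored for GC content. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first and last rows of the gene sequence are used as input here.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1; a stop codon is rendered as '*'."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


first_row = "ATGAACGATT GGCTGCTCTA CCTCATCCTT GCTGCGGCCC TGGCTTTCGC GGTCTCCAGC"
last_row = "TCGGCCCCGC ATTAG"

print(translate(first_row))  # MNDWLLYLILAAALAFAVSS
print(translate(last_row))   # SAPH*
```

The two outputs match the start and end of the protein sequence listed above, with `*` marking the TAG stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content field.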