Gene Arth_4068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4068 
Symbol 
ID4447710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4592164 
End bp4593150 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content65% 
IMG OID639691899 
Productdiacylglycerol kinase, catalytic region 
Protein accessionYP_833543 
Protein GI116672610 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID[TIGR00147] lipid kinase, YegS/Rv2252/BmrU family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAGC ACAATAATGA GCCGGAGAAC ACGAACACTC CGGAAGGCAG CGCCGCGCCG 
CAGGCCGGCA CCGGTCCCAA GCGCGCCGCC GTGATCATCA ATCCGGCCAA ACCGGTGGAC
TTCGATATTC GCGGCATGAT GGCCAAGCAC TGTGCAGATG CGGGCTGGGA TGAGCCCATG
TGGATCGAAA CCAGCAAGGA GGACCCCGGC GTCGGACAGG CGAAGGAAGC GCTGAGCCGC
GGGGCCGACG TCGTGATTGC CGCCGGCGGC GACGGCACCG TTCGGTGCGT GGCAGAGGTC
CTCTCCGGCA CTTCCACGCC CATGGGGCTG CTGCCGCTGG GCACGGGAAA CCTGCTGGCC
CGCAACCTCG GCATGGACGT CACCGATATT GAAGGCGCCA TGGCCGGTGC CCTGACGGGC
GAGGACCGCA AGATCGACGT GGTCCGTGCC GTCCGGAGCG ATCCGGACAA GGAGCAGCAC
TTCCTGGTGA TGGCTGGCGT GGGTTACGAC GCCACCATCA TGGCGGACAC GAATGAGGAC
CTGAAGGACA AGGTGGGCTG GCTGGCCTAC GTTGATGCGG GCATCAGGAA CCTTCCCGGC
AAACCGGTGA AGGCGAGCAT TGTCATCGAC GGCAAATCAG TGGTCCATCG CCGTGTCCGC
AGCGTCATGG TGGGGAACTG CGGCAAAGTT CAGGGCGGAC TGGAAATCTT CCCCGACGCG
AAGGTGGATG ACGGCCTGCT GGATGTCGTG GTCCTGGCGC CCCGGGGCAA GCTCGGCTGG
TTCTCCGTTG TGGCCGGCAT GATCGGCAAG GGCAAAGGCA AGGACACGTC CGTGGAGTAC
TTCCAGGGCA AGGACGTGGA GATCACCCTT GAACACGCCG ATGATTACCA GCTCGACGGC
GACCATGAAG GAGACGGCAA GCACGTCCGC ATGACCATGC TTCCCGGCGC ACTGACGGTC
AGGATGAACG CGCCCGCTGC CGCCTAG
 
Protein sequence
MTQHNNEPEN TNTPEGSAAP QAGTGPKRAA VIINPAKPVD FDIRGMMAKH CADAGWDEPM 
WIETSKEDPG VGQAKEALSR GADVVIAAGG DGTVRCVAEV LSGTSTPMGL LPLGTGNLLA
RNLGMDVTDI EGAMAGALTG EDRKIDVVRA VRSDPDKEQH FLVMAGVGYD ATIMADTNED
LKDKVGWLAY VDAGIRNLPG KPVKASIVID GKSVVHRRVR SVMVGNCGKV QGGLEIFPDA
KVDDGLLDVV VLAPRGKLGW FSVVAGMIGK GKGKDTSVEY FQGKDVEITL EHADDYQLDG
DHEGDGKHVR MTMLPGALTV RMNAPAAA