Gene Sare_1168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1168 
Symbol 
ID5704260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1315684 
End bp1317309 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content75% 
IMG OID641270686 
ProductDak phosphatase 
Protein accessionYP_001536067 
Protein GI159036814 
COG category[R] General function prediction only 
COG ID[COG1461] Predicted kinase related to dihydroxyacetone kinase 
TIGRFAM ID[TIGR03599] DAK2 domain fusion protein YloV 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0701112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000069838 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCTGGACA CCCTCGATGC CGCCGCGGTC CGCCGTTGGT GCGCCGGTGG TCTGGTCGCA 
CTGAAGCGCC ACCAGGGCGA AATCGACCAC CTCAACGTCT ACCCGGTGCC CGACGGTGAC
ACCGGCACGA ACCTGGTACT CACCCTCACC TCGGCGCAGC AGGCGCTGGC AATGGACCTG
GACACGCTGC CCGACGACGG GCCCACCCCG CACGGGCAGG CGCTTCGGTT GATGGCCCAG
GGCGCGCTGC TCGGTGCCCG CGGCAACTCC GGGGTGATCC TGGCGCAGAT CCTGCGCGGC
TTCGCCGACG CGCTGGCCAC CGTTCCCGTG GTGCGGGGAC GGGCGGTGGC CGTCGCCCTG
CGTACCGCCG CCACCGCCGC GTACGCCGCC GTCGTCGCTC CCGTCGAGGG GACGCTGCTC
AGTGTGGTGG CCGCCGCGGC GGGCGCCGCC GAGCGGGCCG ACCGTGACGA GCTGGGCCCG
GTGGTCCGGG CGGCGGCGGA CGAGGCCGTG CGGGCGCTCG ACCGTACCCC CCAACAGCTG
CCAGCGCTGG CCCGCGCCCG AGTGGTCGAC GCCGGTGGGC GGGGCCTCTG CCTGCTGCTC
GACGCCCTGG TCGAGGTGGT CACCGAGGAG CGGCCAGCGC GCCCGGCAGT CGCACCCGGG
CCGATCCAGC CACCAGCCGT CGCGGTTCGG GGAAGCGGCT CCCCGACGTA CGCCTACGAG
GTGCAGTACC TGCTCGACGC CGAGCCGGCC GCGGTGGACC GGCTACGGGC ACAACTGGTC
GCCCTCGGCG ACTCGCTGAC CGTCGTCGGC GACGGCGCGA CGACCGGCGG CACCTGGAAC
GTGCACGTGC ACGTCAACGA TGTCGGCGCG GCGATCGAGG CGGGAGTGGC CGCCGGCCGC
CCGCACCGGA TCACGGTGAC CCGTTTCGCC GATCCGCCCA CGTCGCCGGC GGCGGCCCGG
CCCGACCCGG CACCGGAGGG GCGGGCCGCT GTCGTGGTCG CCACCGGCGC CGGGATCGTC
GAGCTGTTCA CGGCGGCGGA GGCGACGGTG GTGCCGGGCA GCCCGGCCCC CAACGAGCTG
CTGAACGCCG TGCGCGCCAC CGGCGCCGCC AGCGTGGTGG TGCTGCCCAA CGACACGCTC
ACCCAGGCCA TGGCGAGTGA CGCGGTCGAG GAGGCGCACC GGTTCGGCGT CAAGGTCAGC
GTGGTCCCGA CCCAGTCGCC GGTGCAGGCG CTCGCCGCGC TCGCCGTCCG GGATCCGGGC
CGGCGCTTCG AGGACGACGT GATCGCGATG GCCGAGGCCG CCGGCGCCTG CAGGTACGCG
GAGATCTGCC ACGCCAGCCG GGAGGCACTG ACCATCGCCG GACCTTGCCG GAAGGGGGAC
GTACTCGCCC TGGTCGACGG CGAGGTGCAC CTCATCGGGT CGGATCTGCT CGACACCTGC
ACTGCCGTGG TGGACCGGAT GCTCGGCGGC GGCGGTGAAC TGGTCACCCT GCTGGCCGGG
GCGGACGCCC CCGAAGGCCT GACCGAGGCG GTCCGCGAAC ACGTTTTGCG GTCCTGGCCG
TTCGTCGAGG TGCACGTCTA CCCGGGTGGG CAGCCGCGCT ACCCGCTGCT GGTGGGGGTC
GAATGA
 
Protein sequence
MLDTLDAAAV RRWCAGGLVA LKRHQGEIDH LNVYPVPDGD TGTNLVLTLT SAQQALAMDL 
DTLPDDGPTP HGQALRLMAQ GALLGARGNS GVILAQILRG FADALATVPV VRGRAVAVAL
RTAATAAYAA VVAPVEGTLL SVVAAAAGAA ERADRDELGP VVRAAADEAV RALDRTPQQL
PALARARVVD AGGRGLCLLL DALVEVVTEE RPARPAVAPG PIQPPAVAVR GSGSPTYAYE
VQYLLDAEPA AVDRLRAQLV ALGDSLTVVG DGATTGGTWN VHVHVNDVGA AIEAGVAAGR
PHRITVTRFA DPPTSPAAAR PDPAPEGRAA VVVATGAGIV ELFTAAEATV VPGSPAPNEL
LNAVRATGAA SVVVLPNDTL TQAMASDAVE EAHRFGVKVS VVPTQSPVQA LAALAVRDPG
RRFEDDVIAM AEAAGACRYA EICHASREAL TIAGPCRKGD VLALVDGEVH LIGSDLLDTC
TAVVDRMLGG GGELVTLLAG ADAPEGLTEA VREHVLRSWP FVEVHVYPGG QPRYPLLVGV
E