Gene Noca_3286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3286 
Symbol 
ID4599148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3493505 
End bp3495163 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content76% 
IMG OID639777892 
ProductDak phosphatase 
Protein accessionYP_924475 
Protein GI119717510 
COG category[R] General function prediction only 
COG ID[COG1461] Predicted kinase related to dihydroxyacetone kinase 
TIGRFAM ID[TIGR03599] DAK2 domain fusion protein YloV 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACTC CCCGCAGCGG CGCCATCACG TTGGAGGTGG TGCTGCGATT CGTCGACATC 
GCCACCGACG CGCTGGCCGA CGCCCGCGAG GAGATCGACG CGCTCAACGT CTACCCGGTC
CCCGACGGCG ACACCGGCAC CAACATGTAC CTCACGGTCT CGGCGGCCCG CGACGCCGTG
CGCGAGGCGA CCGGGGGAGA CCCGGCCTCC GACCTGGGTA CGGCGCTCGC CGCGTTCAGC
CGGGGCGCGC TGCTCGGCGC CCGCGGCAAC TCCGGGGTGA TCCTCAGCGA GATGCTCGGC
GCGATCGCGC GGCGGATCGG GAGCGCCGAG CCGGGGGAGC GCAACGCGCT GGTGATGGCC
GACGCGCTGC ACCGGGCGAC CGAGGCCAGC TACGCCGCTG TCGGGATCCC GGTCGAGGGC
ACCATGCTCA CCGTCACCCG GGCCGCCTCC GAGGCCGCGA CCGAGATCGC CCGGGACCCC
GGCTGCCGGG CCCGAGACGT GTTCACGGCC GCCGCGGCAG CCGCCCGCGA GGCCCTGGCG
CACACCCCGG AGCAGCTGCC CGTGCTCCGC GAGGCGGGGG TCGTCGACGC CGGCGGGCGG
GGCGTGAGCG TGATCCTCGA CGCGGCCGAG ACGGTGCTCA CGGGCCGCCG CCCGGTGCCG
GTCACCGCGC CGTTCGGCAG CCATCACATC CCGATCCCCA CCGCGGCGAA GACCGGCGAC
CTGACCCCGG ACGGACCCTC CTACGAGGTG ATGTACCTCC TGGACGCCGA CGACGCCGCG
ATCCCGGGCC TGCGAACCGC GCTCGGCGGG CTCGGCGACT CCCTGGTCGT CGTCGGTGGC
GAGGGCCTCT GGAACGTGCA CGTGCACGTC GACGACGTCG GCGCCGCGAT CGAGGCGGGC
ATCGCCGCCG GCCGGCCGCA CCGGGTGCGG GTCACCCACT TCGCCGAGCA GATCGCCGCG
GTCCGCGGCC GCACCGCCGC CCGCGACGGC CGCCGGGTCG TGGCCGTCGC GGCCGGGCCC
GGGCTCGCCG CGCTGTTCGA GGAGGCGGGC GCGGTCGTCG TGCCCGGCGG CCCGGGGCGC
CGACCCTCGA CGGGTCAGCT GCTCGAGGCG ATCACCGCAT GCGGCGCCTC CGAGGTCATC
GTGCTGCCCA ACGACCACGA CTCGGTGCGG GTCGCGCAGA TCGCGGCGAG CACGGCCGAG
GCTGACGCGG ACGGTGCGGT CCGGGTCGCG GTGATCCCGA CGCAGGCCCA GGTGACGGGC
CTGGCGGCGG TCGCCGTCCA CGAGCCCGGT CGCTCGTTCG AGCAGGACGT GCTCGAGATG
ACCGCCACCG CGCGCCACGC CCGTCAGGGG GCGGTCACGA TCGCGGCCAA GCAGGCGATG
ACGATGGCCG GGCCCTGCGA GACCGGCGAC GCCCTGGGCG TGATCGCCGG CGACTTCGCC
GTGGTGGGCA GCGACCTGTA CGCCGTCGCC GTCGAGGTGC TCGACCGCCT GCTCGGCGGT
GGTGGCGAGC TCGTCACGAT CGTGGCGGGG GCCGAGGACG CCGAGGGCTC CCTCGCGACC
CGGTGCGCGG GCTACGTCGA GGAGCACCAC CCCGCCGTCG ACGTCGTGGT GTACGACGGT
GGCCAGGAGC GCTACCCGCT CCTCATGTCG GTGGAGTAG
 
Protein sequence
METPRSGAIT LEVVLRFVDI ATDALADARE EIDALNVYPV PDGDTGTNMY LTVSAARDAV 
REATGGDPAS DLGTALAAFS RGALLGARGN SGVILSEMLG AIARRIGSAE PGERNALVMA
DALHRATEAS YAAVGIPVEG TMLTVTRAAS EAATEIARDP GCRARDVFTA AAAAAREALA
HTPEQLPVLR EAGVVDAGGR GVSVILDAAE TVLTGRRPVP VTAPFGSHHI PIPTAAKTGD
LTPDGPSYEV MYLLDADDAA IPGLRTALGG LGDSLVVVGG EGLWNVHVHV DDVGAAIEAG
IAAGRPHRVR VTHFAEQIAA VRGRTAARDG RRVVAVAAGP GLAALFEEAG AVVVPGGPGR
RPSTGQLLEA ITACGASEVI VLPNDHDSVR VAQIAASTAE ADADGAVRVA VIPTQAQVTG
LAAVAVHEPG RSFEQDVLEM TATARHARQG AVTIAAKQAM TMAGPCETGD ALGVIAGDFA
VVGSDLYAVA VEVLDRLLGG GGELVTIVAG AEDAEGSLAT RCAGYVEEHH PAVDVVVYDG
GQERYPLLMS VE