Gene RoseRS_3521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3521 
Symbol 
ID5210498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4413038 
End bp4414591 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content65% 
IMG OID640597116 
ProductDak phosphatase 
Protein accessionYP_001277829 
Protein GI148657624 
COG category[R] General function prediction only 
COG ID[COG1461] Predicted kinase related to dihydroxyacetone kinase 
TIGRFAM ID[TIGR03599] DAK2 domain fusion protein YloV 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.400174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCG CGTGGAATGG TGAACATCTG CTGGAAGCGC TGCGCGCAGC CGCGCACGAT 
CTTGAGCGCC ACGCTGCGTC GCTCAATGCG CTGAATGTGT TTCCGGTTCC CGACGGCGAT
ACCGGCACGA ATATGGCGCT GACGCTCAGT GGGGCGTTGC GCGACATTGC GCCAGACCCA
TCGTGCGGTG TGGTGGCGGA ACGGGTCGGG TACTGGGCAA CACTGCGCGG GCGCGGTAAT
TCGGGCATCA TTCTGTCGCA GATGCTGCGC GGTGTCGCCG CTGCTCTCGC CGGGCATCAC
CTGATGAGCG GGCGCGAAAT GGCAGCAGCG CTGGCGCACG GAAGCGCGCG CGCCTATGAA
GCGGTGCTGC GCCCGGTGGA AGGCACGATG CTGACCGTCA TTCGCTGCGC CGGTGAAGCG
GCACAGCGTG CAATTGCGGC TGGCGAGGCG TCGCTTGGCG CTGTACTCGA TGCGGCGGTG
CGTGAAGCGC GTGCAGCGGT GGCACGAACG CCGCAACTGC TGGCGACCCT GCGCGAGGCA
GGGGTGGTCG ATGCCGGTGG TCAGGGCATG CTGGTGCTGC TCGAAGCGCT ACTGCGCTAC
GCGCGCGGCG AAACAGGCGG GTCGTTCACA CCGCCGGTCA CAGCGCCAGC GCTCGTCACG
GATCATGCAG ATGCAGTCGG CTACTGCACC AGTTTCGTGA TCCATCGGGT GACGGTATCG
CTGGACACGC TCCGGCGTGT GTTTGCAACG CTCGGCGACT CACTGGTGGT CGCGGGGGAT
CAGGCGCTGG TGAAAATTCA TCTCCACACC CTGCGACCGG GTGATGCGCT CAACCAGGCG
CTGGAGTATG GCGCACTCGA CCAGGTCGAA GTCGTCAACA TGGATCTGCA ACGTGCTGCA
TTGCACTCCA GCGCACCGGT TCCGGTCGAA CAGGCGGACA TTGCCGCAGC GCCCGAAGCG
GTCGGGATCA TTGCCCTGGC GCCGGGCGCT GGTTTTGCAG CAATCCTGCG CGACCTGGGC
GCCAGCCTGG TGGAGGAAAC CGTAGCGACG CCGACGGTCG ATGAATGGCT GGCGCTCTTT
GAGCGTGTGG CGGAGCGCAG CATGATCGTT CTGCCCAACG ATCCGCAGGC GCTGGAGACC
GCGCGCAGTG CAGCGCAACA GGTCAACCGA CGCATCGATG TCGTACCGGC AATGTCGTCG
CCCCAGGGGA TTGCCGCGCT GCTGGCGCTG AACTTTCAAG CCGGGATTGA TCAGAACCTC
CAGATGATGA AGGCGGCAGC GGAGCGCGTG CAGGTGATTA CATTCGAGGT GCAACAGGAC
GGCGAACGAC ACGTGCCTGC AGAAGCGCCG CAAGATGCGT ATAATGTGTG CCATGCACTG
CGCCAGCACG GGGCTGACGC TGCTGAGATC GCAACGCTGT ACTATAGTCG GGGGGGCGAT
GTTGCCCGGG CTGAAATGCT GGCGGAAGCG ATTCGCGCCG CTTTCCCCGC GCTGCATGTC
GAAATACACG CTGGCGGGCA ACCCGGCGAC AGCATGATCA TTGCGCTCGA ATAG
 
Protein sequence
MTGAWNGEHL LEALRAAAHD LERHAASLNA LNVFPVPDGD TGTNMALTLS GALRDIAPDP 
SCGVVAERVG YWATLRGRGN SGIILSQMLR GVAAALAGHH LMSGREMAAA LAHGSARAYE
AVLRPVEGTM LTVIRCAGEA AQRAIAAGEA SLGAVLDAAV REARAAVART PQLLATLREA
GVVDAGGQGM LVLLEALLRY ARGETGGSFT PPVTAPALVT DHADAVGYCT SFVIHRVTVS
LDTLRRVFAT LGDSLVVAGD QALVKIHLHT LRPGDALNQA LEYGALDQVE VVNMDLQRAA
LHSSAPVPVE QADIAAAPEA VGIIALAPGA GFAAILRDLG ASLVEETVAT PTVDEWLALF
ERVAERSMIV LPNDPQALET ARSAAQQVNR RIDVVPAMSS PQGIAALLAL NFQAGIDQNL
QMMKAAAERV QVITFEVQQD GERHVPAEAP QDAYNVCHAL RQHGADAAEI ATLYYSRGGD
VARAEMLAEA IRAAFPALHV EIHAGGQPGD SMIIALE