Gene Rcas_0604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0604 
Symbol 
ID5538067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp804144 
End bp805694 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content64% 
IMG OID640892765 
ProductDak phosphatase 
Protein accessionYP_001430751 
Protein GI156740622 
COG category[R] General function prediction only 
COG ID[COG1461] Predicted kinase related to dihydroxyacetone kinase 
TIGRFAM ID[TIGR03599] DAK2 domain fusion protein YloV 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00036597 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.583532 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGCG CGTGGAATGG CGAGCATCTG CTGGAAGCGT TACGCGCGGC GTCGCGCGAT 
CTGGAGCGTC ACGCCGCCTC GCTCAATGCA TTGAATGTGT TCCCTGTTCC CGACGGCGAT
ACCGGGACGA ACATGGCGCT GACATTGAGC AGCGCATTGC GCGACATTAC GCCGCATCCT
TCGTGCGGCA CAGTGGCGGA GCAGGTTGGC TACTGGGCGA CGATGCGCGG GCGCGGCAAC
TCAGGCATCA TTCTGTCGCA AATCCTGCGT GGTGTTGCCG CGGCGCTTGC CGGACATCAC
CTGATGAGCG GGCGCGAAAT GGCGGTGGCC CTGACGCATG GCAGCACGCG CGCCTACGAA
GCCGTGTTGC GTCCGGTCGA AGGAACGATG TTGACGGTCA TCCGCTGCGC CGGTGAGGCG
GCGCAGCGCG CCATCGCCGC AGGTGAAGCA TCGTTGAGCG CCGTGCTCGA GGCAGCCGTG
CGCGAAGCGC GCGCCGCCGT GGCGCGCACG CCGCAGTTGC TGGCGACCCT GCGCGACGCA
GGCGTGGTCG ATGCGGGCGG GCAGGGTTTG CTGGTCCTGC TCGAAGCGCT GCTGCGCTAT
GCCCGCGGCG AAGCCAGTGA TTCGCATGCC CCAACCGTGA CACCCACCGC AACCGTTGAT
GATCATGCCG AGAGCGCAGG GTACTGCACC AGTTTTGTCA TCCATCACGC AACCGCACCA
CCGGAAACGC TCCGACGAGT CTTTGCGGCG CTCGGCGAAT CGCTGGTGAT CGCCGGAGAT
CGCGCGCTAG TCAAAATACA TCTTCACACT CCACGACCGG GCGACGCGCT CAATCAGGCG
TTAGCGTATG GCATTCTCGA TCAGATCGAA GTCGTGAACA TGGATCTGCA ACGCATGGCG
CACCATTCGG GTGCGGCGCT TTCCGACACT CAACCGGATA CACCGGCGAA CCCTGCGCCG
GGAATCATTG CACTGGCGCC AGGCGCCGGA TACGCAGCCA TCCTGCGCGA CCTGCGCGCC
GATCTGGTGT GGGAGACGAA TACGCCGCCG ACCATCGACG AGTGGCGCGC AGCCTTTGAG
CGCCTGCCGC AGCAGGAGAT CATTGTGCTG CCCAATGATC CGCAGGCGGC GGAAACTGCG
CAGGCAACCG CACCGTTGTT CGCCAGGCGC ATTGCTATCG TGCCGGCAAC CTCGCCGCCA
CAGGGCATTG CCGCGCTGCT GGCGCTGAAC TTCCAGGCAG ACGTCGATCA GAACATTCGG
GCAATGACAG CAGCAGCAGA ACGGGTGCGG GTTATCACCT TCGATGGACA GCGTCGCAAC
GAGATGGAGA CGCCTGCAGA AGCGGTGCAA GATGCGTATA ATGTGTGCCA TACACTTCAG
CAGATGGGCG CGAACGCTGC CGAGGTCGTC ACGCTCTACT ATGGACAGGC TGTTGACCAG
ACGCATGCGG AGCGACTGGC GCAGGAGATT CGGGTTGCTT TCCCGATGCT GCACGTCGAA
GTTCATGCTG GCGGTCAACC GGGCAGTGGC GTCGCCATTG CCCTCGAATA A
 
Protein sequence
MTGAWNGEHL LEALRAASRD LERHAASLNA LNVFPVPDGD TGTNMALTLS SALRDITPHP 
SCGTVAEQVG YWATMRGRGN SGIILSQILR GVAAALAGHH LMSGREMAVA LTHGSTRAYE
AVLRPVEGTM LTVIRCAGEA AQRAIAAGEA SLSAVLEAAV REARAAVART PQLLATLRDA
GVVDAGGQGL LVLLEALLRY ARGEASDSHA PTVTPTATVD DHAESAGYCT SFVIHHATAP
PETLRRVFAA LGESLVIAGD RALVKIHLHT PRPGDALNQA LAYGILDQIE VVNMDLQRMA
HHSGAALSDT QPDTPANPAP GIIALAPGAG YAAILRDLRA DLVWETNTPP TIDEWRAAFE
RLPQQEIIVL PNDPQAAETA QATAPLFARR IAIVPATSPP QGIAALLALN FQADVDQNIR
AMTAAAERVR VITFDGQRRN EMETPAEAVQ DAYNVCHTLQ QMGANAAEVV TLYYGQAVDQ
THAERLAQEI RVAFPMLHVE VHAGGQPGSG VAIALE