Gene Rcas_0856 details


Gene Information

Locus tag: Rcas_0856
Symbol:
ID: 5538322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1121758
End bp: 1123365
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 59%
IMG OID: 640893007
Product: phosphodiesterase
Protein accession: YP_001430990
Protein GI: 156740861
COG category: [R] General function prediction only
COG ID: [COG1418] Predicted HD superfamily hydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
TIGRFAM ID: [TIGR03319] conserved hypothetical protein YmdA/YtgF
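
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 1121758
END_BP = 1123365
GENE_LENGTH_BP = 1608
PROTEIN_LENGTH_AA = 535


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))   # 1608
print(coding_codons(GENE_LENGTH_BP))   # 535
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.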


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000301712
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0173757
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGAGC CCGTAGCCAC AGCCCGGCAG AGGGCTGTAG CATGCTGCAC CTTTACAACT 
GAGCAGGAGG ACGTTGTGCA ATGGCTCGTA TGGGCCTTGC CTGCGCTGGT CATCGGACTT
GCAATCGGCG CAGGCATCGG CATTCTTATC TACAAAAACA GTGTTCAGAG CCAGATACGG
CAGATCGAGG CGGAAGCCCG ATTGCAACTG GAGGCGACGC GATCTGAGCA GAAAGATCTC
ATTCTGCGCG CAACTGATGA GGCGCTCCGG CTCCGCACCG AAGCCGAGGC GCAGATCCGC
GAAGCGCGCG CGGCGCTGGC AAAGCAGGAA GAACGGTTGC AACGAAAGGA GGAGAACCTC
GACCGCAAAA TCGAGGGGCT TGAGCGACGC GAACGGCAGC TTCAGCAACG TGAGCGTCAG
ATGGAGCAAC TCCACCAGGA AGCCGAACAT CTGCGGCAGC AACAGCGCGC GGAGCTTGAG
CGCATTTCCG CATTGAGCCA GGAAGAAGCC CGCGCCATCA TTCTGAAGCG CGTTGAAGAC
GAGACGCGCG ACGAAGCGGC GCGTCGCATT CGTGAAATTG AAAAGACGAT GCACGAAGAG
GCGGACAAAC TGGCGCGGAA AGTGATCAGC ATGGCCATTC AGCGGTGCGC TTCGGACTAT
GTAGCCGAAG TGACCGTTTC GACCGTGGCG CTCCCGAGCG AGGAGTTGAA GGGGCGCATC
ATCGGGCGCG AAGGGCGAAA CATTCGCGCA TTCGAGCAAT TAACCGGGGT TGATATTATT
GTCGATGATA CCCCGGAAGC GGTCACGCTG TCGTGCCACG ATCCAGTGCG GCGGGAAGTG
GCGCGCCTGG CGCTGATCAA GTTACTCAAG GATGGGCGCA TCCATCCGAC GCGCATTGAA
GAGGTCGTCC ATAAAACGCA GCAGGAAGTC GATCAGGTTA TGCGCGAAGA GGGCGAGCGA
GTCGCCTACG AAGCGAATGT CCAGGGTCTG CACCCTGATC TGATCAAACT GCTGGGGCGC
CTGAAATATC GGACGAGTTA CGGGCAGAAC GTACTCCAGC ACTCGCTGGA GTGCGCGCTG
CTTGCGGCGC ATATTGCTGC TGAGATCGGT GCAAACATCA ATGTGGCGAA AACTGCTGCG
TTGCTGCACG ACATCGGGAA GGCAGTCGAC CATGAGGTGC AGGGACCGCA CGCATTGATC
GGAGCTGAGA TTGCGCGTCG CCTTGGGAAA TCACCGGCAA TCGTTCACGC GATTGCCGCT
CATCATAACG ATGAAGAACC GCAAACCGTC GAAGCCTGGC TGGTTCAGGC TGTTGACGCC
ATCTCCGGCG GGCGCCCCGG AGCGCGCCGC GAGACGCTCG ACCTCTACAT CAAGCGCCTC
GAAGCGCTTG AAACAGTGGC GACGTCCTTT TCTGGCGTTC AACGCGCCTT TGCTGTTCAA
GCCGGACGCG AAGTGCGGGT GATGGTGCAA CCCGACGCTA TCGATGATCT TGGCAGTATT
CATCTTGCCC GTGATGTTGC CAAAAAGATC GAAGAGAGTT TGCAGTATCC GGGACAGATC
AAGGTGACGG TCATCCGCGA GACGCGCGCA GTGGACTATG CGCGCTGA
 
Protein sequence
MFEPVATARQ RAVACCTFTT EQEDVVQWLV WALPALVIGL AIGAGIGILI YKNSVQSQIR 
QIEAEARLQL EATRSEQKDL ILRATDEALR LRTEAEAQIR EARAALAKQE ERLQRKEENL
DRKIEGLERR ERQLQQRERQ MEQLHQEAEH LRQQQRAELE RISALSQEEA RAIILKRVED
ETRDEAARRI REIEKTMHEE ADKLARKVIS MAIQRCASDY VAEVTVSTVA LPSEELKGRI
IGREGRNIRA FEQLTGVDII VDDTPEAVTL SCHDPVRREV ARLALIKLLK DGRIHPTRIE
EVVHKTQQEV DQVMREEGER VAYEANVQGL HPDLIKLLGR LKYRTSYGQN VLQHSLECAL
LAAHIAAEIG ANINVAKTAA LLHDIGKAVD HEVQGPHALI GAEIARRLGK SPAIVHAIAA
HHNDEEPQTV EAWLVQAVDA ISGGRPGARR ETLDLYIKRL EALETVATSF SGVQRAFAVQ
AGREVRVMVQ PDAIDDLGSI HLARDVAKKI EESLQYPGQI KVTVIRETRA VDYAR
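
The relationship between the two sequences above can be illustrated with a short translation sketch. It is a minimal example, not the pipeline that produced this record: it strips the 10-character grouping, then translates codon by codon using the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which this sketch ignores). The names `clean` and `translate` are hypothetical helpers. Only the first three and the last five blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence for display."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence, copied from above.
first_blocks = "ATGTTTGAGC CCGTAGCCAC AGCCCGGCAG"
last_line = "AAGGTGACGG TCATCCGCGA GACGCGCGCA GTGGACTATG CGCGCTGA"

print(translate(first_blocks))  # MFEPVATARQ
print(translate(last_line))     # KVTVIRETRAVDYAR
```

The two outputs match the first block and the final fifteen residues of the protein sequence listed above. The last line can be translated on its own because the preceding 1560 bases are a multiple of three, so it starts in frame.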