Gene RoseRS_3977 details


Gene Information

Locus tag: RoseRS_3977
Symbol:
ID: 5210960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4975178
End bp: 4975954
Gene Length: 777 bp
Protein Length: 258 aa
Translation table: 11
GC content: 62%
IMG OID: 640597569
Product: HAD family hydrolase
Protein accession: YP_001278275
Protein GI: 148658070
COG category: [R] General function prediction only
COG ID: [COG1011] Predicted hydrolase (HAD superfamily)
TIGRFAM ID: [TIGR01428] 2-haloalkanoic acid dehalogenase, type II
    [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
    [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
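
The start and end coordinates, the gene length, and the protein length listed above are consistent with one another. The following minimal Python sketch shows the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4975178, 4975954)
print(length_bp)                  # 777
print(protein_length(length_bp))  # 258
```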


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.883947
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATACGGG CAGTCATTTT CGATATGGGT GGAACACTGC TGCAATACCC GCGCCCTGGA 
AACGGCGCCT GGCGCGAGTT CGAGGAACGC GGCATTCGCG GACTGTACCG CTACCTGGTA
GCGCAGGGAC ATCCAATCGC CGACGGAGAA GAAGCGTTCG TGGCGCGTAT GTTCGAGCGC
CTGGCGCAGG GATGGGAACA GGCGACCGGC GGACATATCA ACCTGCGCGC GGTTGACTGG
ATTGCGGCCG GAGCGGCCGA TCACGACCTG GATCTTCCGG AGAGCACACT GATCGAGGCA
GTCCACCACT ATGCCCGCCC CTTACGCGAT GGCGTCTCCG CTATGCCCGG GGCGGCAATG
GCGCTGGCGG AGTTGCGCGC GCGCGGCATT CATACCGGTC TGATCTCCAA CACCATCTGG
CCCGGCGACC TGCACCGCGA AGACCTGATG GCGCTGGGGC TGTGGTCATC CATCGAGTAC
GCCGTTTTTT CGGGCGACCT GGGCATCTGG AAGCCCCGCC CCCAGATTTT CCTCCATGTC
CTCGAACATT TCGGCGTCAG TCCGGCAGAA GCCATCTTCG TCGGCGATAG CCCCAAAGAA
GACATTCGCG GCGCACAACA GGCAGGTATG CGCGCCTTCT GGGTGCGCAG TCCCGAATTT
CCGCTCCCGC CAGACATTCA TCCTGATGCC ATTATCGAAA ACCCCGGCGA AATCGTGCCG
CTACTCGAAG CCTCTGGTCA ACTTCCGCCG CGCTCGAGCG TTTCTCGTAA AGGTTGA
 
Protein sequence
MIRAVIFDMG GTLLQYPRPG NGAWREFEER GIRGLYRYLV AQGHPIADGE EAFVARMFER 
LAQGWEQATG GHINLRAVDW IAAGAADHDL DLPESTLIEA VHHYARPLRD GVSAMPGAAM
ALAELRARGI HTGLISNTIW PGDLHREDLM ALGLWSSIEY AVFSGDLGIW KPRPQIFLHV
LEHFGVSPAE AIFVGDSPKE DIRGAQQAGM RAFWVRSPEF PLPPDIHPDA IIENPGEIVP
LLEASGQLPP RSSVSRKG
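
The sequences above are printed in space-separated blocks of ten, so they need whitespace stripped before any analysis. The sketch below is one possible way to clean a block, compute its GC percentage, and translate it. It makes two assumptions: the helper names are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). Only the first line of the gene sequence is used as input.

```python
# Illustrative helpers; the names are hypothetical, not part of any library.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGATACGGG CAGTCATTTT CGATATGGGT GGAACACTGC TGCAATACCC GCGCCCTGGA"
print(translate(first_line))  # MIRAVIFDMGGTLLQYPRPG
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and protein length fields.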