Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3977 |
Symbol | |
ID | 5210960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4975178 |
End bp | 4975954 |
Gene Length | 777 bp |
Protein Length | 258 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640597569 |
Product | HAD family hydrolase |
Protein accession | YP_001278275 |
Protein GI | 148658070 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01428] 2-haloalkanoic acid dehalogenase, type II [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.883947 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACGGG CAGTCATTTT CGATATGGGT GGAACACTGC TGCAATACCC GCGCCCTGGA AACGGCGCCT GGCGCGAGTT CGAGGAACGC GGCATTCGCG GACTGTACCG CTACCTGGTA GCGCAGGGAC ATCCAATCGC CGACGGAGAA GAAGCGTTCG TGGCGCGTAT GTTCGAGCGC CTGGCGCAGG GATGGGAACA GGCGACCGGC GGACATATCA ACCTGCGCGC GGTTGACTGG ATTGCGGCCG GAGCGGCCGA TCACGACCTG GATCTTCCGG AGAGCACACT GATCGAGGCA GTCCACCACT ATGCCCGCCC CTTACGCGAT GGCGTCTCCG CTATGCCCGG GGCGGCAATG GCGCTGGCGG AGTTGCGCGC GCGCGGCATT CATACCGGTC TGATCTCCAA CACCATCTGG CCCGGCGACC TGCACCGCGA AGACCTGATG GCGCTGGGGC TGTGGTCATC CATCGAGTAC GCCGTTTTTT CGGGCGACCT GGGCATCTGG AAGCCCCGCC CCCAGATTTT CCTCCATGTC CTCGAACATT TCGGCGTCAG TCCGGCAGAA GCCATCTTCG TCGGCGATAG CCCCAAAGAA GACATTCGCG GCGCACAACA GGCAGGTATG CGCGCCTTCT GGGTGCGCAG TCCCGAATTT CCGCTCCCGC CAGACATTCA TCCTGATGCC ATTATCGAAA ACCCCGGCGA AATCGTGCCG CTACTCGAAG CCTCTGGTCA ACTTCCGCCG CGCTCGAGCG TTTCTCGTAA AGGTTGA
|
Protein sequence | MIRAVIFDMG GTLLQYPRPG NGAWREFEER GIRGLYRYLV AQGHPIADGE EAFVARMFER LAQGWEQATG GHINLRAVDW IAAGAADHDL DLPESTLIEA VHHYARPLRD GVSAMPGAAM ALAELRARGI HTGLISNTIW PGDLHREDLM ALGLWSSIEY AVFSGDLGIW KPRPQIFLHV LEHFGVSPAE AIFVGDSPKE DIRGAQQAGM RAFWVRSPEF PLPPDIHPDA IIENPGEIVP LLEASGQLPP RSSVSRKG
|
| |