Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3992 |
Symbol | |
ID | 5541502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5206185 |
End bp | 5206847 |
Gene Length | 663 bp |
Protein Length | 220 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640896104 |
Product | HAD family hydrolase |
Protein accession | YP_001434043 |
Protein GI | 156743914 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01428] 2-haloalkanoic acid dehalogenase, type II [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000540909 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTACGAG CCATTCTGTT CGACCTGGAC GACACACTGT ACGATCTCAA GGCGCACTGG CTCGCATGCC TGCGCATCGC TCTCGCAGAC GCACCATGCA CTATATCGTG CGACCTCGAG ACGCTGGTGC AGCATGCATT CACGATGAAG ATCTGGATCA ACCAGTTGCC CGACTTTCTG CGCGATCAGG GAATGACCGA TCAGCGCATG ATTGATCGGG CTTTCGCGCG CTATCGTGAT ATCTGGTTCG AGACACTCAC CCTCGATCCA GAAGCGCTGC CATTGCTCAC TGCCCTCGGC GCGCGCTACC GTCTCGGATT GATTACCAAC GGACCATCAT GGTCACAGCG TCCCAAGATT GAACGCTTCG ATCTTGCATC GTATATGCAT GCCATCATCG TCTCCGAGGA AGTCGGAGTT GCCAAACCGG ACCCGCAGAT CTTTCACATC GCACTGCACG CCCTGGGAAT AACGCCTGAT GAGGCATTGT TCGTCGGCGA TTCGCCAGAG AATGACCTGC GGGGTGCGGC ACAGGCGGGC ATGCCGGCTA TTTGGGTCAA CCGCCACGGA GTGACGCTTC CGCCTGACGT GCCCCCACCT GTTGCGGTGG TTGATGGTTT GCGCGACCTT CTGGCGATCA TTGCCGCTTA CGACTCACAT TGA
|
Protein sequence | MLRAILFDLD DTLYDLKAHW LACLRIALAD APCTISCDLE TLVQHAFTMK IWINQLPDFL RDQGMTDQRM IDRAFARYRD IWFETLTLDP EALPLLTALG ARYRLGLITN GPSWSQRPKI ERFDLASYMH AIIVSEEVGV AKPDPQIFHI ALHALGITPD EALFVGDSPE NDLRGAAQAG MPAIWVNRHG VTLPPDVPPP VAVVDGLRDL LAIIAAYDSH
|
| |