Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0607 |
Symbol | |
ID | 5538070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 806453 |
End bp | 807268 |
Gene Length | 816 bp |
Protein Length | 271 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640892768 |
Product | Cof-like hydrolase |
Protein accession | YP_001430754 |
Protein GI | 156740625 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000496024 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.512599 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTTCC AACTGATCGC CCTTGATCTC GATGGCACGG TCATCGACCA TGATCTGACC ATTCATCCCG ACGTGCGCGA AACAATTGCC GCCGTGCAGG CGCGCGGTAT TGACGTAACG CTGGCGACCG GGCGCATGTT TGGCGCAGCA CTGCCGTTTG CGCGCGAACT CGATATTCGC GCGCCGATTA TCTGTTACCA GGGGGCGCTG GTGCGTCACC CGCTGACCGG CGATACGCTC TACCATGCGG CAATGCCTGC CGAACTGGCA GCGGCGGCAG TGCGCGAACT CCTCGATGCC GACATGTGTG TGGTTGCCTA TATCGATGAT ATTCACCATA TCACGGCGTA TCGACCGGAA CTCGAACGCT ACCTTGCCTT CCATCCCGAA GGGACAGAGA TGGTCGTTAC TCCCGATCTG GATCGTCTGG TTGAACGTGT GCCACCCACC AAATTGCTGT TCGTCGCCGA GCCGCCGGTG GTCGAGCGTG AACTGATGCG CCTGACCGCC AGATTCGGCG GCGCACTGGC AGTCGTTCGT TCGCACGCCA TCTTTGGCGA ACTGACAGCA CCGCACGTCA GCAAGGGAAA TGCACTATCC GCCCTGGCGC AGTCGCTGGG AGCGCCGCGT GAAGCAGTCC TGGCCATTGG CGATCAGGAG AACGACATCT CGATGATTAC CTGGGCAGGG CTTGGGCTGG CGATGGGGAA TGCGACACCG GCAGTGCGCG CACGAGCGCA TGCCGTGCTG CCGCCGGTCA GCGAGGCTGG CGTTGCCCAC GCGCTGCGAC GCTACGTCTT GAATTGCGCG TCCTGA
|
Protein sequence | MPFQLIALDL DGTVIDHDLT IHPDVRETIA AVQARGIDVT LATGRMFGAA LPFARELDIR APIICYQGAL VRHPLTGDTL YHAAMPAELA AAAVRELLDA DMCVVAYIDD IHHITAYRPE LERYLAFHPE GTEMVVTPDL DRLVERVPPT KLLFVAEPPV VERELMRLTA RFGGALAVVR SHAIFGELTA PHVSKGNALS ALAQSLGAPR EAVLAIGDQE NDISMITWAG LGLAMGNATP AVRARAHAVL PPVSEAGVAH ALRRYVLNCA S
|
| |