Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3340 |
Symbol | |
ID | 5210317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4188887 |
End bp | 4190509 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640596938 |
Product | sulfatase |
Protein accession | YP_001277651 |
Protein GI | 148657446 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.171629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCACG TTCTTCCATA TGCAGGCGCT CGACGTGGTA TACTAAAGCA ACCCGAACCA GCAGTTCCTG GAGGAGGAGC ATTTGTGAGT CGGCGCCCCG ATATTGTGTT GCTCGTGCTG GATACCCAGC GTTGCGATAG ACTTTCGTGC TACGGCTATT CTCGACCAAC CTCGCCCTGC CTCGATGAGC TTGCGGCTGA AGCGACCCTT TTCCGTCGCG TCTTTGCCAC TGCGCAGTGG ACGATCCCAT CACACGCTTC GATGTTTACC GGTCTCTATC CATCGGAGCA TGCGACCAAC CAATCGTCGG CGGCGCTCCC CTCCGGCATT CCGACGCTGG CGGAACGCCT GCGTGAAGGC GGATATATGA CGGCGGCGTT CTGCAACAAC CCGCTGGTGG GCGTCGTCAA CAACGGGTTG CGGCGCGGTT TTGAGAGTTT TCTGAATTAC AGCGGTCTGC TGACCTCGCG CCCGAATCAG GCAGGCGCGC ATCCGGGACT GATCAGTCGT TATCGTCAGT GGTTCAAGGG TCGTCTGGCG GCAACGCTCA ATCGCATTCA GAACTCATTC GCGCGTTCTG AATTCATGCT GGAATTCGCG TTTACGCCAT TGATGGTGCC AATCTGGCAG ACGGCGCTCA GTTTCAAGGG GAATACGCCC AAATCGCTCA GCGATGCTGC GCGTTTGCTG ATCGAACGGC GCGGCGTTGA GCCGAATCAG CCCATTTTTG CCTTCATCAA CCTGATGGGC GTTCATACGC CGTACCATCC GGATCGGCGG ATGCTCGAAC GGTTCGCGCC GGACGTGATC CGTGACCGGG AAGCAGCGCG CTATGTGCGT CGCTTCAACG GCGATGTGTT CGGCTGGCTT GCGCCATTCT CCAGTATTGA CGAACGGTAT CACCACGTGC TCAGCGATGT GTACGACGCC GAGGTCGCCA CGCAGGATGC GCATCTTGGC GTGTTCCTGC GCCGGATGCG CGAGAGCGGC GCGCTCGACC GCACCCTGCT GCTGGTATGC GCCGACCATG GTGATCACCT GGGCGAAAAA GGTCTCGTCG GGCACACGGT ATCGGTCTAC AACGAACTGA TCCATGTGCC GCTGATGGTG CGCGATCCGG ATGGTGACTT TCCACGAGGT GCGGTGGTCG ATCATCCGGT GTCGTTGCGA CGGGTTTTCC ACACCCTGCT GAGCGCCGCC AGGCTCGCCA GCGGCGTCGA GCGTGATCGC TCGCTGGCAC AATCGCCAGC TGCCGATCCC GATGGCGGCA CGGTCTTCAG TGAGGCAGAA CCGCTGCAAA ACGTTCTGGG GATCATGCTG CGACGCCAGC CCGATCGTGC GCGTGCGCGC CGCTTCGATC AACCGCGCCG CGCGGTGATC AACGGTTCGC ACAAACTGAT CCAGACCGGG GAAGACCAGG TTGAGTTGTA CGATCTCGAT GCTGATCCCC GCGAAACAGT TGACCTGGCG GCAATGTTGC CGGAACGAGT CGAAGAACTT CAGGCGCGTC TCAGCGCCTT TGTGCGCCGC GCCGATGCAA CAGCGCCGCT CATCCGACGC GCTGAGGGCG TGGATGATCC GACCGTGCAG CGGCGTTTGC GAGAACTGGG GTATCTTGAA TAA
|
Protein sequence | MLHVLPYAGA RRGILKQPEP AVPGGGAFVS RRPDIVLLVL DTQRCDRLSC YGYSRPTSPC LDELAAEATL FRRVFATAQW TIPSHASMFT GLYPSEHATN QSSAALPSGI PTLAERLREG GYMTAAFCNN PLVGVVNNGL RRGFESFLNY SGLLTSRPNQ AGAHPGLISR YRQWFKGRLA ATLNRIQNSF ARSEFMLEFA FTPLMVPIWQ TALSFKGNTP KSLSDAARLL IERRGVEPNQ PIFAFINLMG VHTPYHPDRR MLERFAPDVI RDREAARYVR RFNGDVFGWL APFSSIDERY HHVLSDVYDA EVATQDAHLG VFLRRMRESG ALDRTLLLVC ADHGDHLGEK GLVGHTVSVY NELIHVPLMV RDPDGDFPRG AVVDHPVSLR RVFHTLLSAA RLASGVERDR SLAQSPAADP DGGTVFSEAE PLQNVLGIML RRQPDRARAR RFDQPRRAVI NGSHKLIQTG EDQVELYDLD ADPRETVDLA AMLPERVEEL QARLSAFVRR ADATAPLIRR AEGVDDPTVQ RRLRELGYLE
|
| |