Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4305 |
Symbol | |
ID | 5541816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5554341 |
End bp | 5555879 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640896411 |
Product | sulfatase |
Protein accession | YP_001434349 |
Protein GI | 156744220 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTCGGC GCCCTGATAT TGTGTTGCTC GTACTGGATA CCCAGCGTAT CGATAGACTT TCATGCTACG GCTATTCCCG ACCGACTTCG CCCCACCTCG ATGATCTTGC CGCCGACGCG ACCCTGTTCC GCCGCGTGTT TGCCACGTCG CAATGGACCA TCCCTTCGCA TGCATCGATG TTTACCGGTC TCTACGCTGC CGAACATATG ACGAATCAGT CGTCTGCGGC GCTCCCTGCA AGCATTCCCA CCCTGGCAGA GCGTCTGCGC GACGGCGGGT ATATGACGGC GGCATTCTGC AACAACCCGC TCGTCGGTGT GGTCAACAAC GGTTTGAGGC GCGGCTTTGA GAGTTTTCTG AACTACAGCG GTTTGATGAC ATCGCGCCCC AACCAGGCAG GCGCGCATCC TGGCATAATC AGCCGCTACC GCCAATGGTT CAAAGGGCGT CTGGCGGAGA CGCTTAACCG CATTCAGGAC GCATTCGCGC ACTCCGAGAC GATGCTCGAA TTCGCGTTTA CGCCGTTGAT GGTGCCGCTG TGGCAGACGG CGCTCAGTTT CAAGGGCAAC ACGCCTAAAT CGCTCAACGA CGCAGCGCGT TTGCTGATCG AGCGGCGCGG CGTGGCACGC AACCAGCCAA TCTTCGCTTT CATCAACGTC ATGGGGGTCC ATACCCCATA CCATCCCGAT CGCCGCATGC TCGAACGATT TGCGCCGGAG GTGATCCGCA ACCGCGAGGC GGCACGCTAT GTGCGGCGCT TCAACAGTGA TGTGTTTGGC TGGCTGGCGC CGTTCTCCGG CGTCGATGAA CGGTATCACC ACGTGCTCAG CGATGTCTAC GACGCAGAAG TTGCCACCCA GGACGCACAC ATTGGCGCTT TCCTGCGGCG TTTGCGTGAA AGCGGCGTTC TTGATCGGAC GCTGCTCCTG GTGTGCGCCG ACCACGGCGA TCACCTGGGT GAGAAAGGGC TGATCGGGCA TACAGTGTCG GCATACAACG AACTGGTGCA TGTACCGCTG ATGGTGCGCG ATCCATTCGG CGACTTTCAA CGGAGCGCAG TGGTTGATCA CACGGTTTCA CTTCGACGGG TCTTCCACAC GCTGTTGAGC GCCGCCGGGC TTGCCAGCAG CATCGAGCGC GACCGGTCGC TGGCGCAGAC GCCAACCGCC GATCCCGAGG GGGGTGCCGT CTTCGTCGAG GCGGAACCAT TGCAGAATGT GCTGGGGATC ATGCTGCGCC GCCAGCCGGA CCTGGCGCGC GCCCGCCGGT TCGATCAACC GCGCCGCGCA GTGATCAGCG GATCGCACAA ACTGATCCAG ACCGGCAATG ACCATGTGGA GTTGTACGAC CTGGACGCCG ATCCGCGTGA AACCGTCGAT CTGGCGGCAA TCCTGCCGGA ACGTGTCGAG GAATTGCAAG AACGTCTCAG TGCATTTGTG CGGCGAATCA GCGCCAGCGC GCCATCGATC CGGCGCGCCG AAGGCGTGGA CGATCCCGCT GTGCAGCGCC GTTTGAAGGA GTTGGGGTAT CTGGAGTAG
|
Protein sequence | MSRRPDIVLL VLDTQRIDRL SCYGYSRPTS PHLDDLAADA TLFRRVFATS QWTIPSHASM FTGLYAAEHM TNQSSAALPA SIPTLAERLR DGGYMTAAFC NNPLVGVVNN GLRRGFESFL NYSGLMTSRP NQAGAHPGII SRYRQWFKGR LAETLNRIQD AFAHSETMLE FAFTPLMVPL WQTALSFKGN TPKSLNDAAR LLIERRGVAR NQPIFAFINV MGVHTPYHPD RRMLERFAPE VIRNREAARY VRRFNSDVFG WLAPFSGVDE RYHHVLSDVY DAEVATQDAH IGAFLRRLRE SGVLDRTLLL VCADHGDHLG EKGLIGHTVS AYNELVHVPL MVRDPFGDFQ RSAVVDHTVS LRRVFHTLLS AAGLASSIER DRSLAQTPTA DPEGGAVFVE AEPLQNVLGI MLRRQPDLAR ARRFDQPRRA VISGSHKLIQ TGNDHVELYD LDADPRETVD LAAILPERVE ELQERLSAFV RRISASAPSI RRAEGVDDPA VQRRLKELGY LE
|
| |