Gene Rcas_4305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4305 
Symbol 
ID5541816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5554341 
End bp5555879 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content61% 
IMG OID640896411 
Productsulfatase 
Protein accessionYP_001434349 
Protein GI156744220 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCGGC GCCCTGATAT TGTGTTGCTC GTACTGGATA CCCAGCGTAT CGATAGACTT 
TCATGCTACG GCTATTCCCG ACCGACTTCG CCCCACCTCG ATGATCTTGC CGCCGACGCG
ACCCTGTTCC GCCGCGTGTT TGCCACGTCG CAATGGACCA TCCCTTCGCA TGCATCGATG
TTTACCGGTC TCTACGCTGC CGAACATATG ACGAATCAGT CGTCTGCGGC GCTCCCTGCA
AGCATTCCCA CCCTGGCAGA GCGTCTGCGC GACGGCGGGT ATATGACGGC GGCATTCTGC
AACAACCCGC TCGTCGGTGT GGTCAACAAC GGTTTGAGGC GCGGCTTTGA GAGTTTTCTG
AACTACAGCG GTTTGATGAC ATCGCGCCCC AACCAGGCAG GCGCGCATCC TGGCATAATC
AGCCGCTACC GCCAATGGTT CAAAGGGCGT CTGGCGGAGA CGCTTAACCG CATTCAGGAC
GCATTCGCGC ACTCCGAGAC GATGCTCGAA TTCGCGTTTA CGCCGTTGAT GGTGCCGCTG
TGGCAGACGG CGCTCAGTTT CAAGGGCAAC ACGCCTAAAT CGCTCAACGA CGCAGCGCGT
TTGCTGATCG AGCGGCGCGG CGTGGCACGC AACCAGCCAA TCTTCGCTTT CATCAACGTC
ATGGGGGTCC ATACCCCATA CCATCCCGAT CGCCGCATGC TCGAACGATT TGCGCCGGAG
GTGATCCGCA ACCGCGAGGC GGCACGCTAT GTGCGGCGCT TCAACAGTGA TGTGTTTGGC
TGGCTGGCGC CGTTCTCCGG CGTCGATGAA CGGTATCACC ACGTGCTCAG CGATGTCTAC
GACGCAGAAG TTGCCACCCA GGACGCACAC ATTGGCGCTT TCCTGCGGCG TTTGCGTGAA
AGCGGCGTTC TTGATCGGAC GCTGCTCCTG GTGTGCGCCG ACCACGGCGA TCACCTGGGT
GAGAAAGGGC TGATCGGGCA TACAGTGTCG GCATACAACG AACTGGTGCA TGTACCGCTG
ATGGTGCGCG ATCCATTCGG CGACTTTCAA CGGAGCGCAG TGGTTGATCA CACGGTTTCA
CTTCGACGGG TCTTCCACAC GCTGTTGAGC GCCGCCGGGC TTGCCAGCAG CATCGAGCGC
GACCGGTCGC TGGCGCAGAC GCCAACCGCC GATCCCGAGG GGGGTGCCGT CTTCGTCGAG
GCGGAACCAT TGCAGAATGT GCTGGGGATC ATGCTGCGCC GCCAGCCGGA CCTGGCGCGC
GCCCGCCGGT TCGATCAACC GCGCCGCGCA GTGATCAGCG GATCGCACAA ACTGATCCAG
ACCGGCAATG ACCATGTGGA GTTGTACGAC CTGGACGCCG ATCCGCGTGA AACCGTCGAT
CTGGCGGCAA TCCTGCCGGA ACGTGTCGAG GAATTGCAAG AACGTCTCAG TGCATTTGTG
CGGCGAATCA GCGCCAGCGC GCCATCGATC CGGCGCGCCG AAGGCGTGGA CGATCCCGCT
GTGCAGCGCC GTTTGAAGGA GTTGGGGTAT CTGGAGTAG
 
Protein sequence
MSRRPDIVLL VLDTQRIDRL SCYGYSRPTS PHLDDLAADA TLFRRVFATS QWTIPSHASM 
FTGLYAAEHM TNQSSAALPA SIPTLAERLR DGGYMTAAFC NNPLVGVVNN GLRRGFESFL
NYSGLMTSRP NQAGAHPGII SRYRQWFKGR LAETLNRIQD AFAHSETMLE FAFTPLMVPL
WQTALSFKGN TPKSLNDAAR LLIERRGVAR NQPIFAFINV MGVHTPYHPD RRMLERFAPE
VIRNREAARY VRRFNSDVFG WLAPFSGVDE RYHHVLSDVY DAEVATQDAH IGAFLRRLRE
SGVLDRTLLL VCADHGDHLG EKGLIGHTVS AYNELVHVPL MVRDPFGDFQ RSAVVDHTVS
LRRVFHTLLS AAGLASSIER DRSLAQTPTA DPEGGAVFVE AEPLQNVLGI MLRRQPDLAR
ARRFDQPRRA VISGSHKLIQ TGNDHVELYD LDADPRETVD LAAILPERVE ELQERLSAFV
RRISASAPSI RRAEGVDDPA VQRRLKELGY LE