Gene RoseRS_4011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4011 
Symbol 
ID5210994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5018827 
End bp5019909 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content63% 
IMG OID640597600 
Productpolysaccharide deacetylase 
Protein accessionYP_001278306 
Protein GI148658101 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00243359 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGGG GAATGGAAAA GAACAGTACA AGCCTTCCAC TCATTCTTCT CACCCTGCTG 
ACAGGTATTG CGCTGGGATG GTTGCTCAAC GACCTGATGC GGCAACCGCC AGCGCCGCTG
GCGGTTGCCC CGACGCAAGC GCCGACGGCG ACCGCTCGTG CAACATCTGT ACTAACCGCA
CCGGTTCCAT CGTCAACAAT CGCACCCCAA TCGCCGGTCC CTTCACCGCC ACCTGCGGCG
ACGCCTCTGC TGGCGACCGC CATCCCGCCA ATTCCTACCG CATCCCCCCA ACCGCAGGTC
ATTGGCTACG TGGGGCATCG CGCTGCTGCT GGTGAAACCC TGGAACAGAT CGCAGCGCGC
TACCGCACCT CCCCGGCACT GATCAAAGCG TACAATATGA TCAATGCGCC GCTGCGCACC
GGGAGAGAAC TCGTCGTGCC GCTGATCGAA CCGGGCGATG CTGGCGAAGC GCTGCTGGTC
CAGCGTGGCA ATCCGATGCG CCCGTGGGTT GCGCTGACCC TCGACGCTGG CGCAGGTGCA
GCGCCAACGC CGCGCATCCT TGATGCGTTG CGCGTGCGCG GGATCACTAT CACCTTCTTC
CTGACCGGGC GCTGGATACG TGACAATCCC GGTCTGGTTC GCCGCATGGT GGCAGATGGG
CACGAACTCG CCAACCATAC GATGAACCAT CCTGATCTGA CAACCCTGGA CGACGAGGCG
ATTCGCCGCG AACTGAGCGA AACCGAGGCG ATCCTGCACG ACATCGCGCC GGATGCCGTC
ATACGCCCCT TCTTTCGACC GCCGTATGGC GCGTACAACG AGCGGGTCCT GCGTGTGGCG
CTATCAGAAG GATACCTGCC GGTCTACTGG TCGCTCGATA GTCTGGATTC GGTTGGGGAA
CCGAAGACGC CAGAGTTCCT CATCGCGCGG GTGACGCAAA AACTCAGCCC CGACGACCTG
CGCGGCGCAA TTATTCTGGC GCACTGCGGC AGTGATGCAA CCGCCGATGC GCTGCCGGAC
ATCCTGGACC GCTTTGCGGC GATGGGCTTC GAGGTGCGAA AACTCTCGGA CGTCATGCAG
TGA
 
Protein sequence
MDGGMEKNST SLPLILLTLL TGIALGWLLN DLMRQPPAPL AVAPTQAPTA TARATSVLTA 
PVPSSTIAPQ SPVPSPPPAA TPLLATAIPP IPTASPQPQV IGYVGHRAAA GETLEQIAAR
YRTSPALIKA YNMINAPLRT GRELVVPLIE PGDAGEALLV QRGNPMRPWV ALTLDAGAGA
APTPRILDAL RVRGITITFF LTGRWIRDNP GLVRRMVADG HELANHTMNH PDLTTLDDEA
IRRELSETEA ILHDIAPDAV IRPFFRPPYG AYNERVLRVA LSEGYLPVYW SLDSLDSVGE
PKTPEFLIAR VTQKLSPDDL RGAIILAHCG SDATADALPD ILDRFAAMGF EVRKLSDVMQ