Gene Rcas_0065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0065 
Symbol 
ID5537524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp80524 
End bp81621 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content64% 
IMG OID640892231 
Productpolysaccharide deacetylase 
Protein accessionYP_001430221 
Protein GI156740092 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00184975 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0058603 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACAGC GCAACACAAG TCTCCCTCTC ATAGCCCTGA CCCTGCTGAC GGGAATCGCG 
CTTGGCTGGT TTCTCAACGA CCTGATTCGT CGTCCCGCAG CGCCGTTGGC GCTCGCACCG
GCTTCACCGC CTGCCGTTGC CGCGCCGACT GCCAGCATCT CGGTTGCGCC AACTACTGCT
GTGTCGATTG AGCCAACCAC TGCTGTGTCG GTCGCGCCGC CGGCGCTGCC ATCCGTGGCG
ACTCCTGCGC CCGCCGTGAT GCCCTCGCCG CTTACATCCG AGCCATTGCC CACAGCCACG
CCCCCGCTAC GTATCGTCGG CTATGCCGGG CATCGAGTCG CTGCGGGTGA GACGCTGGAG
ACCATCGCGG ACCGCTATGG GAGCACGGTG GCGCTTATTG AAACATACAA TCGACTTGAC
GCGCCACCGC GTATTGGAAG GGAACTGGTT GTGCCGTTGC TGGCGCCAAC CGACGCAGGT
GATGCGCTGC TGGTGCGGCG CGGCGGCGCA GAGCGTCCCT GGATCGCGTT GACCCTCGAC
GCTGGCGCAG GCGCGGCGCC GACGCCACGC ATCCTGGCAG CGCTGCGTGA GCGCGGCATC
ACCATCACCT TCTTCCTCAC CGGACGCTGG ATGCGCGCCA ATCCCGACCT GGTGCGCCAG
ATGGTGGCGG ATGGACACGA ACTTGCCAAT CACACGGTGA ATCATCCCGA TCTGACGACG
CTCGACGACA ATACCATCCG CCGTGAACTG AACGAGACCG AGGCTATTTT GCATGAGATT
GCTCCAGGCG CGACGACACG CCCCTTCTTC CGCCCTCCCT ACGGCGCCTA CAATGAGCGG
GTGCTGCGCG CGGCACTGGC GGAAGGGTAC CTGCCCATCT ACTGGACACT CGACAGCCTG
GACTCGGTCG GCGAACCGAA GACGCCTGAG TTCCTCGTCG AGCGAGTCAC CGGGAAACTC
AGCCAGAACG ATCTGCGCGG CGCGATCATT CTGGCACACT GTGGCAGCGA AGCAACTGCT
GATGCGTTAC CGGAGATTCT GGATCGATTT GCGGTAATGG GCTTCGAGGT GCGGAAATTG
TCGGATGTGC TCAAGTGA
 
Protein sequence
MEQRNTSLPL IALTLLTGIA LGWFLNDLIR RPAAPLALAP ASPPAVAAPT ASISVAPTTA 
VSIEPTTAVS VAPPALPSVA TPAPAVMPSP LTSEPLPTAT PPLRIVGYAG HRVAAGETLE
TIADRYGSTV ALIETYNRLD APPRIGRELV VPLLAPTDAG DALLVRRGGA ERPWIALTLD
AGAGAAPTPR ILAALRERGI TITFFLTGRW MRANPDLVRQ MVADGHELAN HTVNHPDLTT
LDDNTIRREL NETEAILHEI APGATTRPFF RPPYGAYNER VLRAALAEGY LPIYWTLDSL
DSVGEPKTPE FLVERVTGKL SQNDLRGAII LAHCGSEATA DALPEILDRF AVMGFEVRKL
SDVLK