Gene Rcas_4141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4141 
Symbol 
ID5541652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5361925 
End bp5363790 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content65% 
IMG OID640896252 
Productpolysaccharide deacetylase 
Protein accessionYP_001434190 
Protein GI156744061 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.666658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0171825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAAA TGTGGCGAAG GGCGGGCGTG ATGGCGGCAA TCGCGCTCGG CGTCATGCTA 
ATCGGGACGA GTATCGCGCT CACGCAGGAA CGCCTCGGGC GCTGCGCGGT TGTGCCGTTG
CTTGCCGGCG ATCTGTTAGC AGCGCCCGAT TTTGCGAACC TGGCGCCAGG CGCCCCATCC
GGCGTGCTGC TGCCGGTCGG GTGGAGTGCG GCAGCAACCG GCGTGCAGAT CGGCGACTTT
ACCGTTTCCG GTCGTGGGCG TTCGTTTCAG TTGCTCGGCA TTGCCAACCA TCTGCGCACC
CCTGATGTTG CTGTGCGCCC TGGCGCGTCG TACTGCGCCG TCGCGCAGGC GCTCGCCGAT
TCGGTCTCAG CGACGCAGGT GCGTCTGGGG TTTCACTGGA TCGACGCCGG AGGCGATGTT
CTGCGCACCG ATTGGAGTGG CTGGCAGGAG GTGCGGCGCT GGAATGGACC CGCCGATGTG
CAGCCCTGGT CGCAGATTGG CGGTGGGTTT ATTGCTCCTG CCGGCGCAGT GCGTCTCACG
GTCTCGTTCC ATCCCGCCTC TGATGACCGG ATATACCTGG ATACGATCCG CATCCGTCCA
GGACGCTTTC CGGCTTCTGG CGAGATGCCG CCGCAGTCGC CAGAGCGACC GACTGGAGTG
ACAGTGCTTC CCTGGCCCAA CGGCGCGCGC GCGGCGCTCT CATTTTCGTT CGACTGGGAA
ACGACGATGG GCGGGTTGAT CCACTCACGC TCGGTTGATG ACCCGAACTT CGATCAGGAT
CCGGTGCTGC GCGGGATGCG GATGCGCGAG GGGGTGACGA CCACCGTGGA CCTCTTCCGT
CCCTACGGCA TCCGCGCCAC GTATTATGCG ACGGGGTACA ATTTCCTGAG CGGTAACCGT
GAACGGCGAC GCTTTATGGG CGATCCGACG TTTGCCTGGG CGAACCGCGC CAATCGCTGG
CAGACCGACG CCTGGCAGCA GCAGCCCTGG TTCTCGCCCG ATCCGTATGG CACGGTCGCA
ACCGATCCCG CCTGGTACTT TGGCGACCTG ATTCCGATCT TGCAGCATGA GGGACACGAT
ATTCAGAGCC ATACGTTCAG CCATCTGTAT GGTGGGCTTG CCAGCGCCGA GGAGTGGCGC
AGCGACCTTT CCGAGTGGCG TGCCGTTGCC GCAGAACGTG GCGTTCCATC AGCGCGGTCG
CTGGCGTTTC CCTGGAGCAG CAGCGCCGGA ATGAGTTACG CAAACTGGCA GGCGCTGGCA
GAAGCGGGCA TCACATCGGT CACCCGCACC AACTGGAATC CGCGCCAGCC GCAGTATCAC
CTTGTGAGCC GGGAGGATCC GCACTGCCGC CCCATTCCAG GGCACGAAAC CATTCTGGCA
TGCCCGGATT TCTACCTGAC CGAACGCAGC GCTGCGCAGG CGCCGGACGT GATTGAGCGG
GCAATTGCCG CCGGTGGCAT GATCGACCTG TGGGCGCATA CGGAGGAGGT GGTCAGCCCG
GCTCAGATTG CCGCCTGGAG CGAGGTGGTG CGGTACGCTG CCGCGCGCCG TGACGCTGGC
GATCTCTGGA TCGCGCCGCT GGCAGAGATT GCCGACCGGC AGCAGGCAGT GGCGCAGGTG
CATGTCGAGG AGCACAAATC CGAACCCGTG AACGCCGATT CTTTCCAGGC AGCGCCTTTG
CGTCTGGCAG TGACCAATCG CAGCGCGCGC AATCTGGCAG GATTGACGCT CAGGTTGCCC
TTCGACGCGC ATCGGGCGAC GGTGCAGCAC GCGAATGACG CCGTCAACCC CCTCATTCGC
GGCGCGATGC TGGTCTTCGA TCTGGCAGCC GGCGAGACAG TCGAGGTGAC CGTATGGCCG
GCATAG
 
Protein sequence
MTEMWRRAGV MAAIALGVML IGTSIALTQE RLGRCAVVPL LAGDLLAAPD FANLAPGAPS 
GVLLPVGWSA AATGVQIGDF TVSGRGRSFQ LLGIANHLRT PDVAVRPGAS YCAVAQALAD
SVSATQVRLG FHWIDAGGDV LRTDWSGWQE VRRWNGPADV QPWSQIGGGF IAPAGAVRLT
VSFHPASDDR IYLDTIRIRP GRFPASGEMP PQSPERPTGV TVLPWPNGAR AALSFSFDWE
TTMGGLIHSR SVDDPNFDQD PVLRGMRMRE GVTTTVDLFR PYGIRATYYA TGYNFLSGNR
ERRRFMGDPT FAWANRANRW QTDAWQQQPW FSPDPYGTVA TDPAWYFGDL IPILQHEGHD
IQSHTFSHLY GGLASAEEWR SDLSEWRAVA AERGVPSARS LAFPWSSSAG MSYANWQALA
EAGITSVTRT NWNPRQPQYH LVSREDPHCR PIPGHETILA CPDFYLTERS AAQAPDVIER
AIAAGGMIDL WAHTEEVVSP AQIAAWSEVV RYAAARRDAG DLWIAPLAEI ADRQQAVAQV
HVEEHKSEPV NADSFQAAPL RLAVTNRSAR NLAGLTLRLP FDAHRATVQH ANDAVNPLIR
GAMLVFDLAA GETVEVTVWP A