Gene Smal_4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_4042 
Symbol 
ID6474936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp4568752 
End bp4571424 
Gene Length2673 bp 
Protein Length890 aa 
Translation table11 
GC content66% 
IMG OID642733255 
Productpolysaccharide deacetylase 
Protein accessionYP_002030424 
Protein GI194367814 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.785227 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGTG CGTCGTCGTT CCGCCTGTTC CTGCCCTCGC TGTTGTTGAC CCTGGTCGTT 
GCCGGCTGTG GCGACAAGGA CGCGAAGGCA CCGGTAACCG GCGTGACCCA GCCCAGCGCA
CAGGCCGCGG CCGCCGCCGA TCCTGCCGCC GCGCCGTTGC TGGCTGCGCT GCAGAAGCAG
CTGGATGGCT ACCGCCGGAT CATCGTGCTG CTGGCCGACG AAGAACAGCA GTCGGCCGCC
GACCGCGGCA CCTCCACGCG TGTGGGCCAG CAGCTGTTCC ACGATGGCCT GGAACAACGC
ACGGTCATCG CCGCGCAGTT CGACACCCTG CTGCGCGGTT CCAGCCCGCA GCGCTTCGCC
ACCCTCGGCA CGGTGCTGGA TTACATCGAA TCCGCACCGG AACTGTTCGA TGCCGATCGC
CTGGCCTTCC GCGAAGTACT GCGTGATCTG CACGAGCGGG TCGGTACCGA TTCGTCGCTG
CCGGCGGTGA AGCTGCACCA GCGCATCGGC GAGGACCTGG AGGCGCTCGA CGAGATCGAA
CGCAACTACA ACCAGGAACT GACCCGCATC TTCAGCCGCT TCGAGCGCAC CCGCGCCATC
GAGCTGAAGC GCGAGAAGTG GGACGACTAC ATCGCCCACC TGCACAAGGA TTACAGCCGC
GAAGCGATCC TGCGCGACTA CGGGGTGATC GAGCCGTACC CGATGTCGAT GAAGGACAGC
GACCGCGAGA TCTTCGGCCG CGACCTGCCC GCCAAGACCG TGGTGCTGAC CTTCGACGAT
GGCCCGCACA AGGCCTACAC CGATGAAGTG GTGGCGATCC TCAAGCGCTA CGACGTACCG
GGCGTGTTCT TCGAAGTCGG CCGCAACCTC GGCAAGGTCG AGAGTGATGG CAAAGTCAGC
CTCGGGCCGA TGGCGAAGAT CAGCCGCAAC CTGATGGAAG AAGGCTATGC GGTCGGCAAC
CACAGCCTGA CCCACGCCCA GCTGTCGCGC ACCACCGGCG ATGCGCTGCG CCAGCAGGTG
CTCGACACCG ATACGCTGCT GAAGGATGTC GACAGCAAGC GCGCGCCGTT GTTCCGCTTC
CCGTACGGCG CGCGCAATGC CGAAGGCCTG CAGCTGCTCA ACGAGGCCGG ACTGAAGTCG
ATCATGTGGA ACATCGACTC GATGGACTGG GCCGACCCGG TGCCGGAGTC GATCGTGCAG
CGCGTGCTCG ACCAGGTGAA CAAGGAACAG CGCGGCATCA TCCTGTTCCA CGACATCCAC
GACCGGGCGG TGAAGGCGCT GCCGCAGATT CTCGACCGGC TGATTGCCGA CGGCTACCAG
TTCGCCGGCT GGAATGGCCG TGAATTCACT GTCGCCCGCG CGCGCAAGGG CGAGGCCCGT
GCCGCCACCG TCACCACCGG CTACGAGAAG TCGTGGGCGA TCGTGGTCGG CATCGACACC
TACGCCAAGT GGCCGAAGCT GGAGTACGCC AGCCATGACG CACAGGCGGT GGCCGATACC
CTGACCGGGC GGTTCGGCTT CCCGTCCTCG CAGGTGATCG TGCTGAAGAA CGAGCAGGCC
ACCCGCAACA ACATCCTGGC CGCCTTCCAC GACCGCCTGG CCGATGACCG CACCGGCAAG
AACGACCGCG TGTTCGTGTT CTTCGCCGGC CATGGCGCGA CCCGCCAGCT CGCCTCCGGG
CGCGATCTCG GCTACATCAT CCCGGTCGAT TCGGATCCGA AGGAATTTGC CACCGACGCC
ATCGCGATGA CCGACATCCA GAACATCGCC GAGAGCATGC AGGCCAAGCA TGTGATGTTC
GTGATGGATG CCTGCTACAG CGGCCTGGGC CTGACCCGCG GTGGCCCGTC TTCGTCGTCG
TTCCTGCGCG AGAACGCACG CCGCAGCGCG CGGCAGATGC TGACCGCCGG CGGTGCCGAC
CAGCAGGTGG CCGATGCCGG CCCGAACGGC CATTCGGTGT TCACCTGGGT GCTGCTGCAG
GCGCTGGCCG GCAAGGGGGA CCTCAATGGC GATGGCTTGA TCACCGGCAC CGAACTGGCC
GCCTACGTGG CGCCGGCGGT GTCCGCGGTT TCACACCAGA CACCGGCCTT CGGCAGCCTG
CCCGGTTCGC AGGGCGGCGA GTTCGTGTTC CAGGTGCCGG ACAGCCAGGA ATTCCTCAAT
GCCGACACGC GCCAGCTGAC TGCCGATGCG ATCGCGCTGA ACAACAAGGT GGATGCCGCC
AGCGAAGCCA AGGGTGCTCA GGCACCAGTG ACCGTGGCCG ACCTGCAGGG CGGCAAGGCC
AAGCTGGTGG TGCCCACTGC CGGCCCGGCC TCGGACCGCC AGCGCGCGCA GCAGGCCAAC
GACCGTGGCC TGCAGCTGTA CCGCGAGAAG CAGTACGACG AAGCCGTCGC GCAGTTCACC
GAAGCGTTGA AGCTGCGCCC GGACTTCGCC CAGGCCGCCA ACAACCTGGG CTTTGTCTAC
TACCGCCAGC AACGCTATGC CGAAGCTGCG CGCTGGCTGG AGAACACGCT GAAGATCGAC
CCGTCGCGTG CGGTGGCCTA TCTGAATCTT GGCGATGCCT ACTTCAACGC GGGCGACAAG
GCCAAGGCCA AACAGGCTTA CACCACCTAC CTTGCGCTGC AGCCGCAGGG CAGCGGCGCA
GCACAGGCGC GCGCGCAGCT GGAGAAACTC TGA
 
Protein sequence
MPRASSFRLF LPSLLLTLVV AGCGDKDAKA PVTGVTQPSA QAAAAADPAA APLLAALQKQ 
LDGYRRIIVL LADEEQQSAA DRGTSTRVGQ QLFHDGLEQR TVIAAQFDTL LRGSSPQRFA
TLGTVLDYIE SAPELFDADR LAFREVLRDL HERVGTDSSL PAVKLHQRIG EDLEALDEIE
RNYNQELTRI FSRFERTRAI ELKREKWDDY IAHLHKDYSR EAILRDYGVI EPYPMSMKDS
DREIFGRDLP AKTVVLTFDD GPHKAYTDEV VAILKRYDVP GVFFEVGRNL GKVESDGKVS
LGPMAKISRN LMEEGYAVGN HSLTHAQLSR TTGDALRQQV LDTDTLLKDV DSKRAPLFRF
PYGARNAEGL QLLNEAGLKS IMWNIDSMDW ADPVPESIVQ RVLDQVNKEQ RGIILFHDIH
DRAVKALPQI LDRLIADGYQ FAGWNGREFT VARARKGEAR AATVTTGYEK SWAIVVGIDT
YAKWPKLEYA SHDAQAVADT LTGRFGFPSS QVIVLKNEQA TRNNILAAFH DRLADDRTGK
NDRVFVFFAG HGATRQLASG RDLGYIIPVD SDPKEFATDA IAMTDIQNIA ESMQAKHVMF
VMDACYSGLG LTRGGPSSSS FLRENARRSA RQMLTAGGAD QQVADAGPNG HSVFTWVLLQ
ALAGKGDLNG DGLITGTELA AYVAPAVSAV SHQTPAFGSL PGSQGGEFVF QVPDSQEFLN
ADTRQLTADA IALNNKVDAA SEAKGAQAPV TVADLQGGKA KLVVPTAGPA SDRQRAQQAN
DRGLQLYREK QYDEAVAQFT EALKLRPDFA QAANNLGFVY YRQQRYAEAA RWLENTLKID
PSRAVAYLNL GDAYFNAGDK AKAKQAYTTY LALQPQGSGA AQARAQLEKL