Gene Information |
Locus tag | Smal_3814 |
Symbol | |
ID | 6474697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stenotrophomonas maltophilia R551-3 |
Kingdom | Bacteria |
Replicon accession | NC_011071 |
Strand | - |
Start bp | 4293095 |
End bp | 4296454 |
Gene Length | 3360 bp |
Protein Length | 1119 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642733016 |
Product | glycoside hydrolase family 31 |
Protein accession | YP_002030196 |
Protein GI | 194367586 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
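The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4293095, 4296454)
print(length)                     # 3360
print(protein_length_aa(length))  # 1119
```

Under these assumptions the computed values agree with the Gene Length (3360 bp) and Protein Length (1119 aa) fields.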
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAATCT GCGTTCGTCG CTCGCCGCTG ATGCTGGCCC TGTTGCCGGC ATTGATGTTG GCGTCCATGC CGGCCCAGGC CGAGCCGGTC GGCAACCTGC GTTCGGTCAG CGCCAGCGCC AGCCGCGACG GCGTGCAGGG CTGGGACCTG CAGACCGACA AGGGCGCGCG CATCCGCATC GAACTGCCGG CCACCGACAT CATCCGCGTG CAGGCCGGCC GCAACGGCAA GCTCACCGGT GCAGGCGACA AGGCCGCGCC GATCGTGCTG CCGCAGCCCA AGGCGAACGT GCAGGCGCAG CTGGAAGAAG ACGCGCAGGA AATCCGCGTA CGCACCGATG CGCTGGTGCT GCATGTGCAG CGGCAACCGC TGCGCCTGCG CCTGGAGCGC TTGGACAACG GTCAGCCCAC CGCGCTGTGG CAGGAACTGC AGCCGCTGGA CCTGGATGCC ACGCAGAGCG TGCAGGTGCT GTCCTCGCAG GCCGATGAAG GCTATTACGG TGGTGGCCAG CAGAATGGCC GTTACCAGTT CAAGGGCCGC GAGCTGGAAG TGTCCTATTC CGGCGGCTGG GAAGAGGGCG ACCGCCCCAG CCCGGCACCG ATGCTGCTGA GCAGCCGCGG CTGGGGCATG CTGCGCAACA CGTGGAGCGA TGGCAGCTAC GACCTGCGCG AGGCTGACCA GGCCGCGCTG CTGCACCGCG AAGACCGCTT CGATGCGTAC TACTTCGTCG GCGCCGATCT GCCGAAACTG ATCGAGCGCT ATACCCAGCT GACCGGCCGC CCGAACATGG TGGCGCGCTG GGCGCTGTCC TACGGCGATG CCGATTGCTA CAACGACGGC GACAACAGCA AGAAGCCCGG CACCGTGCCC GAAGGCTGGA GCGATGGCCC AACCGGCACC ACGCCGGATG TGATCGACTC GGTGGCCAAG CAGTACCGCG CCAACGACAT GCCCGGTGGC TGGATCCTGC CCAATGATGG TTACGGCTGC GGCTACAAGC AGTTGCCGGA AACGGTGAAG GGGCTGGCCA AGTACGGCTT CCGCACCGGC CTGTGGACCG AGAACGGTGT CGACAAGATC GCCTGGGAAG TGGGCAAGGC CGGCAGCCGC GTGCAGAAGC TGGACGTGGC CTGGACCGGC AAGGGCTACC AGTTCGCGAT GGACGCCAAT CGCCAGGCCT TCAACGGCAT CCTCGACAAT TCCGATTCGC GCCCGTTCCT GTGGACGGTG ATGGGCTGGG CCGGCATCCA GCGCTATGCG GTGGCATGGA CCGGCGACCA GAGCAGCAGC TGGGATTACA TCCGCTGGCA CGTGCCGACC CTGGTCGGGT CGGGCCTGTC CGGCATGGCC TATGCCAGCG GCGATGTCGA TGCGATCTTC GGCGGCAGCG CCGAGACCTT CACCCGCGAC CTGCAGTGGA AGGCGTTCAC GCCGGTGCTG ATGGGCATGA GCGGCTGGTC GTCGAACGCG CGCAAGCACC CGTGGTGGTA CGACGAGCCC TACCGCAGCA TCAACCGCGA TTACCTGAAG TTGAAGATGC GCCTGACCCC GTACATGTAC GGGCTGGTGC ACGAGGCCGC ACAGACCGGC GCGCCGCCGG TGCGTGGCCT GATGTGGGAC AACCCGCGCG ACCCGCACGC GCAGGATGAA ACCTACAAGT ACCAGTTCCT GCTCGGCCGC GACCTGCTGG TGGCGCCGGT GTACCGCAGC CAGGCGGCCA GCCGTGGCTG GCGCCGCGAC ATCCATCTGC CGGCTGGTGG CTGGATCGAC TACTGGGATG GCCGCCGCGT GCAGGCCGCC 
GCCGACGGTC GCCAGCTCGA CCGCCAGGTG GACCTGGCCA CGCTGCCGGT GTTCGTGCGT GCCGGTGCGA TCCTGCCGAT GTACCCGTCG ATGCTGTTCG ACGGCGAAAA GCCGCTGGAT GAAGTGACGT TCGACCTGTA CCCGCAGGGC GAGTCGCAGT ACACGCTGTA CGAAGACGAT GGCAACACCC GCCGCTACCA GCAGGGTGAA TCGAGCACGC AGCAGATCCG CGTGCAGGCA CCTGCGCAAG GCAGTGGTGC GGTACAGGTG CAGATCGACG CGGTGCAGGG CCAGTACAAC GGCCAGCTGG CGCAGCGTCG CTATGGCCTG CGCGTGCTCA GCCGCCAGGC GCCACGCGCG GTGCAGGCTG GTGGTCGCGC GCTGCCGGCG CTGGCCGATG CGGCGGCGTT CAACAACGCC AGCGAAGGCT GGTACTTCGA TGCCAAGGAG CGCCGTGGCA CCGTGCATGT GCGCACCGCC GCGCAGGACA TCCGCCAACC GCTGCAGTTG CAGCTGGACT TCGCGGTGGC CGTCGCTGCT GCCGACGATG CCTATCCGGC CGCGCCGGTG CTGGGCCGCG AACTGCCGGC CGACAGCCTG CTGGTGGTCA ACCGCCCGGC CGAAGAGCCC GGCCATGCGC TGGAAAATGC CTTCGACGAC GATCCGGGCA CCTGGTTCCG CAGCGTGCGC AACCAGGCCG TGCGCACCGG TGCACATGAG TGGGTGATCG GCTTCGGCGA GCGCAGGATG ATCGACGGCA TCGACATCGC ACCGCGCAAC GACAAGAACT GGAAGCACGG CCAGGTCCGC GACTATGAGG TCTACCTGGG CGACAGCAAT GGCGAGTGGG GCGAGCCGAT CACCCGTGGC CGCCTGCAGT TGAAGGAAGG CGTGCAGCGC ATCGACTTCC CGGCCCATGC CGGACGCCTG CTGCGCTTCC GCGTGCTGAG CGTGCAGAAC CCAGAGGGCG ATGGCGCCAG CAGCACCGAC CCGATGGTTA CCGCCGCACA GGGCAGCGCC CGCGCCTTCG ATGCATTGCA GCCGCGCGAC GTCGGCCCGA TCGCGCTGTC CACCTTCCAC ATCCTCGAAC ACCAGGAACC GGAGCGTCCG GCCCGGCAGC GCTATCTCTC CGAGCTGCCG GTGCCGGCCG CGCTGGCCAG CCAGCTGCGC ACCGACCAGT CCTTCCGTGG CGACGCCGGC ATGCGCATGA ACGGCCTGCA GTTCCGTCGT GGCCTGGGCG TCGGCGCCAA CAGCCGCATC GACCTGCGCC TGCAGGGCGG CTGGCACCTG CTGCGTGCCG ATCTCGGCAT CGACGACGCC TGCCGCAGCG CCGGTGGCCT GCAGTTCCAG GTCTGGGGTG ACAACCGCCT GCTGTACGAC AGCGGCCTGG TGAAGGCGCC CGGCGTGGTC AAGCCGGAGC TGGATATCCG AGGCCTTTCC ACCCTGAGCC TGCGCACGCT GGGTGCGCAG GGCAGCCAAC CCGCCCAGGT CTGCGCCAAC TGGGCCAACG CCGTACTGAT CGGCCAGGAG GGCGACTCCG CCAGCATCGT CGCCCCATGA
|
Protein sequence | MEICVRRSPL MLALLPALML ASMPAQAEPV GNLRSVSASA SRDGVQGWDL QTDKGARIRI ELPATDIIRV QAGRNGKLTG AGDKAAPIVL PQPKANVQAQ LEEDAQEIRV RTDALVLHVQ RQPLRLRLER LDNGQPTALW QELQPLDLDA TQSVQVLSSQ ADEGYYGGGQ QNGRYQFKGR ELEVSYSGGW EEGDRPSPAP MLLSSRGWGM LRNTWSDGSY DLREADQAAL LHREDRFDAY YFVGADLPKL IERYTQLTGR PNMVARWALS YGDADCYNDG DNSKKPGTVP EGWSDGPTGT TPDVIDSVAK QYRANDMPGG WILPNDGYGC GYKQLPETVK GLAKYGFRTG LWTENGVDKI AWEVGKAGSR VQKLDVAWTG KGYQFAMDAN RQAFNGILDN SDSRPFLWTV MGWAGIQRYA VAWTGDQSSS WDYIRWHVPT LVGSGLSGMA YASGDVDAIF GGSAETFTRD LQWKAFTPVL MGMSGWSSNA RKHPWWYDEP YRSINRDYLK LKMRLTPYMY GLVHEAAQTG APPVRGLMWD NPRDPHAQDE TYKYQFLLGR DLLVAPVYRS QAASRGWRRD IHLPAGGWID YWDGRRVQAA ADGRQLDRQV DLATLPVFVR AGAILPMYPS MLFDGEKPLD EVTFDLYPQG ESQYTLYEDD GNTRRYQQGE SSTQQIRVQA PAQGSGAVQV QIDAVQGQYN GQLAQRRYGL RVLSRQAPRA VQAGGRALPA LADAAAFNNA SEGWYFDAKE RRGTVHVRTA AQDIRQPLQL QLDFAVAVAA ADDAYPAAPV LGRELPADSL LVVNRPAEEP GHALENAFDD DPGTWFRSVR NQAVRTGAHE WVIGFGERRM IDGIDIAPRN DKNWKHGQVR DYEVYLGDSN GEWGEPITRG RLQLKEGVQR IDFPAHAGRL LRFRVLSVQN PEGDGASSTD PMVTAAQGSA RAFDALQPRD VGPIALSTFH ILEHQEPERP ARQRYLSELP VPAALASQLR TDQSFRGDAG MRMNGLQFRR GLGVGANSRI DLRLQGGWHL LRADLGIDDA CRSAGGLQFQ VWGDNRLLYD SGLVKAPGVV KPELDIRGLS TLSLRTLGAQ GSQPAQVCAN WANAVLIGQE GDSASIVAP
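The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below is a minimal illustration that translates the first 30 bases of the gene sequence shown above. It assumes the sequence is listed 5' to 3' on the coding strand, uses the standard amino acid assignments (which table 11 shares with the standard code), and translates the first codon as M. The helper name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiation codons listed for NCBI translation table 11.
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the first codon as an initiator (M)."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        index = 16 * BASES.index(codon[0]) + 4 * BASES.index(codon[1]) + BASES.index(codon[2])
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate_cds("GTGGAAATCT GCGTTCGTCG CTCGCCGCTG"))  # MEICVRRSPL
```

The output matches the first ten residues of the protein sequence in this record.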