Gene Information |
Locus tag | SeD_A1783 |
Symbol | treZ |
ID | 6872701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1726024 |
End bp | 1727808 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642784918 |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_002215586 |
Protein GI | 198244502 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
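The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. Neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1726024, 1727808)
print(length_bp)                  # 1785, matching the Gene Length field
print(protein_length(length_bp))  # 594, matching the Protein Length field
```

Under these assumptions, 1785 bp corresponds to 595 codons, of which the last is the stop codon, giving the 594 aa reported.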
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.625431 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCAA AAATTTTTTG CAAAAGCTGG GGGGCTGAAT ATATCGCCGC TGATGTTGTC CGCTTTCGTC TTTGGGCCAC CGGTCAGCAA AAGGTTATGC TCAGGCTTGC GGGTAAAGAC CAGGAAATGC AGGCGAGCGG TGACGGCTGG TTTACGCTGG ACGTCTCCGG GGTGACGCCA GGTACGGAGT ATAACTTTGT ACTCAACGAT GGCATGGTGG TCCCCGATCC GGCTTCCCGC GCCCAAAAAA CTGACGTCAA CGGTCCGTCA TATGTGGTTG ATCCAGGCAG CTACACGTGG CGCAACACCG GGTGGAAAGG TAGCCGTTGG GAGCAGGCCG TGGTGTATGA GATGCATACA GGCACGTTCA CTCCGGAAGG CACCTTCCGC GCCGCAATAG CGAAGCTGCC TTATCTCGCT GAACTCGGCG TTACCGTTAT TGAAGTGATG CCCGTTGCGC AATTTGGCGG CGAGCGTGGC TGGGGCTATG ACGGCGTACT GCTTTACGCG CCGCATTCTG CCTATGGGAC GCCGGATGAT TTCAAGGCGT TTATTGACGC CGCGCATGGG TATGGTCTTT CCGTCGTCCT GGATATTGTG CTGAACCATT TCGGCCCGGA GGGAAATTAT TTACCGCTAT TGGCGCCGGC GTTTTTCCAC AAAGAGCGCA TGACGCCGTG GGGAAATGGT ATCGCCTATG ATGTCGACGC CGTGCGGCGC TATATCATCG AGGCGCCGTT ATACTGGCTG ACAGAATACC ATCTCGACGG CTTACGCTTT GACGCTATCG ATCAGATTGA GGACAGTAGC GCCAGGCATG TGCTGGTTGA AATCGCACAA CGTATTCGGG AAGACATTAC CGACAGACCC ATTCATCTGA CTACCGAAGA TAGCCGCAAT ATTATTTCTC TGCATCCCCG TGATCAGGAT GGCAATGCGC CGCTGTTTAC CGCCGAATGG AATGACGATT TTCATAATGC CGTCCACGTT TTTGCGACCG GAGAGACCCA GGCCTACTAC AACGATTTTG CTGATACCCC GGAAAAACAC CTCGCGAGAG CGCTGGCCGA AGGATTCGCT TATCAGGGAG AAATTTCCCC CCAAACCGGC GAACCTCGCG GCGTAAAAAG TACCGGACAA CCTCCGGTCG CCTTTGTGGA TTTTATTCAG AACCACGATC AGGTCGGTAA CCGCGCCCAG GGCGACAGAC TGATAACCCT GGCGGGCGCT GAACGAACAA AAGTATTGCT CGCCACGTTG CTGCTTTCAC CGCATATTCC GCTGCTTTTT ATGGGCGAAG AGTATGGCGA AAGCCGTCCT TTTCTTTTTT TTACCGATTT CCATGGGGAT TTAGCCCGCG CCGTTCGTGA AGGTCGCGCA AAAGAGTTTG CCGATCATGC AGGGGAAAAT GTTCCGGACC CGAATGCGCC AGAGACCTTT CAACGCTCAA AACTTAACTG GAAGCAACAG CACAGTGAAG AGGGTAAAGC GTGGCTGGCA TTTACCCGCG AACTACTGCT TTTGCGCCAG AAGCATATCG TGCCGCTGTT GTCCGCTGCC CGTGAGAGCT CAGGAACGGT ATTGCAAACC GCGCCCGGGT TTATTGCCGT TAGCTGGCGT TTTCCGGGAG GAACGCTGTC ACTGGCGCTG AATATTAGCG CCACGACGGT ATTGCTGCCC GATTTACCGG GTAAGACCCT CTTCGCCTGG CCGAATGAAT CCACCGGGTC GCTTTCCCAA CATTCTCTTA TTGTCCGCTT AGCCCAGGGA GAGTCTGCAT CATGA
|
Protein sequence | MSSKIFCKSW GAEYIAADVV RFRLWATGQQ KVMLRLAGKD QEMQASGDGW FTLDVSGVTP GTEYNFVLND GMVVPDPASR AQKTDVNGPS YVVDPGSYTW RNTGWKGSRW EQAVVYEMHT GTFTPEGTFR AAIAKLPYLA ELGVTVIEVM PVAQFGGERG WGYDGVLLYA PHSAYGTPDD FKAFIDAAHG YGLSVVLDIV LNHFGPEGNY LPLLAPAFFH KERMTPWGNG IAYDVDAVRR YIIEAPLYWL TEYHLDGLRF DAIDQIEDSS ARHVLVEIAQ RIREDITDRP IHLTTEDSRN IISLHPRDQD GNAPLFTAEW NDDFHNAVHV FATGETQAYY NDFADTPEKH LARALAEGFA YQGEISPQTG EPRGVKSTGQ PPVAFVDFIQ NHDQVGNRAQ GDRLITLAGA ERTKVLLATL LLSPHIPLLF MGEEYGESRP FLFFTDFHGD LARAVREGRA KEFADHAGEN VPDPNAPETF QRSKLNWKQQ HSEEGKAWLA FTRELLLLRQ KHIVPLLSAA RESSGTVLQT APGFIAVSWR FPGGTLSLAL NISATTVLLP DLPGKTLFAW PNESTGSLSQ HSLIVRLAQG ESAS
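Both sequences are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the gene sequence could be cleaned and its GC fraction recomputed for comparison with the GC content field. Only the first two blocks of the gene sequence are used here as a demonstration, so the result is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First two 10-base blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAGTTCAA AAATTTTTTG")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.2 for this short prefix only
```

Pasting the full Gene sequence field into `clean_sequence` should yield a string whose length can be compared against the Gene Length field, and whose GC fraction can be compared against the rounded GC content value.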