Gene SeD_A1783 details

Gene Information

Locus tag: SeD_A1783
Symbol: treZ
ID: 6872701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1726024
End bp: 1727808
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 54%
IMG OID: 642784918
Product: malto-oligosyltrehalose trehalohydrolase
Protein accession: YP_002215586
Protein GI: 198244502
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0296] 1,4-alpha-glucan branching enzyme
TIGRFAM ID: [TIGR02402] malto-oligosyltrehalose trehalohydrolase
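
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information record above.
start_bp = 1726024
end_bp = 1727808
gene_length_bp = 1785
protein_length_aa = 594


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, ASSUMING one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Each flag is True when the listed fields agree under the assumptions above.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks pass: 1727808 - 1726024 + 1 = 1785 bp, and 1785 / 3 - 1 = 594 aa.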


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.625431
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCAA AAATTTTTTG CAAAAGCTGG GGGGCTGAAT ATATCGCCGC TGATGTTGTC 
CGCTTTCGTC TTTGGGCCAC CGGTCAGCAA AAGGTTATGC TCAGGCTTGC GGGTAAAGAC
CAGGAAATGC AGGCGAGCGG TGACGGCTGG TTTACGCTGG ACGTCTCCGG GGTGACGCCA
GGTACGGAGT ATAACTTTGT ACTCAACGAT GGCATGGTGG TCCCCGATCC GGCTTCCCGC
GCCCAAAAAA CTGACGTCAA CGGTCCGTCA TATGTGGTTG ATCCAGGCAG CTACACGTGG
CGCAACACCG GGTGGAAAGG TAGCCGTTGG GAGCAGGCCG TGGTGTATGA GATGCATACA
GGCACGTTCA CTCCGGAAGG CACCTTCCGC GCCGCAATAG CGAAGCTGCC TTATCTCGCT
GAACTCGGCG TTACCGTTAT TGAAGTGATG CCCGTTGCGC AATTTGGCGG CGAGCGTGGC
TGGGGCTATG ACGGCGTACT GCTTTACGCG CCGCATTCTG CCTATGGGAC GCCGGATGAT
TTCAAGGCGT TTATTGACGC CGCGCATGGG TATGGTCTTT CCGTCGTCCT GGATATTGTG
CTGAACCATT TCGGCCCGGA GGGAAATTAT TTACCGCTAT TGGCGCCGGC GTTTTTCCAC
AAAGAGCGCA TGACGCCGTG GGGAAATGGT ATCGCCTATG ATGTCGACGC CGTGCGGCGC
TATATCATCG AGGCGCCGTT ATACTGGCTG ACAGAATACC ATCTCGACGG CTTACGCTTT
GACGCTATCG ATCAGATTGA GGACAGTAGC GCCAGGCATG TGCTGGTTGA AATCGCACAA
CGTATTCGGG AAGACATTAC CGACAGACCC ATTCATCTGA CTACCGAAGA TAGCCGCAAT
ATTATTTCTC TGCATCCCCG TGATCAGGAT GGCAATGCGC CGCTGTTTAC CGCCGAATGG
AATGACGATT TTCATAATGC CGTCCACGTT TTTGCGACCG GAGAGACCCA GGCCTACTAC
AACGATTTTG CTGATACCCC GGAAAAACAC CTCGCGAGAG CGCTGGCCGA AGGATTCGCT
TATCAGGGAG AAATTTCCCC CCAAACCGGC GAACCTCGCG GCGTAAAAAG TACCGGACAA
CCTCCGGTCG CCTTTGTGGA TTTTATTCAG AACCACGATC AGGTCGGTAA CCGCGCCCAG
GGCGACAGAC TGATAACCCT GGCGGGCGCT GAACGAACAA AAGTATTGCT CGCCACGTTG
CTGCTTTCAC CGCATATTCC GCTGCTTTTT ATGGGCGAAG AGTATGGCGA AAGCCGTCCT
TTTCTTTTTT TTACCGATTT CCATGGGGAT TTAGCCCGCG CCGTTCGTGA AGGTCGCGCA
AAAGAGTTTG CCGATCATGC AGGGGAAAAT GTTCCGGACC CGAATGCGCC AGAGACCTTT
CAACGCTCAA AACTTAACTG GAAGCAACAG CACAGTGAAG AGGGTAAAGC GTGGCTGGCA
TTTACCCGCG AACTACTGCT TTTGCGCCAG AAGCATATCG TGCCGCTGTT GTCCGCTGCC
CGTGAGAGCT CAGGAACGGT ATTGCAAACC GCGCCCGGGT TTATTGCCGT TAGCTGGCGT
TTTCCGGGAG GAACGCTGTC ACTGGCGCTG AATATTAGCG CCACGACGGT ATTGCTGCCC
GATTTACCGG GTAAGACCCT CTTCGCCTGG CCGAATGAAT CCACCGGGTC GCTTTCCCAA
CATTCTCTTA TTGTCCGCTT AGCCCAGGGA GAGTCTGCAT CATGA
 
Protein sequence
MSSKIFCKSW GAEYIAADVV RFRLWATGQQ KVMLRLAGKD QEMQASGDGW FTLDVSGVTP 
GTEYNFVLND GMVVPDPASR AQKTDVNGPS YVVDPGSYTW RNTGWKGSRW EQAVVYEMHT
GTFTPEGTFR AAIAKLPYLA ELGVTVIEVM PVAQFGGERG WGYDGVLLYA PHSAYGTPDD
FKAFIDAAHG YGLSVVLDIV LNHFGPEGNY LPLLAPAFFH KERMTPWGNG IAYDVDAVRR
YIIEAPLYWL TEYHLDGLRF DAIDQIEDSS ARHVLVEIAQ RIREDITDRP IHLTTEDSRN
IISLHPRDQD GNAPLFTAEW NDDFHNAVHV FATGETQAYY NDFADTPEKH LARALAEGFA
YQGEISPQTG EPRGVKSTGQ PPVAFVDFIQ NHDQVGNRAQ GDRLITLAGA ERTKVLLATL
LLSPHIPLLF MGEEYGESRP FLFFTDFHGD LARAVREGRA KEFADHAGEN VPDPNAPETF
QRSKLNWKQQ HSEEGKAWLA FTRELLLLRQ KHIVPLLSAA RESSGTVLQT APGFIAVSWR
FPGGTLSLAL NISATTVLLP DLPGKTLFAW PNESTGSLSQ HSLIVRLAQG ESAS
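
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before analysis. The sketch below is a minimal, hypothetical helper set, not the database's own tooling. It strips the block spacing, computes a GC fraction, and translates DNA using the standard amino-acid assignments, which translation table 11 is understood to share with table 1 (the two tables differ in permitted start codons). Only the first line of the gene sequence is used as input.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons with unknown characters become 'X'; a trailing partial codon is ignored.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGAGTTCAA AAATTTTTTG CAAAAGCTGG GGGGCTGAAT ATATCGCCGC TGATGTTGTC"
dna = clean(first_line)
print(translate(dna))
print(round(gc_fraction(dna), 3))
```

Translating these 60 bases gives MSSKIFCKSWGAEYIAADVV, which matches the first 20 residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.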