Gene SNSL254_A1674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1674 
SymboltreZ 
ID6484741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1637052 
End bp1638836 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content54% 
IMG OID642737056 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_002040808 
Protein GI194446265 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCAA AAATTTTTTG CAAAAGCTGG GGGGCTGAAT ATATCGCCGC TGATGTTGTC 
CGCTTTCGTC TTTGGGCCAC CGGTCAGCAA AAGGTTATGC TCAGGCTTGC GGGTAAAGAC
CAGGAAATGC AGGCGAGCGG TGACGGCTGG TTTACGCTGG ACGTCTCCGG GGTGATGCCA
AGTACGGAGT ATAACTTTGT ACTCAGCGAT GGCATGGTGG TCCCCGATCC GGCTTCCCGC
GCCCAAAAAA CTGACGTCAA CGGTCCGTCA TATGTGGTTG ATCCAGGCAG CTACGCGTGG
CGCAACACCG GGTGGAAAGG TAGCCGTTGG GAGCAGGCCG TGGTGTACGA GATGCATACA
GGCACGTTCA CTCCGGAAGG CACCTTCCGC GCCGCAATAG CGAAGCTGCC TTATCTCGCT
GAACTCGGCG TTACCGTTAT TGAAGTGATG CCCGTTGCGC AATTTGGCGG CGAGCGCGGC
TGGGGCTATG ACGGCGTACT GCTTTACGCG CCGCATTCTG CCTATGGGAC GCCGGATGAT
TTCAAAGCGT TTATTGACGC CGCGCATGGG TATGGTCTTT CCGTCGTCCT GGATATTGTG
CTGAACCATT TCGGCCCGGA AGGAAATTAT TTACCGCTAT TGGCGCCGGC GTTTTTCCAC
AAAGAGCGCA TGACGCCGTG GGGAAATGGT ATCGCCTATA ATGTCGACGC CGTGCGGCGC
TATATCATCG AGGCGCCGTT ATACTGGCTG ACAGAATACC ATCTCGACGG CTTACGCTTT
GACGCTATCG ATCAGATTGA GGACAGTAGC GCCAGGCATG TGCTGGTTGA AATCGCACAA
CGTATTCGGG AAGACATTAC CGACAGGCCC ATTCATCTGA CCACCGAAGA TAGCCGCAAT
ATTATTTCTC TGCATCCCCG AGATCAGGAT GGCAATGCGC CGCTGTTTAC CGCCGAATGG
AATGACGATT TTCATAATGC CGTCCACGTT TTTGCGACCG GAGAGACCCT GGCCTACTAC
AACGATTTTG CTGATGCCCC GGAAAAACAC CTCGCAAGAG CGCTGGCCGA AGGATTCGCT
TATCAGGGAG AAATTTCACC GCAAACCGGC GAACCTCGCG GCGTAAAAAG TACCGGACAA
CCCCCGGTCG CTTTTGTGGA TTTTATTCAG AATCACGATC AGGTCGGTAA CCGCGCCCAG
GGCGACAGAC TGATAACCCT GGCGGGCGCT GAACGAACAA AAGTATTGCT CGCCACGTTG
CTGCTTTCAC CGCATATTCC GCTGCTTTTT ATGGGCGAAG AGTATGGCGA AAGCCGTCCT
TTTCTTTTTT TTACCGATTT CCATGGGGAT TTAGCCCGCG CCGTTCGTGA AGGTCGCGCA
AAAGAGTTTG CCGATCATGC AGGGGAGAAT GTTCCGGACC CGAATGCGCC AGAGACCTTT
CAACGCTCAA AACTTAACTG GAAGCAACAG CACAGTGAAG AGGGTAAAGC GTGGCTGGCG
TTTACCCGCG AACTACTGCT TTTGCGCCAG AAGCATATCG TGCCGCTGTT GTCCGCTGCC
CGTGAGAGCT CAGGAACGGT ATTGCAAACC GCGCCCGGGT TTATTGCCGT TAGCTGGCCT
TTTCCGGGAG GAACACTGTC ACTGGCGCTG AATATTAGCG CCACGACAGT ATTGCTGCCC
GATTTACCGG GTAAGACTCT CTTCGCCTGG CCGAATGAAT CCACCGGGTC GCTTTCCCAA
CATTCTCTTA TTGTCCGCTT AGCCCAGGGA GAGTCTGCAT CATGA
 
Protein sequence
MSSKIFCKSW GAEYIAADVV RFRLWATGQQ KVMLRLAGKD QEMQASGDGW FTLDVSGVMP 
STEYNFVLSD GMVVPDPASR AQKTDVNGPS YVVDPGSYAW RNTGWKGSRW EQAVVYEMHT
GTFTPEGTFR AAIAKLPYLA ELGVTVIEVM PVAQFGGERG WGYDGVLLYA PHSAYGTPDD
FKAFIDAAHG YGLSVVLDIV LNHFGPEGNY LPLLAPAFFH KERMTPWGNG IAYNVDAVRR
YIIEAPLYWL TEYHLDGLRF DAIDQIEDSS ARHVLVEIAQ RIREDITDRP IHLTTEDSRN
IISLHPRDQD GNAPLFTAEW NDDFHNAVHV FATGETLAYY NDFADAPEKH LARALAEGFA
YQGEISPQTG EPRGVKSTGQ PPVAFVDFIQ NHDQVGNRAQ GDRLITLAGA ERTKVLLATL
LLSPHIPLLF MGEEYGESRP FLFFTDFHGD LARAVREGRA KEFADHAGEN VPDPNAPETF
QRSKLNWKQQ HSEEGKAWLA FTRELLLLRQ KHIVPLLSAA RESSGTVLQT APGFIAVSWP
FPGGTLSLAL NISATTVLLP DLPGKTLFAW PNESTGSLSQ HSLIVRLAQG ESAS