Gene SNSL254_A2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2089 
SymbolotsA 
ID6482798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2028249 
End bp2029670 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content53% 
IMG OID642737445 
Producttrehalose-6-phosphate synthase 
Protein accessionYP_002041195 
Protein GI194446277 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.148883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value5.88528e-25 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCGTT TAGTCGTAGT ATCTAATCGA ATTGCCCCCC CGGATAATAA AGGCGGCGCC 
GGCGGCCTCG CCGTTGGCGT GCTTGGCGCG CTAAAAGCGG CTGGCGGGTT GTGGTTCGGC
TGGAGTGGCG AGACAGGTAA CGAGGATGAG CCATTAAAAA AGGTGACAAA AGGTAATATT
ACCTGGGCAT CGTTTAACCT GAGTGAACAA GATTACGAAG ATTATTACTG TCAATTTTCC
AATGCGGTTC TCTGGCCAGC GTTCCACTAT CGTCTGGACC TGGTACAGTT TCAGCGCCCT
GCATGGGAAG GCTATATGCG GGTGAATGCG CTATTAGCGG ATAAGTTATT GCCCCTCATT
AAAGAGAACG ACATCATTTG GGTGCATGAC TACCACCTGT TACCGTTCGC CAGCGAGCTG
CGTAAACGCG GCGTGAACAA CCGAATTGGT TTTTTCCTGC ATATTCCATT CCCGACCCCG
GAGATTTTTA ACGCTTTACC GCCGCATGAT GAACTGCTGG AGCAGTTGTG TGACTTTGAT
CTGCTCGGGT TCCAGACCGA AAATGATCGC CTGGCTTTTC TGGATAGCCT TTCGAGTCAA
ACGCGAGTCA CGACTCGCAG CGGCAAGCAG CATATCGCGT GGGGTAAAGA CTTCCAGACA
GAAGTGTATC CTATCGGTAT TGAGCCCGAT GAGATTGCTC TGCAGGCTGC CGGGCCGTTG
CCGCCTAAAC TGGCGCAGCT CAAGGCGGAA CTGAAAAATG TGAAGAATAT TTTTTCCGTT
GAGCGGCTGG ATTATTCGAA AGGGCTGCCG GAACGTTTTC TGGCGTATGA AGCGCTACTG
GAAAACTATC CGCAGCATCG GGGAAAAATT CGTTATACCC AAATTGCGCC TACGTCACGC
GGCGAAGTAC AGGCATATCA GGATATTCGC CACCAGCTTG AGACGGAAGC AGGCCGGATT
AATGGGAAAT ATGGACAATT GGGCTGGACG CCGCTCTATT ATCTGAATCA GCATTTCGAC
CGTAAACTGT TAATGAAGAT ATTCCGTTAT TCAGACGTCG GGCTCGTCAC CCCGTTGCGT
GACGGGATGA ACCTGGTGGC GAAAGAGTTT GTCGCCGCGC AGGACCCCGC TAATCCTGGC
GTACTGGTAC TGTCACAGTT TGCCGGCGCG GCGAATGAAC TGACGTCGGC GTTAATCGTC
AATCCTTACG ATCGGGATGA CGTGGCGGCG GCGCTCAATC GTGCGTTAAC GATGCCCCTT
GCCGAGCGTA TTTCGCGCCA TGCGGAAATG CTGGACGCGA TCGTTAAAAA TGACATTAAC
CGCTGGCAGG AACGTTTTAT TCATGACCTA AAGGAGGTCA CGCCGCGTAG CCCTGAGCGT
CAGCAGCAGA ACAACGTGGC GACGTTCCCT AAGCTGGCCT GA
 
Protein sequence
MSRLVVVSNR IAPPDNKGGA GGLAVGVLGA LKAAGGLWFG WSGETGNEDE PLKKVTKGNI 
TWASFNLSEQ DYEDYYCQFS NAVLWPAFHY RLDLVQFQRP AWEGYMRVNA LLADKLLPLI
KENDIIWVHD YHLLPFASEL RKRGVNNRIG FFLHIPFPTP EIFNALPPHD ELLEQLCDFD
LLGFQTENDR LAFLDSLSSQ TRVTTRSGKQ HIAWGKDFQT EVYPIGIEPD EIALQAAGPL
PPKLAQLKAE LKNVKNIFSV ERLDYSKGLP ERFLAYEALL ENYPQHRGKI RYTQIAPTSR
GEVQAYQDIR HQLETEAGRI NGKYGQLGWT PLYYLNQHFD RKLLMKIFRY SDVGLVTPLR
DGMNLVAKEF VAAQDPANPG VLVLSQFAGA ANELTSALIV NPYDRDDVAA ALNRALTMPL
AERISRHAEM LDAIVKNDIN RWQERFIHDL KEVTPRSPER QQQNNVATFP KLA