Gene SeD_A1316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1316 
SymbolotsA 
ID6875327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1293700 
End bp1295121 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content53% 
IMG OID642784484 
Producttrehalose-6-phosphate synthase 
Protein accessionYP_002215154 
Protein GI198246060 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.899957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.5238599999999998e-21 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCGTT TAGTCGTAGT ATCTAATCGA ATTGCCCCCC CGGATAATAA AGGCGGCGCC 
GGCGGCCTCG CCGTTGGCGT GCTTGGCGCG CTAAAAGCGG CTGGCGGGTT GTGGTTCGGC
TGGAGTGGCG AGACAGGTAA CGAGGATGAG CCATTAAAAA AGGTGACAAA AGGTAATATT
ACCTGGGCAT CGTTTAACCT GAGCGAACAA GATTACGAAG ATTATTACTG TCAATTTTCC
AATGCGGTTC TCTGGCCTGC GTTCCACTAT CGTCTGGACT TGGTACAGTT TCAGCGTCCT
GCATGGGAAG GCTATATGCG GGTGAATGCG TTATTAGCGG ATAAGTTATT GCCCCTCATT
AAAGAGAACG ACATCATTTG GGTGCATGAC TACCACCTGT TACCGTTCGC CAGCGAGCTG
CGTAAACGCG GCGTGAACAA CCGAATTGGT TTTTTCCTGC ATATTCCATT CCCGACCCCG
GAGATTTTTA ACGCTTTACC GCCGCATGAT GAACTGCTGG AGCAGTTGTG TGACTTTGAT
CTGCTAGGGT TCCAGACCGA AAATGATCGC CTGGCTTTTC TGGATAGCCT TTCGAGTCAA
ACGCGAGTCA CGACTCGCAG CGGCAAGCAG CATATCGCGT GGGGTAAAGA CTTCCAGACA
GAAGTGTATC CCATCGGTAT TGAGCCCGAT GAGATTGCTC TGCAGGCTGC CGGGCCGTTG
CCGCCTAAAC TGGCGCAGCT CAAGGCGGAA CTGAAAAATG TGAAGAATAT TTTTTCCGTT
GAGCGGCTGG ATTATTCGAA AGGGCTGCCG GAACGTTTTC TGGCGTATGA AGCGCTACTG
GAAAACTACC CGCAGCATCG GGGAAAAATT CGTTATACCC AAATTGCGCC TACGTCACGC
GGCGAAGTAC AGGCATATCA GGATATTCGC CACCAGCTTG AGACGGAAGC AGGCCGGATT
AATGGGAAAT ATGGACAATT GGGCTGGACG CCGCTCTATT ATCTGAATCA GCATTTCGAC
CGTAAACTGT TAATGAAGAT ATTCCGTTAT TCAGACGTCG GGCTCGTCAC CCCGTTGCGT
GACGGGATGA ACCTGGTGGC GAAAGAGTTT GTCGCCGCGC AGGACCCCGC TAACCCTGGC
GTACTGGTAC TGTCACAGTT TGCCGGCGCG GCGAATGAAC TGACGTCGGC GTTAATCGTC
AATCCTTACG ATCGGGATGA CGTGGCGGCG GCGCTCAATC GTGCGCTAAC GATGCCCCTT
GCCGAGCGTA TTTCGCGTCA TGCGGAGATG CTGGACGTGA TCGTTAAAAA TGACATTAAC
CGCTGGCAGG AGCGTTTTAT TCATGACCTA AAGGAGGTCA CGCCGCGTAG CCCTGAGCGT
CAGCAGCAGA ACAACGTGGC GACGTTCCCT AAGCTGGCCT GA
 
Protein sequence
MSRLVVVSNR IAPPDNKGGA GGLAVGVLGA LKAAGGLWFG WSGETGNEDE PLKKVTKGNI 
TWASFNLSEQ DYEDYYCQFS NAVLWPAFHY RLDLVQFQRP AWEGYMRVNA LLADKLLPLI
KENDIIWVHD YHLLPFASEL RKRGVNNRIG FFLHIPFPTP EIFNALPPHD ELLEQLCDFD
LLGFQTENDR LAFLDSLSSQ TRVTTRSGKQ HIAWGKDFQT EVYPIGIEPD EIALQAAGPL
PPKLAQLKAE LKNVKNIFSV ERLDYSKGLP ERFLAYEALL ENYPQHRGKI RYTQIAPTSR
GEVQAYQDIR HQLETEAGRI NGKYGQLGWT PLYYLNQHFD RKLLMKIFRY SDVGLVTPLR
DGMNLVAKEF VAAQDPANPG VLVLSQFAGA ANELTSALIV NPYDRDDVAA ALNRALTMPL
AERISRHAEM LDVIVKNDIN RWQERFIHDL KEVTPRSPER QQQNNVATFP KLA