Gene SeSA_A2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A2083 
SymbolotsA 
ID6518549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp2001187 
End bp2002608 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content52% 
IMG OID642747163 
Producttrehalose-6-phosphate synthase 
Protein accessionYP_002114964 
Protein GI194736914 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.388095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000038171 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCGTT TAGTCGTAGT ATCTAATCGA ATTGCCCCCC CGGATAATAA AGGCGGCGCC 
GGCGGCCTCG CCGTTGGCGT GCTTGGCGCG CTAAAAGCGG CTGGCGGGTT GTGGTTCGGC
TGGAGTGGCG AGACAGGTAA CGAGGATGAG CCATTAAAAA AGGTGACAAA AGGTAATATT
ACCTGGGCAT CGTTTAACCT GAGTGAACAA GATTACGAAG ATTATTACTG TCAATTTTCC
AATGCGGTTC TCTGGCCTGC GTTCCATTAT CGTCTGGACC TGGTACAGTT TCAGCGTCCT
GCATGGGAAG GCTATATGCG GGTGAATGCG TTATTAGCGG ATAAGTTATT GCCCCTCATT
AAAGAGAACG ACATCATTTG GGTGCATGAC TACCACCTGT TACCGTTCGC CAGCGAGCTG
CGTAAACGCG GCGTGAACAA CCGAATTGGT TTTTTCCTGC ATATTCCATT CCCGACCCCG
GAGATTTTTA ACGCTTTACC GCCGCATGAT GAACTGCTGG AGCAGTTGTG TGACTTTGAT
CTGCTCGGGT TCCAGACCGA AAATGATCGC CTGGCTTTTC TGGATAGCCT TTCGAGTCAA
ACGCGAGTCA CGACTCGCAG CGGCAAGCAG CATATGGCGT GGGGTAAAGA CTTCCAGACA
GAAGTGTATC CTATCGGTAT TGAGCCCGAT GAGATTGCTC TGCAGGCAGC CGGGCCGTTG
CCGCCTAAAC TGGCGCAGCT CAAGGCGGAA CTGAAAAATG TGAAGAATAT TTTTTCCGTT
GAGCGGCTGG ATTATTCGAA AGGGCTGCCG GAACGCTTTC TGGCGTATGA AGCGCTACTG
GAAAACTACC CGCAGCATCG GGGAAAAATT CGTTATACCC AAATTGCGCC TACGTCACGC
GGCGAAGTAC AGGCATATCA GGATATTCGC CACCAGCTTG AGACGGAAGC AGGCCGGATT
AATGGGAAAT ATGGACAATT GGGCTGGACG CCGCTCTATT ATCTGAATCA GCATTTTGAC
CGTAAACTGT TAATGAAGAT ATTCCGTTAT TCAGACGTCG GGCTCGTCAC CCCGTTGCGT
GACGGGATGA ACCTGGTGGC GAAAGAGTTT GTCGCCGCGC AGGACCCCGC TAATCCTGGC
GTACTGGTAC TGTCACAGTT TGCCGGCGCG GCGAATGAAC TGACGTCGGC GTTAATCGTC
AATCCTTACG ATCGGGATGA CGTGGCGGCG GCGCTCAATC GTGCGTTAAC GATGCCCCTT
GCCGAGCGTA TTTCGCGCCA TGCGGAAATG CTGGACGTGA TCGTTAAAAA TGACATTAAC
CGCTGGCAGG AGCGTTTTAT TCATGACCTA AAGGAGGTCA CGCCGCGTAG CCCTGAGCGT
CAGCAGCAGA ACAACGTGGC GACGTTCCCT AAGCTGGCCT GA
 
Protein sequence
MSRLVVVSNR IAPPDNKGGA GGLAVGVLGA LKAAGGLWFG WSGETGNEDE PLKKVTKGNI 
TWASFNLSEQ DYEDYYCQFS NAVLWPAFHY RLDLVQFQRP AWEGYMRVNA LLADKLLPLI
KENDIIWVHD YHLLPFASEL RKRGVNNRIG FFLHIPFPTP EIFNALPPHD ELLEQLCDFD
LLGFQTENDR LAFLDSLSSQ TRVTTRSGKQ HMAWGKDFQT EVYPIGIEPD EIALQAAGPL
PPKLAQLKAE LKNVKNIFSV ERLDYSKGLP ERFLAYEALL ENYPQHRGKI RYTQIAPTSR
GEVQAYQDIR HQLETEAGRI NGKYGQLGWT PLYYLNQHFD RKLLMKIFRY SDVGLVTPLR
DGMNLVAKEF VAAQDPANPG VLVLSQFAGA ANELTSALIV NPYDRDDVAA ALNRALTMPL
AERISRHAEM LDVIVKNDIN RWQERFIHDL KEVTPRSPER QQQNNVATFP KLA