Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C2144 |
Symbol | otsA |
ID | 6489412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 2071647 |
End bp | 2073068 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642742340 |
Product | trehalose-6-phosphate synthase |
Protein accession | YP_002045983 |
Protein GI | 194449257 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0380] Trehalose-6-phosphate synthase |
TIGRFAM ID | [TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.800706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 3.96327e-25 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCGTT TAGTCGTAGT ATCTAATCGA ATTGCCCCCC CGGATAATAA AGGCGGCGCC GGCGGCCTCG CCGTTGGCGT GCTTGGCGCG CTAAAAGCGG CTGGCGGGTT GTGGTTCGGC TGGAGTGGCG AGACAGGTAA CGAGGATGAG CCATTAAAAA AGGTGACAAA AGGTAATATT ACCTGGGCAT CGTTTAACCT GAGTGAACAA GATTACGAAG ATTATTACTG TCAATTTTCC AATGCGGTTC TCTGGCCTGC GTTCCACTAT CGTCTGGACC TGGTACAGTT TCAGCGTCCT GCATGGGAAG GCTATATGCG GGTGAATGCG TTATTAGCGG ATAAGTTATT GCCCCTCATT AAAGAGAACG ACATCATTTG GGTGCATGAC TACCACCTGT TACCGTTCGC CAGCGAGCTG CGTAAACGCG GCGTGAACAA CCGAATTGGT TTTTTCCTGC ATATTCCATT CCCGACCCCG GAGATTTTTA ACGCTTTACC GCCGCATGAT GAACTGCTGG AGCAGTTGTG TGACTTTGAT CTGCTCGGGT TCCAGACCGA AAATGATCGC CTGGCTTTTC TGGATAGCCT TTCGAGTCAA ACGCGAGTCA CGACTCGCAG CGGCAAGCAG CATATTGCGT GGGGGAAAAA CTTCCAGACA GAAGTGTATC CTATCGGTAT TGAGCCCGAT GAGATTGCTC TGCAGGCTGC CGGGCCGTTG CCGCCTAAAC TGGCGCAGCT CAAGGCGGAA CTGAAAAATG TGAAGAATAT TTTTTCCGTT GAGCGGCTGG ATTATTCGAA AGGGTTGCCG GAACGCTTTC TGGCGTATGA AGCGCTACTG GAAAACTACC CGCAGCATCG GGGAAAAATT CGTTATACCC AAATTGCGCC TACGTCACGC GGCGAAGTAC AGGCATATCA GGATATTCGC CACCAGCTTG AGACGGAAGC AGGCCGGATT AATGGGAAAT ATGGACAATT GGGCTGGACG CCGCTCTATT ATCTGAATCA GCATTTCGAC CGTAAACTGT TAATGAAGAT ATTCCGTTAT TCAGACGTCG GGCTCGTCAC CCCGTTGCGT GACGGGATGA ACCTGGTGGC GAAAGAGTTT GTCGCCGCGC AGGATCCCGC TAACCCTGGC GTACTGGTAC TGTCACAGTT TGCCGGCGCG GCGAATGAAC TGACGTCGGC GTTAATCGTC AATCCTTACG ATCGGGATGA CGTGGCGGCG GCGCTCAATC GCGCGTTAAC GATGCCCCTT GCCGAGCGTA TTTCGCGCCA TGCGGAAATG CTGGACGTGA TCGTTAAAAA TGACATTAAC CGCTGGCAGG AGCGTTTTAT TCATGACCTA AAGGAGGTCA CGCCGCGTAG CCCTGAGCGT CAGCAGCAGA ACAACGTGGC GACGTTCCCT AAGCTGGCCT GA
|
Protein sequence | MSRLVVVSNR IAPPDNKGGA GGLAVGVLGA LKAAGGLWFG WSGETGNEDE PLKKVTKGNI TWASFNLSEQ DYEDYYCQFS NAVLWPAFHY RLDLVQFQRP AWEGYMRVNA LLADKLLPLI KENDIIWVHD YHLLPFASEL RKRGVNNRIG FFLHIPFPTP EIFNALPPHD ELLEQLCDFD LLGFQTENDR LAFLDSLSSQ TRVTTRSGKQ HIAWGKNFQT EVYPIGIEPD EIALQAAGPL PPKLAQLKAE LKNVKNIFSV ERLDYSKGLP ERFLAYEALL ENYPQHRGKI RYTQIAPTSR GEVQAYQDIR HQLETEAGRI NGKYGQLGWT PLYYLNQHFD RKLLMKIFRY SDVGLVTPLR DGMNLVAKEF VAAQDPANPG VLVLSQFAGA ANELTSALIV NPYDRDDVAA ALNRALTMPL AERISRHAEM LDVIVKNDIN RWQERFIHDL KEVTPRSPER QQQNNVATFP KLA
|
| |