Gene SeHA_C2144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2144 
SymbolotsA 
ID6489412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2071647 
End bp2073068 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content52% 
IMG OID642742340 
Producttrehalose-6-phosphate synthase 
Protein accessionYP_002045983 
Protein GI194449257 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.800706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value3.96327e-25 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCGTT TAGTCGTAGT ATCTAATCGA ATTGCCCCCC CGGATAATAA AGGCGGCGCC 
GGCGGCCTCG CCGTTGGCGT GCTTGGCGCG CTAAAAGCGG CTGGCGGGTT GTGGTTCGGC
TGGAGTGGCG AGACAGGTAA CGAGGATGAG CCATTAAAAA AGGTGACAAA AGGTAATATT
ACCTGGGCAT CGTTTAACCT GAGTGAACAA GATTACGAAG ATTATTACTG TCAATTTTCC
AATGCGGTTC TCTGGCCTGC GTTCCACTAT CGTCTGGACC TGGTACAGTT TCAGCGTCCT
GCATGGGAAG GCTATATGCG GGTGAATGCG TTATTAGCGG ATAAGTTATT GCCCCTCATT
AAAGAGAACG ACATCATTTG GGTGCATGAC TACCACCTGT TACCGTTCGC CAGCGAGCTG
CGTAAACGCG GCGTGAACAA CCGAATTGGT TTTTTCCTGC ATATTCCATT CCCGACCCCG
GAGATTTTTA ACGCTTTACC GCCGCATGAT GAACTGCTGG AGCAGTTGTG TGACTTTGAT
CTGCTCGGGT TCCAGACCGA AAATGATCGC CTGGCTTTTC TGGATAGCCT TTCGAGTCAA
ACGCGAGTCA CGACTCGCAG CGGCAAGCAG CATATTGCGT GGGGGAAAAA CTTCCAGACA
GAAGTGTATC CTATCGGTAT TGAGCCCGAT GAGATTGCTC TGCAGGCTGC CGGGCCGTTG
CCGCCTAAAC TGGCGCAGCT CAAGGCGGAA CTGAAAAATG TGAAGAATAT TTTTTCCGTT
GAGCGGCTGG ATTATTCGAA AGGGTTGCCG GAACGCTTTC TGGCGTATGA AGCGCTACTG
GAAAACTACC CGCAGCATCG GGGAAAAATT CGTTATACCC AAATTGCGCC TACGTCACGC
GGCGAAGTAC AGGCATATCA GGATATTCGC CACCAGCTTG AGACGGAAGC AGGCCGGATT
AATGGGAAAT ATGGACAATT GGGCTGGACG CCGCTCTATT ATCTGAATCA GCATTTCGAC
CGTAAACTGT TAATGAAGAT ATTCCGTTAT TCAGACGTCG GGCTCGTCAC CCCGTTGCGT
GACGGGATGA ACCTGGTGGC GAAAGAGTTT GTCGCCGCGC AGGATCCCGC TAACCCTGGC
GTACTGGTAC TGTCACAGTT TGCCGGCGCG GCGAATGAAC TGACGTCGGC GTTAATCGTC
AATCCTTACG ATCGGGATGA CGTGGCGGCG GCGCTCAATC GCGCGTTAAC GATGCCCCTT
GCCGAGCGTA TTTCGCGCCA TGCGGAAATG CTGGACGTGA TCGTTAAAAA TGACATTAAC
CGCTGGCAGG AGCGTTTTAT TCATGACCTA AAGGAGGTCA CGCCGCGTAG CCCTGAGCGT
CAGCAGCAGA ACAACGTGGC GACGTTCCCT AAGCTGGCCT GA
 
Protein sequence
MSRLVVVSNR IAPPDNKGGA GGLAVGVLGA LKAAGGLWFG WSGETGNEDE PLKKVTKGNI 
TWASFNLSEQ DYEDYYCQFS NAVLWPAFHY RLDLVQFQRP AWEGYMRVNA LLADKLLPLI
KENDIIWVHD YHLLPFASEL RKRGVNNRIG FFLHIPFPTP EIFNALPPHD ELLEQLCDFD
LLGFQTENDR LAFLDSLSSQ TRVTTRSGKQ HIAWGKNFQT EVYPIGIEPD EIALQAAGPL
PPKLAQLKAE LKNVKNIFSV ERLDYSKGLP ERFLAYEALL ENYPQHRGKI RYTQIAPTSR
GEVQAYQDIR HQLETEAGRI NGKYGQLGWT PLYYLNQHFD RKLLMKIFRY SDVGLVTPLR
DGMNLVAKEF VAAQDPANPG VLVLSQFAGA ANELTSALIV NPYDRDDVAA ALNRALTMPL
AERISRHAEM LDVIVKNDIN RWQERFIHDL KEVTPRSPER QQQNNVATFP KLA