Gene SNSL254_A4800 details


Gene Information

Locus tag: SNSL254_A4800
Symbol: treC
ID: 6483968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4674183
End bp: 4675835
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 55%
IMG OID: 642740014
Product: trehalose-6-phosphate hydrolase
Protein accession: YP_002043692
Protein GI: 194443467
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02403] alpha,alpha-phosphotrehalase
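
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4674183
END_BP = 4675835


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1653 550"
```

Under those assumptions the result agrees with the listed Gene Length (1653 bp) and Protein Length (550 aa).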


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.235757
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 96
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATTC CCCTCTGGTG GCAAAACGGC GTTATTTATC AGATCTACCC GAAAAGTTTT 
CAGGACACGA CCGGCAGCGG CACTGGCGAT TTACGCGGCG TCACGCAGCG CCTTGACTAT
CTACAGCGAC TCGGCGTAGA TGCCATCTGG CTCACGCCGT TTTATATCTC GCCGCAGGTG
GATAACGGTT ATGACGTCGC AAATTATACG GCTATCGACC CGACCTACGG CACGCTGGAT
GATTTTGACG AGCTGGTCGC GCAGGCGAAA GCACGCGGTA TTCGTATCAT CCTAGATATG
GTGTTTAACC ATACTTCTAC CCAGCACGCC TGGTTTCGCG AAGCGCTGAA CAAAGAGAGT
CCATACCGTC AGTTTTATAT CTGGCGCGAT GGCACGCCGG ATGTCTGCCC GAATAACTGG
CAATCCAAAT TTGGCGGCAG CGCCTGGCGC TGGCATAGCC AGAGCGAACA ATATTATTTA
CACCTTTTCG CGCCTGAACA GGCCGACCTG AACTGGGAAA ACCCCGCCGT GCGCGCCGAG
CTGAAAAAGG TCTGCGAATT CTGGGCCGAT CGCGGCGTGG ATGGCCTGCG TCTGGACGTG
GTGAACCTGA TCGCCAAAGA TCAGAATTTT CCTGACGATC CGACGGGCGA CGGACGCCGC
TTTTATACCG ACGGGCCGCG CGCGCATACG TTTTTACGCG AAATGAACCG TGACGTTTTT
ACGCCGCGTA ATCTGATGAC GGTAGGCGAA ATGTCCTCTA CCACGCTGGA AAACTGCCAG
CAATATGCCG CGCTTAGCGG CGACGAACTC TCGATGACCT TCAATTTTCA TCATCTGAAG
GTGGATTACC CCAATGGCGA AAAGTGGACG CTGGCAAAAC CTGATTATGT GGCGCTGAAA
GCGTTGTTCC GCCACTGGCA ACAGGGGATG CATAACGTCG CCTGGAACGC GTTATTCTGG
TGTAATCACG ATCAGCCGCG CATCGTTTCC CGCTTTGGCG ACGAGGGTGA ATACCGGGTT
CCAGCCGCGA AAATGTTGGC GATGGCGCTG CACGGTATGC AAGGGACGCC TTATATCTAT
CAGGGAGAAG AAATCGGCAT GACCAACCCA CACTTTACCC GCATCACCGA TTATCGCGAC
GTGGAAAGTC ATAATATGTT TGCCGCGCTA CGCGCCGCCG GGCGCGACCC CGACGAACTG
CTGGCTATCC TGGCCAGTAA ATCCCGCGAC AACAGCCGCA CGCCGATGCA GTGGGACAAC
GGTAAAAACG CCGGTTTCAC CCAGGGCGAG CCGTGGATAA ACCTGTGCGA TAACTATGCG
GAGGTTAACG TCGCGGCAGC ATTACGCGAT GAAAACTCGG TGTTTTACAC CTATCAAAAG
CTGATTGCGC TACGTAAAAC CCAGCCTGTA CTGATCTGGG GCGATTATCA GGATCTCCTC
CCGGATAGCC CATCAGTATG GTGTTATCGC CGCCAGTGGC AGGGGCAAAT CCTGCTGGTT
GTCGCCAATC TGAGTAACCA GTGTCAGGAG TGGCATCCAC CGCATATCAA AGGACAGTGG
CAGGCGCTAC TGCACAATTA TGGCGAGGTC ACCAGCCAGC CAGCCGCGAT GACGCTCCGC
CCATTTGAAG CCATCTGGTG GTTACAACGC TAA
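
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be checked against the Gene Information fields. The following is a minimal sketch with hypothetical helper names (`clean_sequence`, `gc_fraction`); only the first line of the sequence is used here as sample input, so the printed percentage describes that fragment, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGACTATTC CCCTCTGGTG GCAAAACGGC GTTATTTATC AGATCTACCC GAAAAGTTTT"
seq = clean_sequence(first_line)
print(len(seq))                          # number of bases in the fragment
print(f"{100 * gc_fraction(seq):.1f}%")  # GC content of the fragment only
```

Passing the full sequence text to the same two functions would give values comparable to the Gene Length and GC content fields.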
 
Protein sequence
MTIPLWWQNG VIYQIYPKSF QDTTGSGTGD LRGVTQRLDY LQRLGVDAIW LTPFYISPQV 
DNGYDVANYT AIDPTYGTLD DFDELVAQAK ARGIRIILDM VFNHTSTQHA WFREALNKES
PYRQFYIWRD GTPDVCPNNW QSKFGGSAWR WHSQSEQYYL HLFAPEQADL NWENPAVRAE
LKKVCEFWAD RGVDGLRLDV VNLIAKDQNF PDDPTGDGRR FYTDGPRAHT FLREMNRDVF
TPRNLMTVGE MSSTTLENCQ QYAALSGDEL SMTFNFHHLK VDYPNGEKWT LAKPDYVALK
ALFRHWQQGM HNVAWNALFW CNHDQPRIVS RFGDEGEYRV PAAKMLAMAL HGMQGTPYIY
QGEEIGMTNP HFTRITDYRD VESHNMFAAL RAAGRDPDEL LAILASKSRD NSRTPMQWDN
GKNAGFTQGE PWINLCDNYA EVNVAAALRD ENSVFYTYQK LIALRKTQPV LIWGDYQDLL
PDSPSVWCYR RQWQGQILLV VANLSNQCQE WHPPHIKGQW QALLHNYGEV TSQPAAMTLR
PFEAIWWLQR
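
The protein sequence can be reproduced from the gene sequence by codon-wise translation. Translation table 11 (bacterial, archaeal and plant plastid code) uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts, so the sketch below builds the standard table. The `translate` function is illustrative, and it assumes the input is an in-frame coding sequence read from its first base.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G
# (first base slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied as displayed.
first_line = "ATGACTATTC CCCTCTGGTG GCAAAACGGC GTTATTTATC AGATCTACCC GAAAAGTTTT"
print(translate(first_line))  # prints "MTIPLWWQNGVIYQIYPKSF"
```

The output matches the first 20 residues of the protein sequence listed above.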