Gene SNSL254_A0786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0786 
Symbol 
ID6483596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp796333 
End bp798282 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content36% 
IMG OID642736198 
Productputative glycosyl transferase 
Protein accessionYP_002039964 
Protein GI194443619 
COG category[R] General function prediction only 
COG ID[COG5610] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.0373928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATGA ACTTTAAAAA ATACAAGACC GTAAGCTTTG ATATCTTTGA TACATTGGTT 
AGCCGGAGGG TTTACCGTCC CAGAGATTTG TTTTCATTGA TGCAATCAAC TTTAGCAACT
GAGAACTTTT TTATATCAGC GTGCGAGATT GATATTATTG ATAATTTTCC AGAGATAAGA
GTTCAGGCGG AAGTAAGTGC CAGAGAGAAT AGGGTCAGGC GTTTTGGCGG CGAGCCGGAA
GTACTCATAT CTGAAATATA CGATGAAATT TTAAAAAAGC ATCCGCAGCT TTCACCAGCG
ACAGTAGAAA AGATAATCGA TCTGGAAATA CAAATGGAGA AGATTGTTTT ATATAAAAAT
TCGCGTGGAA GCTGTTTGTT TGAAAAGGCT ATTAGTGATG GTTGCAAAGT TATTTTAATT
AGTGATATGT ACCTTCCATC AGCAATATTA AAGGAGTTGT TAACATCATG TGGCTATGAT
ATCAGTAACA TTCCAGTTTA TTCATCTGGT GAAGAGCGGC ACTCTAAAAA TAGTGGCAAG
TTGTTTTCAA TAGTCAAGAA AAATGAAAAT GTAGATATTG CATCGTGGAT GCATGTTGGC
GACAATGTTC ATGCAGATAT TCTGAACGCT AAAAAACTCG GTATAAATAC TCTCCATGCT
GATTGGTCAG AGTATAATCA TGGGGTATCT AATCATTGGA AAGCTAAAGA TATTATTGGT
GAATCTATTT GTAAGGCTTT ATTACTTAAA CAAGTTTCTG CTTTCCATCA AAATGATCCT
TTAAACGAGA TAGGATTTAA AGTATTTGGT CCGTTATTAT TAGGTTATGT ATCCTGGTTA
GCGAATCAGT TAAAGATTCA TAAAATTGAT AAAGCGCTTT TTTTAGCACG CGATGCTCAC
TTAATCTATA AAATTTATAA TGAATACTTT TCAGAAGAAC ATGTAAAATG TGAATATTTA
TATATATCCC GCGCATCAGC TTATATGGTG GGGATGACTG ATTGGCCGAT GCACAGGATT
TGGCATCTTT TTGGTGGTAA GAATAAGAAA AGTATTAAAA AGATACTTGC TATCGCGGGG
TTAGATGCGA GTGAGCATAT TTCAGATATA CATCATGTTG GTTTTCCTGA CGAGGAGTAT
ATTCCTGTTT CAGGAGAGGA ACATAAGGTT CACTGGCTTA TAAATAAATT ATTTCCATAT
ATTTTATTAA AAAATACTCA GCACAGGGAA GTTTACGCTG ATTACTTTAA AACGGCCTGT
GAAGGTTATA AAAATATAGC ACTTATCGAT GTAGGATGGA TGGGTAATAT TCAATCAGTA
TTTGCTCGTT CTTTAGGTGC GCAATGGGCA GAAAAACAAA TACATGGGTT TTATTTGGCA
ACTTTTGCTG GCGCCAATGA TAACCGATCT ATTTATAATA AGATGTTTGG TTGGCTAACC
AACTATGGCC ATCCCAACGA TAAGTGTGAT CTTTTCTTAT CAGGAGGGGT GGAAATAATG
GAGTTCGCTA TGGCTGACAA TACTGGGTCA ACAATTGGCT ATAAAAAAAC GGATAATGGA
ATAATTCCTG TACGTGAAGA TAGCAGTGGT TCTGAAATTG AGTATTTAAA AAAAGCAGCA
AGATTGCAAT CAGGGATTAT TTCTTTTTTT GAGTACGTCA AACCGCTCAT ACAAAAAGGA
AATTATGCAG CACTTAGTAG TGTTGTATTG TCAGAACCTT TTTTTGAATT GATAGCCAGA
CCCTCAAGCG TTCAACTGGA CGCCTTATCT TCCCTCACAC ATTCAGAGTC CGCGGGATCT
AACGCAGAAA GAATCGTGCT AGCCAAGAAA CTGCCTTTAA AGGATAAACT TTTTCCCGGA
GAAAATTATA TCAAAGAGTT GAATGCCAGT TATTGGAAAG AAGGCTTTAA AAGGATCAAC
AGAAAAAAAT TTTGGGCAAA ATATAACTAA
 
Protein sequence
MDMNFKKYKT VSFDIFDTLV SRRVYRPRDL FSLMQSTLAT ENFFISACEI DIIDNFPEIR 
VQAEVSAREN RVRRFGGEPE VLISEIYDEI LKKHPQLSPA TVEKIIDLEI QMEKIVLYKN
SRGSCLFEKA ISDGCKVILI SDMYLPSAIL KELLTSCGYD ISNIPVYSSG EERHSKNSGK
LFSIVKKNEN VDIASWMHVG DNVHADILNA KKLGINTLHA DWSEYNHGVS NHWKAKDIIG
ESICKALLLK QVSAFHQNDP LNEIGFKVFG PLLLGYVSWL ANQLKIHKID KALFLARDAH
LIYKIYNEYF SEEHVKCEYL YISRASAYMV GMTDWPMHRI WHLFGGKNKK SIKKILAIAG
LDASEHISDI HHVGFPDEEY IPVSGEEHKV HWLINKLFPY ILLKNTQHRE VYADYFKTAC
EGYKNIALID VGWMGNIQSV FARSLGAQWA EKQIHGFYLA TFAGANDNRS IYNKMFGWLT
NYGHPNDKCD LFLSGGVEIM EFAMADNTGS TIGYKKTDNG IIPVREDSSG SEIEYLKKAA
RLQSGIISFF EYVKPLIQKG NYAALSSVVL SEPFFELIAR PSSVQLDALS SLTHSESAGS
NAERIVLAKK LPLKDKLFPG ENYIKELNAS YWKEGFKRIN RKKFWAKYN