Gene SeHA_C0850 details

Gene Information

Locus tag: SeHA_C0850
Symbol:
ID: 6490277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 840131
End bp: 842080
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 36%
IMG OID: 642741099
Product: putative glycosyl transferase
Protein accession: YP_002044757
Protein GI: 194447526
COG category: [R] General function prediction only
COG ID: [COG5610] Predicted hydrolase (HAD superfamily)
TIGRFAM ID:
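
The coordinates and lengths listed above agree with each other, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(840131, 842080)
print(length)                  # 1950
print(protein_length(length))  # 649
```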


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.996088
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.00000136346
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATATGA ACTTTAAAAA ATACAAGACC GTAAGCTTTG ATATCTTTGA TACATTGGTT 
AGCCGGAGGA TTTACCGTCC CAGAGATTTG TTTTCATTAA TGCAATCAAC TTTAGCAACT
GAGAAATTTT TTATATCAGC GTACGAGATT GGTATTATTG ATAATTTCCC AGAGATAAGA
GTTCAGGCGG AAGTAAGTGC CAGAGAGAAT AGGGTCAGGC GTTTTGGCGG CGAGCCGGAA
ATACTTATAT CTGAAATATA CGATGAAATT TTAAAAAAGC ATCCGCAGCT TTCACCAGCG
ACAGTAAAAA AGATAATCGA TCTGGAAATA CAAATGGAGA AGATTGTTTT ATATAAAAAT
GCGCGTGGAA GCTGTTTGTT TGAAAAGGCT ATTAGTGATG GTTGCAAAGT CATTTTAATT
AGTGACATGT ACCTTCCATC AGCAATATTA AAGGAGTTGT TAACATCATG TGGCTATGAT
ATCAGTAACA TTCCAGTTTA TTCATCTGGC GAAGAGCGGT ACTCTAAAAA TAGTGGTAAA
TTATTTTCAA TAGTCAAGAA AAATGAAAAT GTAGATATTG CATCGTGGAT GCATGTTGGC
GACAATGTTC ATGCTGATAT TCTGAATGCT AAAAAACTCG GCATAAATAC TCTCCATGCT
GATTGGTCAG AGTATAATCA TGGGATATCT AATCATTGGA AAGCTAAAGA TATTATTGGT
GAATCTATTT GTAAGACTTT ATTACTTAAA CAAGTTTCTG CTTTCCATCA AAATGATCCT
TTAAACGAGA TAGGATTTAA AGTATTTGGT CCGTTATTAT TAGGTTATGT ATCCTGGTTA
GCGAATCAGT TAAAGATTCA TAAAATTGAT AAAGCGCTTT TTTTAGCACG CGATGCTCAC
TTAATCTATA AAATTTATAA TGAATACTTT TCAGAAGAAC ATGTAAAATG TGAATATTTA
TATATATCCC GCGCATCAGC TTATATGGTG GGGATGACTG ATTGGCCGAT GCACAGGATT
TGGCATCTTT TTGGTGGTAA GAATAAGAAA AGTATTAAAA AGATACTTGC TATCGCGGGG
TTAGATGCGA GTGAGCATAT TTCAGATATA CATCATGTTG GTTTTCCTGA CGAGGAGTAT
ATTCCTGTTT CAGGAGAGGA ACATAAGGTT CACTGGCTTA TAAATAAATT ATTTCCATAT
ATTTTATTAA AAAATACTCA GCACAGGGAA GTTTACGCTG ATTACTTTAA AACGGCCTGT
GAAGGTTATA AAAATATAGC ACTTATCGAT GTAGGATGGA TGGGTAATAT TCAATCAGTA
TTTGCTCGTT CTTTAGGTGC GCAATGGGCA GAAAAACAAA TACATGGGTT TTATTTGGCA
ACTTTTGCTG GCGCCAATGA TAACCGATCT ATTTATAATA AGATGTTTGG TTGGCTAACC
AACTATGGCC ATCCCAACGA TAAGTGTGAT CTTTTCTTAT CAGGAGGGGT GGAAATAATG
GAGTTCGCTA TGGCTGACAA TACTGGGTCA ACAATTGGCT ATAAAAAAAC GGATAATGGA
ATAATTCCTG TACGTGAAGA TAGCAGTGGT TCTGAAATTG AGTATTTAAA AAAAGCAGCA
AGATTGCAAT CAGGGATTAT TTCTTTTTTT GAGTACGTCA AACCGCTCAT ACAAAAAGGA
AATTATGCAG CACTTAGTAG TGTTGTATTG TCAGAACCTT TTTTTGAATT GATAGCCAGA
CCCTCAAGCG CTCAACTGGA CGCCTTATCT TCCCTCACAC ATTCAGAGTC CGCGGGATCT
AACGCAGAAA GAATCGTGCT AGCCAAGAAA CTGCCTTTAA AGGATAAACT TTTTCCCGGA
GAAAATTATA TCAAAGAGTT GAATGCCAGT TATTGGAAAG AAGGCTTTAA AAGGATCAAC
AGAAAAAAAT TTTGGGCAAA ATATAACTAA
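
The gene sequence is printed in space-separated blocks of ten bases. A minimal sketch of joining such a listing into one string and computing its GC content follows. The helper names `join_blocks` and `gc_percent` are hypothetical, and the record does not say how its reported GC content was rounded, so the result may differ slightly from the listed value.

```python
def join_blocks(listing: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(listing.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sequence.count("G") + sequence.count("C")
    return 100.0 * gc / len(sequence)


# Usage on the first three blocks of the listing above.
fragment = join_blocks("ATGGATATGA ACTTTAAAAA ATACAAGACC")
print(len(fragment), round(gc_percent(fragment), 1))
```

Pasting the full listing into `join_blocks` would give the value for the whole gene.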
 
Protein sequence
MDMNFKKYKT VSFDIFDTLV SRRIYRPRDL FSLMQSTLAT EKFFISAYEI GIIDNFPEIR 
VQAEVSAREN RVRRFGGEPE ILISEIYDEI LKKHPQLSPA TVKKIIDLEI QMEKIVLYKN
ARGSCLFEKA ISDGCKVILI SDMYLPSAIL KELLTSCGYD ISNIPVYSSG EERYSKNSGK
LFSIVKKNEN VDIASWMHVG DNVHADILNA KKLGINTLHA DWSEYNHGIS NHWKAKDIIG
ESICKTLLLK QVSAFHQNDP LNEIGFKVFG PLLLGYVSWL ANQLKIHKID KALFLARDAH
LIYKIYNEYF SEEHVKCEYL YISRASAYMV GMTDWPMHRI WHLFGGKNKK SIKKILAIAG
LDASEHISDI HHVGFPDEEY IPVSGEEHKV HWLINKLFPY ILLKNTQHRE VYADYFKTAC
EGYKNIALID VGWMGNIQSV FARSLGAQWA EKQIHGFYLA TFAGANDNRS IYNKMFGWLT
NYGHPNDKCD LFLSGGVEIM EFAMADNTGS TIGYKKTDNG IIPVREDSSG SEIEYLKKAA
RLQSGIISFF EYVKPLIQKG NYAALSSVVL SEPFFELIAR PSSAQLDALS SLTHSESAGS
NAERIVLAKK LPLKDKLFPG ENYIKELNAS YWKEGFKRIN RKKFWAKYN
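
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled here, which does not affect this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate block-formatted input
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGATATGA ACTTTAAAAA ATACAAGACC"))  # MDMNFKKYKT
```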