Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C0850 |
Symbol | |
ID | 6490277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 840131 |
End bp | 842080 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642741099 |
Product | putative glycosyl transferase |
Protein accession | YP_002044757 |
Protein GI | 194447526 |
COG category | [R] General function prediction only |
COG ID | [COG5610] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.996088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.00000136346 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATATGA ACTTTAAAAA ATACAAGACC GTAAGCTTTG ATATCTTTGA TACATTGGTT AGCCGGAGGA TTTACCGTCC CAGAGATTTG TTTTCATTAA TGCAATCAAC TTTAGCAACT GAGAAATTTT TTATATCAGC GTACGAGATT GGTATTATTG ATAATTTCCC AGAGATAAGA GTTCAGGCGG AAGTAAGTGC CAGAGAGAAT AGGGTCAGGC GTTTTGGCGG CGAGCCGGAA ATACTTATAT CTGAAATATA CGATGAAATT TTAAAAAAGC ATCCGCAGCT TTCACCAGCG ACAGTAAAAA AGATAATCGA TCTGGAAATA CAAATGGAGA AGATTGTTTT ATATAAAAAT GCGCGTGGAA GCTGTTTGTT TGAAAAGGCT ATTAGTGATG GTTGCAAAGT CATTTTAATT AGTGACATGT ACCTTCCATC AGCAATATTA AAGGAGTTGT TAACATCATG TGGCTATGAT ATCAGTAACA TTCCAGTTTA TTCATCTGGC GAAGAGCGGT ACTCTAAAAA TAGTGGTAAA TTATTTTCAA TAGTCAAGAA AAATGAAAAT GTAGATATTG CATCGTGGAT GCATGTTGGC GACAATGTTC ATGCTGATAT TCTGAATGCT AAAAAACTCG GCATAAATAC TCTCCATGCT GATTGGTCAG AGTATAATCA TGGGATATCT AATCATTGGA AAGCTAAAGA TATTATTGGT GAATCTATTT GTAAGACTTT ATTACTTAAA CAAGTTTCTG CTTTCCATCA AAATGATCCT TTAAACGAGA TAGGATTTAA AGTATTTGGT CCGTTATTAT TAGGTTATGT ATCCTGGTTA GCGAATCAGT TAAAGATTCA TAAAATTGAT AAAGCGCTTT TTTTAGCACG CGATGCTCAC TTAATCTATA AAATTTATAA TGAATACTTT TCAGAAGAAC ATGTAAAATG TGAATATTTA TATATATCCC GCGCATCAGC TTATATGGTG GGGATGACTG ATTGGCCGAT GCACAGGATT TGGCATCTTT TTGGTGGTAA GAATAAGAAA AGTATTAAAA AGATACTTGC TATCGCGGGG TTAGATGCGA GTGAGCATAT TTCAGATATA CATCATGTTG GTTTTCCTGA CGAGGAGTAT ATTCCTGTTT CAGGAGAGGA ACATAAGGTT CACTGGCTTA TAAATAAATT ATTTCCATAT ATTTTATTAA AAAATACTCA GCACAGGGAA GTTTACGCTG ATTACTTTAA AACGGCCTGT GAAGGTTATA AAAATATAGC ACTTATCGAT GTAGGATGGA TGGGTAATAT TCAATCAGTA TTTGCTCGTT CTTTAGGTGC GCAATGGGCA GAAAAACAAA TACATGGGTT TTATTTGGCA ACTTTTGCTG GCGCCAATGA TAACCGATCT ATTTATAATA AGATGTTTGG TTGGCTAACC AACTATGGCC ATCCCAACGA TAAGTGTGAT CTTTTCTTAT CAGGAGGGGT GGAAATAATG GAGTTCGCTA TGGCTGACAA TACTGGGTCA ACAATTGGCT ATAAAAAAAC GGATAATGGA ATAATTCCTG TACGTGAAGA TAGCAGTGGT TCTGAAATTG AGTATTTAAA AAAAGCAGCA AGATTGCAAT CAGGGATTAT TTCTTTTTTT GAGTACGTCA AACCGCTCAT ACAAAAAGGA AATTATGCAG CACTTAGTAG TGTTGTATTG TCAGAACCTT TTTTTGAATT GATAGCCAGA CCCTCAAGCG CTCAACTGGA CGCCTTATCT TCCCTCACAC ATTCAGAGTC CGCGGGATCT AACGCAGAAA GAATCGTGCT AGCCAAGAAA CTGCCTTTAA AGGATAAACT TTTTCCCGGA GAAAATTATA TCAAAGAGTT GAATGCCAGT TATTGGAAAG AAGGCTTTAA AAGGATCAAC AGAAAAAAAT TTTGGGCAAA ATATAACTAA
|
Protein sequence | MDMNFKKYKT VSFDIFDTLV SRRIYRPRDL FSLMQSTLAT EKFFISAYEI GIIDNFPEIR VQAEVSAREN RVRRFGGEPE ILISEIYDEI LKKHPQLSPA TVKKIIDLEI QMEKIVLYKN ARGSCLFEKA ISDGCKVILI SDMYLPSAIL KELLTSCGYD ISNIPVYSSG EERYSKNSGK LFSIVKKNEN VDIASWMHVG DNVHADILNA KKLGINTLHA DWSEYNHGIS NHWKAKDIIG ESICKTLLLK QVSAFHQNDP LNEIGFKVFG PLLLGYVSWL ANQLKIHKID KALFLARDAH LIYKIYNEYF SEEHVKCEYL YISRASAYMV GMTDWPMHRI WHLFGGKNKK SIKKILAIAG LDASEHISDI HHVGFPDEEY IPVSGEEHKV HWLINKLFPY ILLKNTQHRE VYADYFKTAC EGYKNIALID VGWMGNIQSV FARSLGAQWA EKQIHGFYLA TFAGANDNRS IYNKMFGWLT NYGHPNDKCD LFLSGGVEIM EFAMADNTGS TIGYKKTDNG IIPVREDSSG SEIEYLKKAA RLQSGIISFF EYVKPLIQKG NYAALSSVVL SEPFFELIAR PSSAQLDALS SLTHSESAGS NAERIVLAKK LPLKDKLFPG ENYIKELNAS YWKEGFKRIN RKKFWAKYN
|
| |