Gene Information |
Locus tag | SeHA_C2949 |
Symbol | |
ID | 6489091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 2890884 |
End bp | 2891999 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642743107 |
Product | glycosyltransferase |
Protein accession | YP_002046731 |
Protein GI | 194447861 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
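The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical, chosen only for illustration.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
gene_length = inclusive_length(2890884, 2891999)
protein_length = protein_length_aa(gene_length)

print(gene_length)     # 1116
print(protein_length)  # 371
```

Both results agree with the Gene Length (1116 bp) and Protein Length (371 aa) fields.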
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1.63295e-18 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTATTC TGTTTGTCGG TCCACCACTG TATGGACTGC TATACCCTGT GCTGTCTCTG GCGCAAGCGT TTCGTGTTAA TGGCCATGAA GTACTGATTG CAAGCGGTGG CAAATTTGCA CAGAAAGCAG CCGAAGCTGG GTTGGTGGTA TTTGACGCTG CGCCTGGTTT CGATTCGGAA GCGGGTTATC GCCGTCAGGA GGCATTACGA AAAGAAAATA ACATTGGAAC AAAAATGGGG AACTTCTCAT TCTTCAGCGA AGAGATGACT GACCCGCTGG TCGCGTTCGC CGGGCAGTGG CGACCAGATC TCATCGTCTA CCCTCCCCTT GGGGTCGTTG GACCACTGAT TGCCGCTAAG TATGACATTC CGGTAGTGAT GCAAACCGTC GGCTTCGGTC ATACGCCCTG GCACATCAAA GGCGTGACGA AATCACTTTC TAACGCCTAC CGCCGCCATG GGGTCAGCGC GCCACCAAGA GATCTGGCGT GGATAGACGT CACACCGCCC AGCATGAGCA TACTGCAAAA TGACGGAGAG CCGGTTATCT CCATGCAATA CGTCCCGTAT AACGGCGGCG CCGTCTGGGA AGAATGGTGG GAACGTACAC CTGATCGCAA GCGTCTTCTG GTCAGCCTCG GCACCGTCAA ACCGATGGTG GATGGCCTGG ATCTGATTTC CTGGGTAATG GATTCTGCCG GCGAAGTAGA TGCCGAAATC ATCCTGCATC TTCCGGCAAA CGCCCGCTCG GATTTACGTT CACTGCCGCC GAATGTCCGT CTGGTCGACT GGCTTCCGAT GGGCGTTTTC CTTAACGGCG CCGACGGTTT TATCCACCAT GGCGGCGCAG GCAACACCCT GACGGCGCTG CATGCCGGCA TTCCGCAGAT AGTCTTTGGC CAGGGTGCCG ACAGACCCGT CAATGCCCGC GCCGTGGTCG AGCGCGGATG CGGCATTATT CCCGGTAAGA GCGGGCTTAC GAGCAGCATG ATTAATACCT TCCTCGGTAA TCGCGCGCTT CGCGAGGCGT CGCAGGAGGT CGCGGCGGAA ATGGCGGCCC AGCCTTGCCC GACCGAGGTG GCAAAAAAAC TGATCGCCAT GCTGCAACAC GGCTAA
|
Protein sequence | MRILFVGPPL YGLLYPVLSL AQAFRVNGHE VLIASGGKFA QKAAEAGLVV FDAAPGFDSE AGYRRQEALR KENNIGTKMG NFSFFSEEMT DPLVAFAGQW RPDLIVYPPL GVVGPLIAAK YDIPVVMQTV GFGHTPWHIK GVTKSLSNAY RRHGVSAPPR DLAWIDVTPP SMSILQNDGE PVISMQYVPY NGGAVWEEWW ERTPDRKRLL VSLGTVKPMV DGLDLISWVM DSAGEVDAEI ILHLPANARS DLRSLPPNVR LVDWLPMGVF LNGADGFIHH GGAGNTLTAL HAGIPQIVFG QGADRPVNAR AVVERGCGII PGKSGLTSSM INTFLGNRAL REASQEVAAE MAAQPCPTEV AKKLIAMLQH G
|
| |
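The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table in the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits, which this sketch does not model. The `translate` helper is a hypothetical name, and only short excerpts of the gene sequence are used here rather than the full 1116 bp.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
first_30 = "ATGCGTATTC TGTTTGTCGG TCCACCACTG"
print(translate(first_30))  # MRILFVGPPL

# Last 15 bases of the gene sequence, ending in the TAA stop codon.
last_15 = "CTGCAACACGGCTAA"
print(translate(last_15))  # LQHG
```

The two outputs match the first ten and last four residues of the protein sequence listed in the record.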