Gene Information |
Locus tag | SeSA_A2915 |
Symbol | |
ID | 6519623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 2823496 |
End bp | 2824611 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642747945 |
Product | IroB |
Protein accession | YP_002115727 |
Protein GI | 194738018 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
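The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the source record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record: Start bp, End bp.
length = gene_length(2823496, 2824611)
print(length, protein_length(length))  # 1116 371
```

Both results agree with the Gene Length (1116 bp) and Protein Length (371 aa) fields.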
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0618266 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATTC TGTTTGTCGG TCCACCACTG TATGGACTGC TATACCCTGT GCTGTCCCTG GCGCAAGCGT TTCGTGTTAA TGGCCATGAA GTACTGATTG CAAGCGGTGG AAAATTTGCA CAGAAAGCAG CCGAAGCTGG GTTGGTGGTA TTTGACGCTG CGCCTGGTTT CGATTCGGAA GCAGGTTATC GCCGTCAGGA GGCATTGCGA AAAGAAAATA ACATTGGAAC AAAAATGGGG AACTTCTCAT TCTTCAGCGA AGAGATGACT GACCCACTGG TCGCGTTCGC CGGGCAGTGG CGGCCAGATC TCATCGTCTA CCCTCCCCTT GGGGTCGTTG GACCACTGAT TGCCGCTAAG TATGAAATTC CGGTAGTGAT GCAAACCGTC GGCTTCGGTC ATACGCCCTG GCACATCAAA GGCGTGACGA AATCACTTTC TAACGCCTAC CGCCGCCATG GGGTCAGCGC GCCACCAAGA GATCTGGCGT GGATAGACGT CACACCGCCC AGCATGAGCA TACTGCAAAA TGACGGAGAG CCGGTTATCT CCATGCAATA CGTCCCGTAT AACGGCGGCG CCGTCTGGGA AGAATGGTGG GAACGTACAC CTGATCGCAA GCGTCTTCTG GTCAGCCTCG GCACCGTCAA ACCGATGGTG GATGGCCTGG ATCTGATTTC CTGGGTAATG GATTCTGCCG GCGAAGTAGA TGCCGAAATC ATCCTGCATC TTCCGGCAAA CGCCCGCTCG GATTTACGTT CTCTGCCGCC GAATGTCCGT CTGGTCGACT GGCTTCCGAT GGGCGTTTTC CTTAACGGCG CCGACGGTTT TATCCACCAT GGCGGCGCAG GCAACACCCT GACGGCGCTG CATGCCGGCA TTCCGCAGAT AGTCTTTGGC CAGGGTGCCG ACAGACCCGT CAATGCCCGC GCCGTGGTCG AGCGCGGATG CGGCATTATT CCCGGTAAGA GCGGGCTTAC CAGCAGCATG ATCAATACCT TCCTCGGTAA TCGCGCGCTT CGCGAGGCGT CGCAGGAGGT CGCGGCGGAA ATGGCGGCCC AGCCTTGCCC GACCGAGGTG GCAAAAAAAC TGATCGCCAT GCTGCAACAC GGCTAA
|
Protein sequence | MRILFVGPPL YGLLYPVLSL AQAFRVNGHE VLIASGGKFA QKAAEAGLVV FDAAPGFDSE AGYRRQEALR KENNIGTKMG NFSFFSEEMT DPLVAFAGQW RPDLIVYPPL GVVGPLIAAK YEIPVVMQTV GFGHTPWHIK GVTKSLSNAY RRHGVSAPPR DLAWIDVTPP SMSILQNDGE PVISMQYVPY NGGAVWEEWW ERTPDRKRLL VSLGTVKPMV DGLDLISWVM DSAGEVDAEI ILHLPANARS DLRSLPPNVR LVDWLPMGVF LNGADGFIHH GGAGNTLTAL HAGIPQIVFG QGADRPVNAR AVVERGCGII PGKSGLTSSM INTFLGNRAL REASQEVAAE MAAQPCPTEV AKKLIAMLQH G
|
| |
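The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a hedged, dependency-free illustration: the helper names (clean, gc_content, translate) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons). Only the first 30 bases of the record are used here; the full sequence can be pasted in with its spaces intact.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; table 11 uses the same assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into 10-mers."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
prefix = "ATGCGTATTC TGTTTGTCGG TCCACCACTG"
print(translate(prefix))  # MRILFVGPPL
```

The translated prefix matches the first ten residues of the protein sequence given in the record.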