Gene Information |
Locus tag | SeAg_B3935 |
Symbol | |
ID | 6793829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 3831855 |
End bp | 3832865 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642778055 |
Product | lipopolysaccharide 1,2-glucosyltransferase |
Protein accession | YP_002148650 |
Protein GI | 197248675 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
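The length fields in the record above follow from the coordinates by simple arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3831855, End bp 3832865.
length = gene_length_bp(3831855, 3832865)
print(length, protein_length_aa(length))
```

With the record's coordinates this reproduces the listed Gene Length (1011 bp) and Protein Length (336 aa). The strand does not affect either value.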
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTCAT TTCCTGAGAT AGAAATAGCT GAATATAAAG TTTTTGATGA AAGTAATAAT AATGATGATA ACGTATTAAA CATTTCTTAT GGCGTTGATG AAAACTATCT TGATGGTGTG GGGGTATCAA TCGCTTCAGT TGTATTAAAC AATAATATCC CGCTCGCTTT TCACATTATT TGTGATTCAT ACTCCCCGTG TTTTGTAAAA TATATAGAGC GTTTAGCCGT ACAGCATCAC ATAAAAATTT CTCTTTATCT TATTAAAGTA GAAAGCCTTG AGGTATTGCC TCAAACTAAA GTATGGTCGA GAGCAATGTA TTTTCGTTTA TTTGCTTTCG ATTATCTCAG CAAGAAGGTA AACACCTTAC TTTATTTGGA TGCCGATGTT GTATGCAAAG GATCTTTGCA AGATCTTCTA CAGCTTGATC TGACAGAGAA GATTGCTGCG GTCGTAAAAG ATGTTGATTC CATCCAGAAT AAGGTAAATG AGAGATTAAG CGCTTTTAAT TTACAAGGTG GTTATTTTAA CTCCGGCGTG GTTTTTGTTA ACCTGAAATT ATGGCAAGAG AATGCCTTAA CCAAAAAGGC ATTTTTACTT TTGGCAGGTA AAGAGGCTGA CTCTTTTAAA TATCCCGATC AGGATGTTTT GAATATTCTC CTACAGGATA AAGTCATTTT TCTACCGCGA CCGTATAATA CCATTTATAC TATTAAAAGT GAGTTGAAAG ATAAGTCACA TAAAAAATAT AGCAATATAA TTAATGATAA TACTATTTTA ATTCATTATA CGGGTGCTAC AAAACCATGG CATGCCTGGG CAAATTATCC TTCAGTTATC TATTATAAAA ATGCACGACT TAACTCGCCC TGGAAGGATT TTCCCGCAAA AGATGCGCGT ACCATAGTCG AATTTAAGAA GCGATATAAA CATCTTCTCG TGCAGGGTCA TTATTTTAAA GGCCTTATGG CTGGAAGCGC ATATCTTTAT CGTAAACTTT TCCACAAATA A
|
Protein sequence | MDSFPEIEIA EYKVFDESNN NDDNVLNISY GVDENYLDGV GVSIASVVLN NNIPLAFHII CDSYSPCFVK YIERLAVQHH IKISLYLIKV ESLEVLPQTK VWSRAMYFRL FAFDYLSKKV NTLLYLDADV VCKGSLQDLL QLDLTEKIAA VVKDVDSIQN KVNERLSAFN LQGGYFNSGV VFVNLKLWQE NALTKKAFLL LAGKEADSFK YPDQDVLNIL LQDKVIFLPR PYNTIYTIKS ELKDKSHKKY SNIINDNTIL IHYTGATKPW HAWANYPSVI YYKNARLNSP WKDFPAKDAR TIVEFKKRYK HLLVQGHYFK GLMAGSAYLY RKLFHK
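The two sequence fields can be cross-checked against each other and against the GC content field. The following is a minimal sketch, assuming the gene sequence is given 5' to 3' in the coding orientation (it starts with ATG and ends with TAA) and that spaces are only display grouping. The helper names `translate` and `gc_content` are hypothetical. Translation table 11 assigns the same amino acid to each codon as the standard code and differs only in permitted start codons, so the standard codon layout is used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First six codons of the gene sequence plus a stop codon, grouped as in the record.
print(translate("ATGGATTCAT TTCCTGAGTA A"))
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after removing spaces) checks that the two are consistent. `gc_content` on the same input can be compared with the GC content field.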