Gene Information |
Locus tag | SeD_A4104 |
Symbol | |
ID | 6875589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3952185 |
End bp | 3953198 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642787051 |
Product | lipopolysaccharide 1,2-glucosyltransferase |
Protein accession | YP_002217678 |
Protein GI | 198242856 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
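The length fields in this table can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the reported Gene Length) and that the final codon of an unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3952185
END_BP = 3953198


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1014 337
```

Both values match the Gene Length (1014 bp) and Protein Length (337 aa) rows.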
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.049978 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTCAT TTCCTGAGAT AGAAATAGCT GAATATAAAG TTTTTGATGA AAGTAATAAT AATAATGATG ATAACGTATT AAACATTTCT TATGGTGTTG ATGAAAACTA TCTTGATGGT GTGGGGGTAT CAATCGCTTC AGTTGTATTA AACAATAATA TCCCGCTCGC TTTTCACATT ATTTGTGATT CATACTCCCC GTGTTTTGTA AAATATATAG AGCGTTTAGC CGTACAGCAT CACATAAAAA TTTCTCTTTA TCTTATTAAA GTAGAAAGCC TTGAGGTATT GCCTCAAACT AAAGTATGGT CGAGAGCAAT GTATTTTCGT TTATTTGCTT TCGATTATCT CAGCAAGAAG GTAAATACCT TACTTTATTT GGATGCCGAT GTTGTATGCA AAGGATCTTT GCAAGATCTT CTACGGCTTG ATTTGACAGA GAAGATTGCT GCGGTCGTAA AAGATGTTGA TTCCATCCAG AATAAGGTAA ATGAGAGATT AAGCGCTTTT AATTTACAAG GTGGTTATTT TAACTCCGGC GTGGTTTTTG TTAACCTGAA ATTATGGAAA GAGAATGCCT TAACCAAAAA GGCATTTTTA CTTTTGGCAG GTAAAGAGGC TGACTCTTTT AAATATCCCG ATCAGGATGT TTTGAATATT CTCCTACAGG ATAAAGTCAT TTTTCTACCG CGACCATATA ATACCATTTA TACTATCAAA AGTGAGTTGA AAGATAAGTC ACATAAAAAA TATAGCAATA TAATTAATGA TAATACTGTT TTAATTCATT ATACGGGCGC TACAAAACCA TGGCATGCCT GGGCAAATTA TCCTTCAGTT ATCTATTATA AAAATGCACG ACTGAACTCG CCCTGGAAAG ATTCTCCTGC AAAAGATGCG CGTACCATAG TCGAATTTAA GAAGCGATAT AAACATCTTC TCGTGCAGGG TCATTATTTT AAAGGCCTTA TGGCTGGAAG CGCATATCTT TATCGTAAAC TTTTCCACAA ATAA
|
Protein sequence | MDSFPEIEIA EYKVFDESNN NNDDNVLNIS YGVDENYLDG VGVSIASVVL NNNIPLAFHI ICDSYSPCFV KYIERLAVQH HIKISLYLIK VESLEVLPQT KVWSRAMYFR LFAFDYLSKK VNTLLYLDAD VVCKGSLQDL LRLDLTEKIA AVVKDVDSIQ NKVNERLSAF NLQGGYFNSG VVFVNLKLWK ENALTKKAFL LLAGKEADSF KYPDQDVLNI LLQDKVIFLP RPYNTIYTIK SELKDKSHKK YSNIINDNTV LIHYTGATKP WHAWANYPSV IYYKNARLNS PWKDSPAKDA RTIVEFKKRY KHLLVQGHYF KGLMAGSAYL YRKLFHK
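The sequence fields can be checked against the table with a few lines of code. This is a minimal sketch, not the database's own method: it assumes the Gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TAA, which is consistent with that), strips the spaces used to group bases in tens, and translates with the codon assignments of NCBI translation table 11. Table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, which does not matter here because the gene starts with ATG. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains anything other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the Gene sequence field above.
prefix = "ATGGATTCAT TTCCTGAGAT AGAAATAGCT"
print(translate(prefix))  # prints: MDSFPEIEIA
```

Passing the full Gene sequence field to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content rows.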