Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3997 |
Symbol | |
ID | 6484800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3884924 |
End bp | 3885934 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642739257 |
Product | lipopolysaccharide 1,2-glucosyltransferase |
Protein accession | YP_002042967 |
Protein GI | 194446313 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.321916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTCAT TTCCTGAGAT AGAAATAGCT GAATATAAAA TTTTTGATGA AAGTAATAAT AATGATGATA ACGTATTAAA CATTTCTTAT GGCGTTGATG AAAACTATCT TGATGGTGTG GGGGTGTCAA TTGCTTCAGT TGTACTAAAC AATAATATCC CGCTCGCTTT TCACATTATT TGTGATTCAT ACTCTCCGTG TTTTGTAAAA TATATAGAGC GTTTAGCCGT ACAGCATCAC ATAAAAATTT CTCTTTATCT TATTAAAGTA GAAAGCCTTG AGGTATTGCC TCAAACTAAA GTATGGTCGA GAGCAATGTA TTTTCGTTTA TTTGCTTTCG ATTATCTCAG CAAGAAGGTA AACACACTAC TTTATTTGGA TGCCGATGTT GTATGCAAAG GATCTTTGCA AGATCTTCTA CAGCTTGATT TGACAGAGAA GATTGCTGCG GTCGTAAAAG ATGTTGATTC CATCCAGAAT AAGGTAAATG AGAGATTAAG CGCTTTTAAT TTACAAGGTG GTTATTTTAA CTCCGGCGTG GTTTTTGTTA ACCTGAAATT ATGGAAAGAG AATGCCTTAA CTGAAAAGGC ATTTTTACTT CTGGCAGGTA AAGAAGCTGA CTCTTTTAAA TATCCCGATC AGGATGTTTT GAATATTCTC CTACAGGATA AAGTCATTTT TCTACCGCGA CCGTATAATA CTATTTATAC TATTAAAAGT GAGTTGAAAG ATAAGTCACA TAAAAAATAT AGCAATATAA TTAATGATAA TACTATTTTA ATTCATTATA CGGGCGCTAC AAAACCATGG CATGCCTGGG CAAATTATCC TTCAGTTATC TATTATAAAA ATGCACGACT GAACTCGCCC TGGAAAGATT TTCCCGCAAA AGATGCGCGT ACCATAGTCG AATTTAAGAA GCGATATAAA CATCTTCTCG TGCAAGGTCA TTATTTTAAA GGCCTTCTGG CTGGAAGCGC ATATCTTTAT CGTAAACTTT TCCACAAATA A
|
Protein sequence | MDSFPEIEIA EYKIFDESNN NDDNVLNISY GVDENYLDGV GVSIASVVLN NNIPLAFHII CDSYSPCFVK YIERLAVQHH IKISLYLIKV ESLEVLPQTK VWSRAMYFRL FAFDYLSKKV NTLLYLDADV VCKGSLQDLL QLDLTEKIAA VVKDVDSIQN KVNERLSAFN LQGGYFNSGV VFVNLKLWKE NALTEKAFLL LAGKEADSFK YPDQDVLNIL LQDKVIFLPR PYNTIYTIKS ELKDKSHKKY SNIINDNTIL IHYTGATKPW HAWANYPSVI YYKNARLNSP WKDFPAKDAR TIVEFKKRYK HLLVQGHYFK GLLAGSAYLY RKLFHK
|
| |