Gene Information |
Locus tag | SNSL254_A4003 |
Symbol | |
ID | 6486825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3891011 |
End bp | 3892081 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642739263 |
Product | lipopolysaccharide core biosynthesis protein |
Protein accession | YP_002042973 |
Protein GI | 194443756 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | [TIGR02201] lipopolysaccharide heptosyltransferase III, putative |
|
|
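The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table above.
start_bp = 3891011
end_bp = 3892081
gene_length_bp = 1071
protein_length_aa = 356


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Both checks pass for this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 3892081 - 3891011 + 1 = 1071 bp, and 1071 / 3 - 1 = 356 aa, in agreement with the table.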
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.043966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAAAGC CATTTCGAAG AATTCTGATT ATAAAAATGC GTTTTCATGG AGACATGTTA CTTACCACTC CTGTCATCAG CACGCTGAAG CAGAATTATC CCGATGCAAA AATCGATGTG CTTCTTTACC AGAACACCAT ACCGATATTG TCGGAAAATC CGGAGATTAA TGCACTCTAC GGCATCAGTA ACAAAGGCGC AGGAACAAAA GAGAAGATCA AAAACGCGCT ATCGCTAATC AAAAAATTGC GTGCTAACTC GTATGATCTG GTCGTCAATC TCACCGATCA GTGGAGCGTG GCGCTTATTG TGCGTTTTTT AAACGCAAAA ATAAAAATCT CGCAGGATTT CGGTAATCGT CAGTCTGCTT TATGGAAAAA AAGCTTTACG CATTTAGTAC CTTATGCGGG AGAACATGCT GTTGATCGCA CATTATCCGC GCTAAAGCCG TTAGCGCTAA AACAGTATGT CACGGAAACC ACCATGAGCT ATCGCCCGGA ACATTGGGAA AACATGCGAC AACAGCTCAA ACAACTTGGT GTGACCCGAC AGTATGTTGT TATTCAGCCT ACGGCACGGC AGCTATTTAA GTGCTGGGAT AATGATAAAT TCTCGCAGGT TATTGATGCC GTGCAGCGCC GCGGTTATCA AGTTGTACTG ACGTCAGGCC CGGCCGCAGA CGAGATGGCC TGCGTGGATG CTATTGCACG CGGCTGTGAA ACAAAACCGG TTACTGGTCT GGCGGGTAAA ACTCGCTTTC CTGAATTGGG CGCGTTGATT GACCATGCGG ATCTCTTTAT CGGCGTTGAT TCGGCCCCTG GTCATATTGC CGCAGCAGTC AAAACGCCGG TCATTTGCCT TTTCGGCGCG ACGGATCATG TGTTCTGGCG TCCCTGGACC GACGACATCA TTCAGTTTTG GGCAGGAAAT TATCAGCCGA TGCCTGAACG GCATGAGCTT GATCGTAATA AAAAATACCT TTCCGTGATT CCGGCTGAAG ATGTTATCGC CGCGACGGAA AAAATGCTGC CGCATGAGGC ACGCGCCATA GATTTGGACA GCCTGCTATG A
|
Protein sequence | MEKPFRRILI IKMRFHGDML LTTPVISTLK QNYPDAKIDV LLYQNTIPIL SENPEINALY GISNKGAGTK EKIKNALSLI KKLRANSYDL VVNLTDQWSV ALIVRFLNAK IKISQDFGNR QSALWKKSFT HLVPYAGEHA VDRTLSALKP LALKQYVTET TMSYRPEHWE NMRQQLKQLG VTRQYVVIQP TARQLFKCWD NDKFSQVIDA VQRRGYQVVL TSGPAADEMA CVDAIARGCE TKPVTGLAGK TRFPELGALI DHADLFIGVD SAPGHIAAAV KTPVICLFGA TDHVFWRPWT DDIIQFWAGN YQPMPERHEL DRNKKYLSVI PAEDVIAATE KMLPHEARAI DLDSLL
|
| |