Gene SNSL254_A4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4003 
Symbol 
ID6486825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3891011 
End bp3892081 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content48% 
IMG OID642739263 
Productlipopolysaccharide core biosynthesis protein 
Protein accessionYP_002042973 
Protein GI194443756 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02201] lipopolysaccharide heptosyltransferase III, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.043966 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAAGC CATTTCGAAG AATTCTGATT ATAAAAATGC GTTTTCATGG AGACATGTTA 
CTTACCACTC CTGTCATCAG CACGCTGAAG CAGAATTATC CCGATGCAAA AATCGATGTG
CTTCTTTACC AGAACACCAT ACCGATATTG TCGGAAAATC CGGAGATTAA TGCACTCTAC
GGCATCAGTA ACAAAGGCGC AGGAACAAAA GAGAAGATCA AAAACGCGCT ATCGCTAATC
AAAAAATTGC GTGCTAACTC GTATGATCTG GTCGTCAATC TCACCGATCA GTGGAGCGTG
GCGCTTATTG TGCGTTTTTT AAACGCAAAA ATAAAAATCT CGCAGGATTT CGGTAATCGT
CAGTCTGCTT TATGGAAAAA AAGCTTTACG CATTTAGTAC CTTATGCGGG AGAACATGCT
GTTGATCGCA CATTATCCGC GCTAAAGCCG TTAGCGCTAA AACAGTATGT CACGGAAACC
ACCATGAGCT ATCGCCCGGA ACATTGGGAA AACATGCGAC AACAGCTCAA ACAACTTGGT
GTGACCCGAC AGTATGTTGT TATTCAGCCT ACGGCACGGC AGCTATTTAA GTGCTGGGAT
AATGATAAAT TCTCGCAGGT TATTGATGCC GTGCAGCGCC GCGGTTATCA AGTTGTACTG
ACGTCAGGCC CGGCCGCAGA CGAGATGGCC TGCGTGGATG CTATTGCACG CGGCTGTGAA
ACAAAACCGG TTACTGGTCT GGCGGGTAAA ACTCGCTTTC CTGAATTGGG CGCGTTGATT
GACCATGCGG ATCTCTTTAT CGGCGTTGAT TCGGCCCCTG GTCATATTGC CGCAGCAGTC
AAAACGCCGG TCATTTGCCT TTTCGGCGCG ACGGATCATG TGTTCTGGCG TCCCTGGACC
GACGACATCA TTCAGTTTTG GGCAGGAAAT TATCAGCCGA TGCCTGAACG GCATGAGCTT
GATCGTAATA AAAAATACCT TTCCGTGATT CCGGCTGAAG ATGTTATCGC CGCGACGGAA
AAAATGCTGC CGCATGAGGC ACGCGCCATA GATTTGGACA GCCTGCTATG A
 
Protein sequence
MEKPFRRILI IKMRFHGDML LTTPVISTLK QNYPDAKIDV LLYQNTIPIL SENPEINALY 
GISNKGAGTK EKIKNALSLI KKLRANSYDL VVNLTDQWSV ALIVRFLNAK IKISQDFGNR
QSALWKKSFT HLVPYAGEHA VDRTLSALKP LALKQYVTET TMSYRPEHWE NMRQQLKQLG
VTRQYVVIQP TARQLFKCWD NDKFSQVIDA VQRRGYQVVL TSGPAADEMA CVDAIARGCE
TKPVTGLAGK TRFPELGALI DHADLFIGVD SAPGHIAAAV KTPVICLFGA TDHVFWRPWT
DDIIQFWAGN YQPMPERHEL DRNKKYLSVI PAEDVIAATE KMLPHEARAI DLDSLL