Gene SeD_A1359 details


Gene Information

Locus tag: SeD_A1359
Symbol: msbB
ID: 6872257
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1334404
End bp: 1335375
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 53%
IMG OID: 642784527
Product: lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase
Protein accession: YP_002215197
Protein GI: 198243024
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1560] Lauroyl/myristoyl acyltransferase
TIGRFAM ID: [TIGR02208] lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase
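
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function name `check_lengths` is hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that does not
    contribute a residue to the protein.
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_len_bp // 3         # codons, including the stop codon
    return {
        "span_matches_gene_length": span == gene_len_bp,
        "length_is_whole_codons": gene_len_bp % 3 == 0,
        "protein_length_matches": codons - 1 == protein_len_aa,
    }


# Values copied from the record: Start bp, End bp, Gene Length, Protein Length.
result = check_lengths(1334404, 1335375, 972, 323)
for name, ok in result.items():
    print(f"{name}: {ok}")
```

Under these assumptions all three checks hold for this record: the inclusive span is 972 bp, which is 324 codons, and dropping the stop codon leaves 323 residues.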


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.00203152
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAACCA AAAAAAATAA TAGTGAGTAT ATCCCTGAAT TCGAAAAATC CTTTCGCTAT 
CCGCAGTATT GGGGCGCCTG GTTGGGCGCT GCGGCAATGG CGGGGATCGC ATTAACACCG
GCATCATTCC GTGACCCTTT GCTGGCGACA CTGGGGCGCT TTGCCGGACG GCTGGGGAAG
AGTTCTCGTC GCCGGGCGCT AATTAATCTG TCTTTGTGCT TTCCGCAGCG TAGCGAAGCT
GAGCGCGAAG CGATTGTCGA TGAGATGTTC GCCACCGCGC CGCAGGCAAT GGCGATGATG
GCTGAGTTGG CGATGCGCGG TCCGAAAAAA ATCCAACAGC GTGTTGACTG GGAAGGTCTG
GAAATCATTG AGGAGATGCG TCGTAACGAC GAAAAAGTCA TTTTTCTCGT ACCGCATGGC
TGGGGCGTCG ACATTCCGGC CATGCTGATG GCCTCTCAGG GGCAAAAAAT GGCGGCGATG
TTTCATAATC AGGGTAATCC GGTTTTTGAC TATATCTGGA ACACAGTGCG TCGGCGTTTT
GGCGGACGTT TGCATGCGCG TAATGACGGG ATTAAACCCT TTATTCAGTC TGTTCGTCAG
GGCTACTGGG GCTACTACCT GCCGGACCAG GATCACGGCC CGGAGCATAG TGAATTCGTT
GATTTCTTTG CGACATACAA AGCGACGCTG CCTGCGATTG GTCGGCTGAT GAAAGTATGC
CGCGCACGCG TGATACCGCT TTTCCCGGTG TATAATGGTA AAACGCATCG CCTGACTATC
CAGATTCGCC CGCCAATGGA CGATCTGCTC ACGGCTGACG ATCACACAAT CGCCAGACGG
ATGAACGAAG AGGTCGAAAT TTTTGTCGGC CCGCATCCGG AACAGTACAC CTGGATCCTG
AAGCTGCTCA AAACCCGCAA GCCAGGCGAG ATTCAGCCGT ATAAGCGTAA AGATCTTTAT
CCCATCAAAT AA
 
Protein sequence
METKKNNSEY IPEFEKSFRY PQYWGAWLGA AAMAGIALTP ASFRDPLLAT LGRFAGRLGK 
SSRRRALINL SLCFPQRSEA EREAIVDEMF ATAPQAMAMM AELAMRGPKK IQQRVDWEGL
EIIEEMRRND EKVIFLVPHG WGVDIPAMLM ASQGQKMAAM FHNQGNPVFD YIWNTVRRRF
GGRLHARNDG IKPFIQSVRQ GYWGYYLPDQ DHGPEHSEFV DFFATYKATL PAIGRLMKVC
RARVIPLFPV YNGKTHRLTI QIRPPMDDLL TADDHTIARR MNEEVEIFVG PHPEQYTWIL
KLLKTRKPGE IQPYKRKDLY PIK
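
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a rough sketch of one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the database, and only the first line of the gene sequence is used as input here.

```python
def clean_sequence(text):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGGAAACCA AAAAAAATAA TAGTGAGTAT ATCCCTGAAT TCGAAAAATC CTTTCGCTAT"
seq = clean_sequence(first_line)

print(len(seq))                       # number of bases in this line
print(f"{gc_fraction(seq):.2f}")      # GC fraction of this line only
```

To compare against the reported GC content, pass the full gene sequence to `clean_sequence` rather than a single line; the value for one line will generally differ from the whole-gene figure.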