Gene SeD_A4097 details

Gene Information

Locus tag: SeD_A4097
Symbol: rfaF
ID: 6871416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3945950
End bp: 3946996
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 59%
IMG OID: 642787046
Product: ADP-heptose:LPS heptosyltransferase II
Protein accession: YP_002217673
Protein GI: 198243825
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02195] lipopolysaccharide heptosyltransferase II
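
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3945950
end_bp = 3946996
reported_gene_length = 1047       # bp
reported_protein_length = 348     # aa

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1047 348
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # True True
```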


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTT TGGTCATTGG CCCGTCCTGG GTGGGCGACA TGATGATGTC GCAAAGTCTC 
TATCGCACGC TTAAAGCGCG CTATCCCCAG GCGATAATCG ACGTGATGGC GCCAGCCTGG
TGTCGTCCGT TGTTATCGCG TATGCCGGAA GTTAACGAGG CGATACCCAT GCCGCTGGGC
CACGGCGCGC TGGAAATCGG CGAGCGCCGT AGATTGGGCC ATAGCCTGCG TGAGAAGCGC
TACGATCGCG CCTGGGTGCT GCCAAATTCG TTTAAATCGG CGCTGGTTCC TTTCTTTGCC
AATATCCCGC ACCGTACCGG CTGGCGCGGC GAAATGCGCT ATGGCCTGCT GAACGATGCG
CGCGTCCTTG ATAAAGACGC CTGGCCGCTG ATGGTGGAGC GCTACGTGGC GCTGGCCTAT
GACAAGGGGG TGATGCGCAC GGCGAAAGAT CTGCCGCAGC CGCTACTCTG GCCACAGCTC
CTGGTTAGCG AGGGTGAAAA GTCGTTGATA CGCAGCGACT TTTCGCTATC TTCTGAACGT
CCTCTGATCG GCTTTTGCCC CGGCGCGGAA TTTGGCCCGG CAAAACGTTG GCCGCACTAT
CACTACGCCG AACTGGCAAA GCAGCTCATT AACGAAGGGT ACCAGGTCGT ACTGTTTGGC
TCGGCAAAAG ACCATGAAGC CGGAAATGAG ATCCTGGCGG CGCTGAATAG CGAGCAGCAG
GCATGGTGTC GCAACCTGGC GGGGGAAACC CAGCTGGAAC AGGCCGTCAT TCTGATAGCC
GCCTGTAAAG CCATCGTCAC CAACGATTCC GGGCTGATGC ACGTCGCGGC GGCGCTCGAC
CGCCCGCTGG TCGCCTTGTA TGGCCCAAGT AGCCCGGATT TCACGCCACC GCTGTCTCAT
AAGGCCCGGG TGATTCGTCT CATTACGGGT TATCACAAAG TGCGTAAAGG CGATACGGCG
CAAGGCTATC ACCAGAGCCT GATCGATATC ACGCCGCAGC GGGTTCTGGA AGAGCTTCAT
TCGCTGTTGT CGGAAGAGGG CGTTTAA
 
Protein sequence
MKILVIGPSW VGDMMMSQSL YRTLKARYPQ AIIDVMAPAW CRPLLSRMPE VNEAIPMPLG 
HGALEIGERR RLGHSLREKR YDRAWVLPNS FKSALVPFFA NIPHRTGWRG EMRYGLLNDA
RVLDKDAWPL MVERYVALAY DKGVMRTAKD LPQPLLWPQL LVSEGEKSLI RSDFSLSSER
PLIGFCPGAE FGPAKRWPHY HYAELAKQLI NEGYQVVLFG SAKDHEAGNE ILAALNSEQQ
AWCRNLAGET QLEQAVILIA ACKAIVTNDS GLMHVAAALD RPLVALYGPS SPDFTPPLSH
KARVIRLITG YHKVRKGDTA QGYHQSLIDI TPQRVLEELH SLLSEEGV
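
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The sketch below is an illustration, not the method used to produce this record. It builds the standard codon table, whose codon-to-amino-acid assignments are the same in translation table 11 (table 11 differs only in which codons may act as start codons), and translates the first and last lines of the gene sequence. The helper name `translate` and the constants are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-nt groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence.
first_block = "ATGAAAATTT TGGTCATTGG CCCGTCCTGG"
# Last line of the gene sequence; it is in frame because the 17 preceding
# lines hold 60 nt each (1020 nt, a multiple of 3).
last_block = "TCGCTGTTGT CGGAAGAGGG CGTTTAA"

print(translate(first_block))  # MKILVIGPSW
print(translate(last_block))   # SLLSEEGV
```

Both results match the start and end of the protein sequence listed above. Translating the full gene sequence the same way should give a 348-residue product if the record is internally consistent.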