Gene EcSMS35_3957 details


Gene Information

Locus tag: EcSMS35_3957
Symbol: rfaF
ID: 6147338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4034924
End bp: 4035970
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 54%
IMG OID: 641618783
Product: ADP-heptose:LPS heptosyltransferase II
Protein accession: YP_001745922
Protein GI: 170683841
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02195] lipopolysaccharide heptosyltransferase II
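
The coordinate and length fields above are mutually consistent, which a short Python sketch can check. It assumes that the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4034924
end_bp = 4035970

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1047
print(protein_length_aa)  # 348
```

Under these assumptions the computed values agree with the listed gene length of 1047 bp and protein length of 348 aa.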


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000313085
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAC TGGTGATCGG CCCGTCTTGG GTTGGCGACA TGATGATGTC GCAAAGTCTC 
TATCGCACGC TCCAGGCGCG CTATCCCCAG GCGATAATCG ACGTGATGGC ACCGGCATGG
TGCCGTCCAT TATTATCGCG GATGCCGGAA GTTAACGAAG CGATCCCTAT GCCTCTCGGT
CACGGAGCGC TGGAAATCGG CGAACGCCGC AAACTGGGTC ATAGTCTGCG TGAAAAGCGC
TACGACCGCG CCTACGTCTT ACCCAACTCC TTCAAATCCG CATTAGTGCC TTTCTTCGCG
GGTATTCCTC ATCGCACCGG CTGGCGCGGC GAGATGCGCT ACGGTTTACT TAACGATGTA
CGCGTGCTCG ATAAAGAAGC CTGGCCGCTA ATGGTGGAAC GCTATGTGGC GCTGGCCTAT
GACAAAGGCA TTATGCGTAC AGCACAAGAT CTGCCGCAGC CATTGTTATG GCCGCAGTTG
CAGGTGAGCG AAGGTGAAAA ATCATATACC TGTAATCAAT TTTCGCTCTC ATCAGAACGT
CCGATGATTG GCTTTTGCCC CGGAGCGGAG TTTGGTCCGG CAAAACGCTG GCCACACTAC
CACTATGCGG AGCTGGCAAA GCAGCTGATT GATGAAGGTT ATCAGGTGGT TCTGTTTGGC
TCTGCGAAAG ATCATGAAGC GGGCAATGAG ATTCTTGCCG CTTTAAATAC TGAGCAGCAG
GCATGGTGCC GGAACCTGGC AGGGGAAACA CAGCTTGATC AAGCGGTTAT CCTGATTGCA
GCCTGTAAAG CCATTGTCAC TAACGATTCT GGCCTGATGC ACGTTGCGGC GGCGCTCAAT
CGTCCGCTGG TTGCCTTGTA TGGTCCGAGT AGCCCGGACT TCACACCGCC GCTATCCCAT
AAAGCACGCG TGATCCGCCT GATTACCGGC TATCACAAAG TGCGTAAAGG TGACGCAGCG
GAGGGTTATC ACCAGAGCTT GATCGACATT ACTCCCCAGC GCGTACTGGA AGAACTCAAC
GCGCTATTGT TACAAGAGGA AGCCTGA
 
Protein sequence
MKILVIGPSW VGDMMMSQSL YRTLQARYPQ AIIDVMAPAW CRPLLSRMPE VNEAIPMPLG 
HGALEIGERR KLGHSLREKR YDRAYVLPNS FKSALVPFFA GIPHRTGWRG EMRYGLLNDV
RVLDKEAWPL MVERYVALAY DKGIMRTAQD LPQPLLWPQL QVSEGEKSYT CNQFSLSSER
PMIGFCPGAE FGPAKRWPHY HYAELAKQLI DEGYQVVLFG SAKDHEAGNE ILAALNTEQQ
AWCRNLAGET QLDQAVILIA ACKAIVTNDS GLMHVAAALN RPLVALYGPS SPDFTPPLSH
KARVIRLITG YHKVRKGDAA EGYHQSLIDI TPQRVLEELN ALLLQEEA
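
The gene and protein sequences can be related to each other, and to the GC content field, with a minimal Python sketch. It assumes the amino acid assignments of NCBI translation table 11 (the table named in this record), written in the usual TCAG codon order, and it stops at the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and chosen for illustration; they do not come from any library.

```python
# Illustrative helpers; all names here are hypothetical.
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAAATAC TGGTGATCGG CCCGTCTTGG"
print(translate(first_blocks))  # MKILVIGPSW
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way gives values that can be compared with the protein sequence and the GC content field.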