Gene SbBS512_E4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4047 
SymbolrfaF 
ID6272930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3779716 
End bp3780762 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content54% 
IMG OID641727887 
ProductADP-heptose:LPS heptosyltransferase II 
Protein accessionYP_001882319 
Protein GI187733078 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02195] lipopolysaccharide heptosyltransferase II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000910947 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAC TGGTGATCGG CCCGTCTTGG GTTGGCGACA TGATGATGTC GCAAAGTCTC 
TATCGCACGC TCCAGGCGCG CTATCCCCAG GCGATAATCG ATGTGATGGC ACCGGCATGG
TGCCGTCCAT TATTATCGCG GATGCCGGAA GTTAACGAAG CTATCCCTAT GCCTCTCGGT
CACGGAGCGC TGGAAATCGG CGAACGCCGC AAACTGGGTC ATAGCCTGCG TGAAAAGCGC
TACGACCGCG CCTACGTCTT ACCCAACTCC TTCAAATCTG CATTAGTGCC TTTCTTCGCG
GGTATTCCTC ATCGCACCGG CTGGCGCGGC GAGATGCGCT ACGGTTTACT CAACGATGTA
CGCGTGCTCG ATAAAGAAGC CTGGCCGCTA ATGGTGGAAC GCTATGTCGC GCTGGCCTAT
GACAAAGGCA TTATGCGTAC CGCACAAGAT CTGCCGCAGC CATTGTTATG GCCGCAGTTG
CAGGTGAGCG AAGGTGAAAA ATCATATACC TGTAATCAAT TTTCGCTTTC ATCAGAACGT
CCGATGATTG GCTTTTGCCC GGGTGCGGAG TTTGGTCCGG CAAAACGCTG GCCACACTAC
CACTATGCGG AACTGGCAAA GCAGCTGATT GATGAAGGTT ATCAGGTGGT TCTGTTTGGC
TCTGCGAAAG ATCATGAAGC GGGCAATGAG ATTCTTGCCG CTTTGAATAC CGAGCAGCAG
GCATGGTGTC GGAACCTGGC GGGGGAAACA CAGCTTGATC AAGCGGTTAT CCTGATTGCA
GCCTGTAAAG CCATTGTCAC TAACGATTCT GGCCTGATGC ACGTTGCGGC GGCGCTCAAT
CGTCCGCTGG TTGCCCTGTA TGGTCCAAGT AGCCCGGACT TCACACCGCC GCTATCCCAT
AAAGCGCGCG TGATCCGTCT GATTACCGGC TATCACAAAG TGCGTAAAGG TGACGCTGCG
GAGGGTTATC ACCAGAGCTT GATCGACATT ACTCCCCAGC GCGTACTGGA AGAACTCAAC
ACGCTATTGT TACAAGAGGA AGCCTGA
 
Protein sequence
MKILVIGPSW VGDMMMSQSL YRTLQARYPQ AIIDVMAPAW CRPLLSRMPE VNEAIPMPLG 
HGALEIGERR KLGHSLREKR YDRAYVLPNS FKSALVPFFA GIPHRTGWRG EMRYGLLNDV
RVLDKEAWPL MVERYVALAY DKGIMRTAQD LPQPLLWPQL QVSEGEKSYT CNQFSLSSER
PMIGFCPGAE FGPAKRWPHY HYAELAKQLI DEGYQVVLFG SAKDHEAGNE ILAALNTEQQ
AWCRNLAGET QLDQAVILIA ACKAIVTNDS GLMHVAAALN RPLVALYGPS SPDFTPPLSH
KARVIRLITG YHKVRKGDAA EGYHQSLIDI TPQRVLEELN TLLLQEEA