Gene EcHS_A3831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3831 
SymbolrfaF 
ID5593281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3826484 
End bp3827530 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content54% 
IMG OID640922943 
ProductADP-heptose:LPS heptosyltransferase II 
Protein accessionYP_001460421 
Protein GI157163103 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02195] lipopolysaccharide heptosyltransferase II 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.000000276456 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAC TGGTGATCGG CCCGTCTTGG GTTGGCGACA TGATGATGTC GCAAAGTCTC 
TATCGCACGC TCCAGGCGCG CTATCCCCAG GCGATAATCG ATGTGATGGC ACCGGCATGG
TGCCGTCCAT TATTATCGCG GATGCCGGAA GTTAACGAAG CTATCCCTAT GCCTCTCGGT
CACGGAGCGC TGGAAATCGG CGAACGCCGC AAACTGGGTC ATAGCCTGCG TGAAAAGCGC
TACGACCGCG CCTACGTCTT ACCAAACTCC TTCAAATCTG CATTAGTGCC TTTCTTCGCG
GGTATTCCTC ATCGCACTGG CTGGCGCGGC GAGATGCGCT ACGGTTTACT CAACGATGTA
CGCGTGCTCG ATAAAGAAGC CTGGCCGCTA ATGGTGGAAC GCTATGTCGC GCTGGCCTAT
GACAAAGGCA TTATGCGTAC CGCACAAGAT CTGCCGCAGC CATTGTTATG GCCGCAGTTG
CAGGTGAGCG AAGGTGAAAA ATCATATACC TGTAATCAAT TTTCGCTTTC ATCAGAACGT
CCGATGATTG GCTTTTGCCC GGGTGCGGAG TTTGGTCCGG CAAAACGCTG GCCACACTAC
CACTATGCGG AGCTGGCAAA GCAGCTGATT GATGAAGGTT ATCAGGTGGT TCTGTTTGGC
TCTGCGAAAG ATCATGAAGC GGGCAATGAG ATTCTTGCCG CTTTGAATAC CGAGCAGCAG
GCATGGTGTC GGAACCTGGC GGGGGAAACA CAGCTTGATC AAGCGGTTAT CCTGATTGCA
GCCTGTAAAG CCATTGTCAC TAACGATTCT GGCCTAATGC ACGTTGCGGC GGCGCTCAAT
CGTCCGCTGG TTGCCCTGTA TGGTCCGAGT AGCCCGGACT TCACACCGCC GCTATCCCAT
AAAGCGCGCG TGATCCGTCT GATTACCGGC TATCACAAAG TGCGTAAAGG TGACGCTGCG
GAGGGTTATC ACCAGAGCTT GATCGACATT ACTCCCCAGC GCGTACTGGA AGAACTCAAC
GCGCTATTGT TACAAGAGGA AGCCTGA
 
Protein sequence
MKILVIGPSW VGDMMMSQSL YRTLQARYPQ AIIDVMAPAW CRPLLSRMPE VNEAIPMPLG 
HGALEIGERR KLGHSLREKR YDRAYVLPNS FKSALVPFFA GIPHRTGWRG EMRYGLLNDV
RVLDKEAWPL MVERYVALAY DKGIMRTAQD LPQPLLWPQL QVSEGEKSYT CNQFSLSSER
PMIGFCPGAE FGPAKRWPHY HYAELAKQLI DEGYQVVLFG SAKDHEAGNE ILAALNTEQQ
AWCRNLAGET QLDQAVILIA ACKAIVTNDS GLMHVAAALN RPLVALYGPS SPDFTPPLSH
KARVIRLITG YHKVRKGDAA EGYHQSLIDI TPQRVLEELN ALLLQEEA