Gene EcolC_0089 details


Gene Information

Locus tag: EcolC_0089
Symbol:
ID: 6068622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 93826
End bp: 94872
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 54%
IMG OID: 641599493
Product: ADP-heptose:LPS heptosyltransferase II
Protein accession: YP_001723102
Protein GI: 170018148
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02195] lipopolysaccharide heptosyltransferase II
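
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(93826, 94872)
print(length_bp, protein_length(length_bp))  # prints: 1047 348
```

Both values match the Gene Length and Protein Length fields of this record.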


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000834595
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.156713
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAC TGGTGATCGG CCCGTCTTGG GTTGGCGACA TGATGATGTC GCAAAGTCTC 
TATCGCACGC TCCAGGCGCG CTATCCCCAG GCGATAATCG ATGTGATGGC ACCGGCATGG
TGCCGTCCAT TATTATCGCG GATGCCGGAA GTTAACGAAG CTATCCCTAT GCCTCTCGGT
CACGGAGCGC TGGAAATCGG CGAACGCCGC AAACTGGGTC ATAGCCTGCG TGAAAAGCGC
TACGACCGCG CCTACGTCTT ACCAAACTCC TTCAAATCTG CATTAGTGCC TTTCTTCGCG
GGTATTCCTC ATCGCACTGG CTGGCGCGGC GAGATGCGCT ACGGTTTACT CAACGATGTA
CGCGTGCTCG ATAAAGAAGC CTGGCCGCTA ATGGTGGAAC GCTATGTCGC GCTGGCCTAT
GACAAAGGCA TTATGCGTAC CGCACAAGAT CTGCCGCAGC CATTGTTATG GCCGCAGTTG
CAGGTGAGCG AAGGTGAAAA ATCATATACC TGTAATCAAT TTTCGCTTTC ATCAGAACGT
CCGATGATTG GCTTTTGCCC GGGTGCGGAG TTTGGTCCGG CAAAACGCTG GCCACACTAC
CACTATGCGG AGCTGGCAAA GCAGCTGATT GATGAAGGTT ATCAGGTGGT TCTGTTTGGC
TCTGCGAAAG ATCATGAAGC GGGCAATGAG ATTCTTGCCG CTTTGAATAC CGAGCAGCAG
GCATGGTGTC GGAACCTGGC GGGGGAAACA CAGCTTGATC AAGCGGTTAT CCTGATTGCA
GCCTGTAAAG CCATTGTCAC TAACGATTCT GGCCTAATGC ACGTTGCGGC GGCGCTCAAT
CGTCCGCTGG TTGCCCTGTA TGGTCCGAGT AGCCCGGACT TCACACCGCC GCTATCCCAT
AAAGCGCGCG TGATCCGTCT GATTACCGGC TATCACAAAG TGCGTAAAGG TGACGCTGCG
GAGGGTTATC ACCAGAGCTT GATCGACATT ACTCCCCAGC GCGTACTGGA AGAACTCAAC
GCGCTATTGT TACAAGAGGA AGCCTGA
 
Protein sequence
MKILVIGPSW VGDMMMSQSL YRTLQARYPQ AIIDVMAPAW CRPLLSRMPE VNEAIPMPLG 
HGALEIGERR KLGHSLREKR YDRAYVLPNS FKSALVPFFA GIPHRTGWRG EMRYGLLNDV
RVLDKEAWPL MVERYVALAY DKGIMRTAQD LPQPLLWPQL QVSEGEKSYT CNQFSLSSER
PMIGFCPGAE FGPAKRWPHY HYAELAKQLI DEGYQVVLFG SAKDHEAGNE ILAALNTEQQ
AWCRNLAGET QLDQAVILIA ACKAIVTNDS GLMHVAAALN RPLVALYGPS SPDFTPPLSH
KARVIRLITG YHKVRKGDAA EGYHQSLIDI TPQRVLEELN ALLLQEEA
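
The sequences above are laid out in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, translated, and summarized by GC percentage. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). All names in the code are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table built in the conventional TCAG order; amino-acid assignments
# are assumed to be those of the standard code, shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAAAATAC TGGTGATCGG CCCGTCTTGG"
print(translate(first_blocks))  # prints: MKILVIGPSW
```

The translated fragment matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a figure that can be compared with the GC content field of the record.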