Gene EcHS_A2184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2184 
Symbol 
ID5594251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2163469 
End bp2164899 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content33% 
IMG OID640921317 
Productputative polysaccharide biosynthesis protein 
Protein accessionYP_001458856 
Protein GI157161538 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00000339542 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATTGC TTTCAAATGC TAAATGGAAC ATGGTATCAC AATTTATCAA AATGTTAGTG 
CAACTAACAA ACATAGTCTA TTTAGCAAAA ATTATTCCTC CAAGCGAATA TGGTTTAATG
GCTATGGCTC TTGTTGTTGT GAACTTAGGT ATCCTGTTAC GAGATTTAGG CACTTCTGCT
GCATTAATAC AAAGGAAAGA TCTTACTGAG TCTCTAATTA ATACTGTTTT TTGGTTAAAT
CTCCTCATGG GATTGACTTT ATTTGTTCTG GTTTTTTCGG GCTCTAGTGT GATATCCAGT
ATCTATCATC AGCCGAAATT AACTTTAGTA TTGATGTTGC TTAGTTTTAC ATTTCCTCTT
TCGAGCTGTG CGGCAGCTCA CCTTGCTTTG CTTGAGAGGG ACTCGAAATT CAAAACAGTC
TCTAGGATAG AAATTTCTTC TTCACTTGCT TCATTAGTAC TAGCGATTAC TTTAGCAAAT
ATGGGCTTTG GTGTTTTCAG TCTTGTTGGT CAAGCTTTAA TACTTAATCT GATGTCTGCT
ATACAATTCT GGCTTGCATC AAATTGGAAA CCTTCTATTA GGGTTTTTAT TAATTATAAA
GATTTGAAAA GTATTTTTAG TTTTAGTGCA AATTTATCAA TGTTTAATTT TATAAATTAT
TTTTCAAGGA ACGCAGATAG CTTTATTATT GGTAAATTCA TGTCCGCTTC AATTCTCGGG
AGTTATAATC TTGCTTATAG AATTATGCTT TTCCCATTGC AAAGTCTTAC CTTTGTCGCA
ACTAGGTCAC TTTATCCAAT ATTAAGTAAA CAACAAAGTA ATAATCAGCA TATATCAAAA
ATATATTTAA AGTGTGTCTA TGTTGTGTTG TTTATAACAT GTCCATTGAT GTCTGGACTG
GCATTTTATA GTGAGCCTTT TATCCGATTG ATATTCGGTG ATGAATGGTA TCTGACGGGA
ATAGTATTAA AATGGTTAGC GCCAACTGCA ATAATACAAG CGGTACTTAG TACTTCTGGT
TCAGTTTTTA TGGCTAAAGG TCGTACAGAT ATTTTATTAA AGCTTGGCAT TATTGGAATG
ATCTTGCAAG TGGGAGCTTT CATCATAGGT GTTCAATATA CAATAACCAC ATTTGCTATG
TGTTATCTTC TTGCTAATGT TATTAATTTT TTCCCTGTCA TGTGGTCTTT AATGGGTTTG
TTAGGGGAGA ATCTCAATGT TTTTTTTAAA AAGAACTATT CAATTGTTTT GGCTACATTA
GTCATGTTAA GTTACTTAAA ACTCATTGAT TATTTCTTTT TCCCCAATGC TATTGCGAGC
CTTAATATTT TAGTTCTATT ATCCTTTTCT GGCGCCGTTG TTTATTTATT AGCCTCTCTC
ACTCTTTCCG CAACTCTTAG AAAATTTGTG CTATCAAAGA TTAAAAAATG A
 
Protein sequence
MTLLSNAKWN MVSQFIKMLV QLTNIVYLAK IIPPSEYGLM AMALVVVNLG ILLRDLGTSA 
ALIQRKDLTE SLINTVFWLN LLMGLTLFVL VFSGSSVISS IYHQPKLTLV LMLLSFTFPL
SSCAAAHLAL LERDSKFKTV SRIEISSSLA SLVLAITLAN MGFGVFSLVG QALILNLMSA
IQFWLASNWK PSIRVFINYK DLKSIFSFSA NLSMFNFINY FSRNADSFII GKFMSASILG
SYNLAYRIML FPLQSLTFVA TRSLYPILSK QQSNNQHISK IYLKCVYVVL FITCPLMSGL
AFYSEPFIRL IFGDEWYLTG IVLKWLAPTA IIQAVLSTSG SVFMAKGRTD ILLKLGIIGM
ILQVGAFIIG VQYTITTFAM CYLLANVINF FPVMWSLMGL LGENLNVFFK KNYSIVLATL
VMLSYLKLID YFFFPNAIAS LNILVLLSFS GAVVYLLASL TLSATLRKFV LSKIKK