Gene EcHS_A2190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2190 
Symbol 
ID5594125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2171069 
End bp2172496 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content35% 
IMG OID640921323 
Productsugar transferase 
Protein accessionYP_001458862 
Protein GI157161544 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000191042 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCTTC TCGTAAAGAA TGTCATAGTT AGTATCACAT TGGCTTTATC AGATTTCATT 
TCATTCATTC TTTCTTTATA CATAGCGTTG GATGTGTTGT CATTAATTGT AGAAAAGAAT
GATTATTTAT TATTAACTAA TGAAATTGAT GGGTGGTTTT TACTTCATTG GATTCTCGGT
GTTTGTTGTG TAGGCTGGTA TTCAGTTAGA CTAAGGCATT ATTTTTATCG TAAACCATTT
TGGTTTGAAC TTAAAGAAAT ACTTCGAACC TTGGTTATTT TTGCAATAAT CGAAATTGCT
GTTATGGCTT TTACCACTTG GTCATTCTCA CGTTACTTAT GGGTGATAAC GTGGTTATTT
GTAATCATAC TTGTTCCATT GTCAAGAATG GTAACAAAGA CCTTATTGAA TAAAGCGGGG
GTGTGGCAAA GAGATACTTG GATAGTAGGT AGTAGTGAAA ATGCTCATGA AGCATATAAA
GCAATTAGCG GAGAAAAAAA CTTAGGTTTA AACATTGTCG GTTTCATATC AAGTCCTGGC
GGAGTTCCAT CAGGAGAGGC AATTGATGGT ATACCAGTGA TTGAAAATAA TCTAGAATGG
CTTAATGATA TTGATCCCAA AACACAATTC ATTGTTGCTG TTGAGTCTCA TCAAAGTAAA
ATAAGAAATG CGTGGCTTAG AAACTTTATG ATTAAAGGCT ATCGATATGT CTCAGTTATT
CCAACATTGC GTGGAATGCC TCTTGATAGC ACAGATATGT CATTCATATT TAGTCATGAA
GTTATGATTT TTCGTGTACA GCAAAATCTA GCCAAATGGT CGTCAAGAAT TCTCAAAAGA
CTTTTTGATA TTATAGGCTC ATTGTCTATA ATTACTCTTC TTTCACCAGT ATTATTATAT
ATAAGTCTTA AAGTTAAAAA AGATGGAGGC CCGGCAATAT ATGGTCATGA AAGGATCGGG
AAAGGAGGTA AACCCTTTAA GTGTTTGAAG TTCAGGTCTA TGGTAATCAA TTCCAAAGAA
GTTCTTGAAG AACTTCTAAA TAACGATATA AATGCGCGAG AAGAATGGAA TTTAACATTC
AAATTGAAAA ACGACCCAAG AATAACTAAG ATAGGCGGTT TCCTTAGAAG GACAAGCCTG
GATGAATTGC CGCAGTTATT TAATGTATTA AAAGGGGAAA TGAGTTTAGT TGGACCAAGG
CCTATCATTA CTGCTGAGTT AGAAAGATAC AATGAAGAAG TTGATTATTA TTTATTGAGC
AAACCAGGAA TGACAGGCCT ATGGCAAGTT AGTGGGCGTA GTGATGTGGA TTATGAGACA
CGCGTTTATC TTGATGCTTG GTACGTTAAA AATTGGTCAA TGTGGAATGA TATCGCGATT
TTATTTAAAA CTGTGGGTGT TGTATTAAAA AGAGACGGTG CGTATTAA
 
Protein sequence
MRLLVKNVIV SITLALSDFI SFILSLYIAL DVLSLIVEKN DYLLLTNEID GWFLLHWILG 
VCCVGWYSVR LRHYFYRKPF WFELKEILRT LVIFAIIEIA VMAFTTWSFS RYLWVITWLF
VIILVPLSRM VTKTLLNKAG VWQRDTWIVG SSENAHEAYK AISGEKNLGL NIVGFISSPG
GVPSGEAIDG IPVIENNLEW LNDIDPKTQF IVAVESHQSK IRNAWLRNFM IKGYRYVSVI
PTLRGMPLDS TDMSFIFSHE VMIFRVQQNL AKWSSRILKR LFDIIGSLSI ITLLSPVLLY
ISLKVKKDGG PAIYGHERIG KGGKPFKCLK FRSMVINSKE VLEELLNNDI NAREEWNLTF
KLKNDPRITK IGGFLRRTSL DELPQLFNVL KGEMSLVGPR PIITAELERY NEEVDYYLLS
KPGMTGLWQV SGRSDVDYET RVYLDAWYVK NWSMWNDIAI LFKTVGVVLK RDGAY