Gene EcHS_A2168 details

Gene Information

Locus tag: EcHS_A2168
Symbol:
ID: 5594720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2145433
End bp: 2147880
Gene Length: 2448 bp
Protein Length: 815 aa
Translation table: 11
GC content: 55%
IMG OID: 640921301
Product: glycosyl transferase, group 1 family protein
Protein accession: YP_001458840
Protein GI: 157161522
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
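
The start, end, gene length, and protein length listed above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2145433, 2147880)
print(length)                  # 2448
print(protein_length(length))  # 815
```

Both values match the Gene Length and Protein Length fields of this record.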


Plasmid Coverage information

Num covering plasmid clones: 65
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGCG CCATTATTGA AAATGCCGGT GAGCATCGCG TCAGTATTTT GATCAATGGT 
ATGTATCCCA TTGATAATAT TAATGATGTT AAAATGGCCT ATCGGGACCT GCTGACGGAT
GAAGATATGT TCATCTTCTC AGCCGTCGCG CCAACAGCCT ATCGTCATAT TGAAAACCAT
GGCCGCAGTA AAGCGGCCCA GGCGGCCCGC GATATCGCTA TCGCCAATAT CGCGCCTGAT
ATCGTCTATG TTATTAGCTT CTTCGAGGGG CATAGCGATA GTTATACCGT GTCTATTCCG
GCGGATAATG TGCCGTGGAA AACGGTGTGC GTATGCCACG ATCTGATCCC GCTGTTAAAT
AAAGAGCGTT ATCTCGGCGA TCCGAATTTC CGTGAGTTTT ATATGAATAA GCTGGCGGAA
TTTGAGCGGG CAGACGCGAT CTTCGCCATT TCGCAGTCGG CAGCTCAGGA AGTGATCGAA
TATACCGACA TCGCCAGCGA TCGGGTACTG AACATCTCCT CGGCGGTGGG CGAAGAGTTT
GCGGTGATCG ATTATTCCGC TGAGCGTATT CAGTCTTTAA AAGACAAATA CAGCCTGCCG
GATGAGTTTA TCTTGTCGCT GGCGATGATC GAGCCGCGGA AAAATATTGA AGCGCTTATT
CACGCCTACA GCTTGCTGCC TGCCGAGCTG CAGCAGCGCT ATCCGATGGT GCTGGCGTAT
AAAGTGCAGC CAGAACAACT GGAGCGGATC CTGCGTCTGG CGGAAAGCTA TGGTTTGTCA
CGCAGCCAGC TTATCTTTAC TGGGTTCCTG ACCGACGACG ATCTGATTGC CCTGTACAAC
CTGTGCAAAC TGTTTGTGTT CCCGTCGCTG CATGAAGGTT TCGGCCTGCC GCCGCTGGAA
GCGATGCGCT GCGGGGCGGC GACCTTAGGT TCAAACATTA CCAGCCTGCC GGAAGTCATT
GGCTGGGAAG ATGCCATGTT CAATCCACAT GATGTGCAGG ACATTCGCCG GGTCATGGAG
AAGGCGCTGA CCGATGAGGC GTTTTATCGT GAGCTGAAGG CGCATGCTCT CGCGCAGTCG
GCCAAATTCT CGTGGGCCAA TACCGCCCAT CTGGCGATCG AGGGTTTCAC TCGTCTGCTG
CAGTCGTCCC AAGAGACGGA TGCCGGGCAG GCGGAAAGCG TGACCGCCTC CCGCCTTCAG
ATGATGCAGA AAATCGATGC GCTGAGCGAA GTCGACCGTC TTGGTCTGGC ATGGGCGGTT
GCGCGCAATA GCTTTAAGCG CCATACCCGC AAGCTGCTGG TGGACATTTC GGTTCTGGCG
CAGCATGACG CGAAGACCGG GATCCAGCGC GTCTCGCGCA GTATCCTCAG CGAATTACTG
AAATCTGGCG TGCCGGGTTA CGAGGTTTCC GCCGTCTACT ACACCCCTGG CGAGTGTTAT
CGCTACGCCA ACCAATATCT GTCCAGTCAT TTCCCGGGCG AATTTGGTGC TGACGAACCG
GTGCTGTTCA GCAAAGACGA TGTGCTTATT GCCACCGATC TGACCGCGCA TCTCTTCCCG
GAGCTGGTGA CGCAAATTGA CAGCATGCGC GCCGCCGGGG CGTTCGCCTG CTTCGTGGTG
CATGATATTC TGCCATTACG CCGTCCGGAG TGGAGTATCG AAGGCATTCA GCGTGATTTC
CCGATCTGGC TGTCCTGCCT CGCAGAGCAC GCCGACCGGC TGATCTGCGT CTCTGCCAGC
GTCGCCGAGG ATGTGAAAGC GTGGATTGCG GAAAATCGCC ATTGGGTGAA ACCGAACCCG
CTGCAGACCG TCAGCAACTT CCATCTGGGA GCCGACCTCG ATGCCAGCGT ACCGTCCACT
GGCATGCCGG ATAATGCACA GGCGCTGTTA GCAGCGATGG CTGCAGCTCC GTCATTTATC
ATGGTGGGTA CCATGGAGCC GCGCAAAGGC CACGCCCAGA CGCTGGCCGC CTTCGAGGAA
CTGTGGCGCG AGGGCAAAGA CTACAACTTG TTTATCGTCG GCAAACAGGG CTGGAACGTT
GACAGCTTGT GCGAAAAATT ACGCCATCAT CCGCAGCTGA ACAAAAAGCT CTTCTGGCTG
CAGAATATCA GCGACGAGTT TTTGGCCGAG CTATACGCTC GTTCACGCGC GCTGATCTTT
GCCTCGCAGG GAGAAGGCTT TGGCCTGCCG TTGATTGAAG CGGCGCAGAA AAAGCTGCCG
GTGATTATTC GCGACATTCC GGTGTTTAAA GAGATTGCTC AGGAACATGC CTGGTATTTC
TCCGGTGAAG CGCCGTCCGA TATCGCGAAG GCGGTAGAAG AGTGGTTAGC CCTGTACGAG
CAAAACGCGC ATCCTCGTTC CGAAAATATC AACTGGTTAA CCTGGAAACA GAGCGCGGAA
TTTCTCCTGA AAAACCTGCC GATTATCGCG CCAGCCGCGA AGCAATAA
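
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be stripped of whitespace before its length or base composition can be computed. The following is a minimal sketch of that cleanup and a GC percentage calculation; the helper names are hypothetical, and the example uses only the first displayed line of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied as shown on the page.
first_line = "ATGAGCCGCG CCATTATTGA AAATGCCGGT GAGCATCGCG TCAGTATTTT GATCAATGGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Applied to the complete sequence, `len(seq)` and `gc_percent(seq)` can be compared with the Gene Length and GC content fields reported in the Gene Information section.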
 
Protein sequence
MSRAIIENAG EHRVSILING MYPIDNINDV KMAYRDLLTD EDMFIFSAVA PTAYRHIENH 
GRSKAAQAAR DIAIANIAPD IVYVISFFEG HSDSYTVSIP ADNVPWKTVC VCHDLIPLLN
KERYLGDPNF REFYMNKLAE FERADAIFAI SQSAAQEVIE YTDIASDRVL NISSAVGEEF
AVIDYSAERI QSLKDKYSLP DEFILSLAMI EPRKNIEALI HAYSLLPAEL QQRYPMVLAY
KVQPEQLERI LRLAESYGLS RSQLIFTGFL TDDDLIALYN LCKLFVFPSL HEGFGLPPLE
AMRCGAATLG SNITSLPEVI GWEDAMFNPH DVQDIRRVME KALTDEAFYR ELKAHALAQS
AKFSWANTAH LAIEGFTRLL QSSQETDAGQ AESVTASRLQ MMQKIDALSE VDRLGLAWAV
ARNSFKRHTR KLLVDISVLA QHDAKTGIQR VSRSILSELL KSGVPGYEVS AVYYTPGECY
RYANQYLSSH FPGEFGADEP VLFSKDDVLI ATDLTAHLFP ELVTQIDSMR AAGAFACFVV
HDILPLRRPE WSIEGIQRDF PIWLSCLAEH ADRLICVSAS VAEDVKAWIA ENRHWVKPNP
LQTVSNFHLG ADLDASVPST GMPDNAQALL AAMAAAPSFI MVGTMEPRKG HAQTLAAFEE
LWREGKDYNL FIVGKQGWNV DSLCEKLRHH PQLNKKLFWL QNISDEFLAE LYARSRALIF
ASQGEGFGLP LIEAAQKKLP VIIRDIPVFK EIAQEHAWYF SGEAPSDIAK AVEEWLALYE
QNAHPRSENI NWLTWKQSAE FLLKNLPIIA PAAKQ