Gene EcHS_A3834 details


Gene Information

Locus tag: EcHS_A3834
Symbol:
ID: 5592837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3829874
End bp: 3830857
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 32%
IMG OID: 640922946
Product: glycosyl transferase, group 2 family protein
Protein accession: YP_001460424
Protein GI: 157163106
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
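
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; both assumptions are consistent with the values listed but are not stated by the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3829874, 3830857)
print(length)                     # 984
print(protein_length_aa(length))  # 327
```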


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.00000153474
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAATG ATTACCCTTT AGTATCCATA ATAATACCGA CGTATAATTC ATCTGATTAC 
ATTACTGAAA CTCTAACGAA ATTAGAAAAA CAAACTTACC CAAATTTTGA AATTGTTATT
GTTAATGATG GTTCTAAAGA CAACACATCA AACGTTTTGA GAGAGTATGG GTTAACCCAC
TCTCGATTAA TTATTATCAA TAAAGAAAAT GGCGGTGTTT CGTCTGCCAG GAATACAGGT
ATCCGCAAGG CGCAAGGACA GTTTATATGT TTTATGGATG ATGATGATGA GATAGATCCT
AACTATCTGC TGAAGATGTA TTCCAGACAA CATGAGACGG GAGGAGATGC CATTTATTGT
GGGCTTTATG GCCATCATAT AAAAAATGGT GTTACTTACT CACCTATAAA TACAGAGTTT
AATGAAGGAT CTTTACTTTT CGACTTTTTT TATAAAAAGG TTAGATTCCA TATAGGGTGC
TTGTTTATAA GAAAACAACT TCTGGAAGAG AATAATCTTT TTTTTGATGA AGATTTACGA
CTAGGAGAAG ATCTGGATTT TATCTATCGA CTGCTAATTA CATGCGATAT GTATGCGGTT
CCATATTATA TGTATAAGCA TAACTATAGA GAAAATTCCT TAATGAACTC ATGTAGAACC
ATCACTCATT ATCGACATGA GTCATTTGCG CACGAAAAAA TCTACTCTTC TGTGATGCAG
TTATACAAAG GTAACCGGAA AGAAGAAATT CATACATTAT TGAGTCAAAA TAGAGCTTAT
CATAAAACTC GTTATTTGTG GAATGTTCTA CTTAATGGTG ATTTTAAACA ATTGAATCAA
TTAGTTGAAA GCAATGAAAA AGAATTAAAA GATTGTAATC TTCCTGGCAA GAGAGATAAG
AGACGAGCAA AAATATTAGC ATCAAAAAAT TATATTATCT GGAGGATGGT AAGACTGGTA
AATAGAAAAA AGAATAAACG TTAG
 
Protein sequence
MSNDYPLVSI IIPTYNSSDY ITETLTKLEK QTYPNFEIVI VNDGSKDNTS NVLREYGLTH 
SRLIIINKEN GGVSSARNTG IRKAQGQFIC FMDDDDEIDP NYLLKMYSRQ HETGGDAIYC
GLYGHHIKNG VTYSPINTEF NEGSLLFDFF YKKVRFHIGC LFIRKQLLEE NNLFFDEDLR
LGEDLDFIYR LLITCDMYAV PYYMYKHNYR ENSLMNSCRT ITHYRHESFA HEKIYSSVMQ
LYKGNRKEEI HTLLSQNRAY HKTRYLWNVL LNGDFKQLNQ LVESNEKELK DCNLPGKRDK
RRAKILASKN YIIWRMVRLV NRKKNKR
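
The protein sequence can be cross-checked against the gene sequence by translating it, and the GC content field can be recomputed from the gene sequence. The sketch below is one way to do both using only the Python standard library. It assumes the gene sequence is given 5' to 3' on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted). The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; identical for tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCAATG ATTACCCTTT AGTATCCATA"))  # MSNDYPLVSI
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives a quick consistency check of the record.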