Gene EcHS_A0425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0425 
Symbol 
ID5595106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp446684 
End bp447880 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content50% 
IMG OID640919610 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_001457195 
Protein GI157159877 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACCT GGATATTTAT CTGTATGTCC ATAGCAATGT TGCTATGGTT TTTAAGTACG 
CTAAGACGTA AACCCAGTCA AAAGAAAGGC TGTATTGACG CCATTATACC TGCGTATAAC
GAAGGCCCGT GTCTGGCGCA GTCACTGGAT AATCTACTGC GTAACCCTTA TTTTTGCCGG
GTAATTTGCG TTAACGACGG CTCCACGGAC AATACCGAAG CGGTCATGGC GGAAGTCAAA
CGCAAATGGG GCGACCGCTT TGTTGCCGTC ACGCAAAAAA ATACCGGTAA AGGTGGTGCG
CTGATGAATG GCCTCAATTA TGCCACCTGC GACCAGGTTT TTTTAAGTGA TGCCGACACC
TATGTTCCGC CCGATCAAGA CGGAATGGGC TATATGCTGG CAGAAATTGA GCGCGGTGCT
GATGCCGTAG GCGGCATTCC CTCTACTGCG TTGAAAGGCG CGGGGCTGTT ACCGCACATC
CGCGCGACCG TAAAGTTGCC GATGATTGTT ATGAAGCGCA CGCTACAGCA GCTCCTGGGC
GGCGCACCGT TTATTATCAG CGGTGCCTGC GGGATGTTCC GTACTGATGT ATTGCGTAAG
TTCGGTTTCT CGGATCGTAC TAAAGTCGAA GACCTTGATC TCACCTGGAC ATTAGTGGCA
AACGGCTACC GTATTCGGCA GGCGAATCGC TGCATCGTAT ACCCACAGGA ATGCAACAGC
CCGCGTGAGG AGTGGCGTCG CTGGCGGCGT TGGATTGTGG GATACGCGGT CTGTATGCGC
CTGCATAAAA GACTTTTATT TAGCCGCTTC GGTATCTTCA GTATATTTCC TATGCTGTTG
GTTGTGCTTT ATGGCGTTGG GATTTATCTC ACTACCTGGT TTAATGAATT CATCACCACC
GGGCCGCATG GTGTGGTGTT GGCAATGTTT CCGCTTATCT GGATCGGCGT AGTTTGTGTT
ATTGGTGCTT TTAGCGCCTG GTTTCATCGT TGCTGGTTGT TGGTGCCTTT AGCGCCGCTT
TCCGTTGTGT ATGTATTATT AGCTTATGCC ATCTGGATTA TTTATGGACT TATTGCCTTT
TTTACTGGAC GCGAACCTCA GCGCGACAAA CCCACCCGCT ATTCCGCACT GGTGGAAGCG
TCAACCGCTT ATTCCCAACC TTCTGTCACA GGAACTGAAA AACTATCTGA AGCTTAA
 
Protein sequence
MKTWIFICMS IAMLLWFLST LRRKPSQKKG CIDAIIPAYN EGPCLAQSLD NLLRNPYFCR 
VICVNDGSTD NTEAVMAEVK RKWGDRFVAV TQKNTGKGGA LMNGLNYATC DQVFLSDADT
YVPPDQDGMG YMLAEIERGA DAVGGIPSTA LKGAGLLPHI RATVKLPMIV MKRTLQQLLG
GAPFIISGAC GMFRTDVLRK FGFSDRTKVE DLDLTWTLVA NGYRIRQANR CIVYPQECNS
PREEWRRWRR WIVGYAVCMR LHKRLLFSRF GIFSIFPMLL VVLYGVGIYL TTWFNEFITT
GPHGVVLAMF PLIWIGVVCV IGAFSAWFHR CWLLVPLAPL SVVYVLLAYA IWIIYGLIAF
FTGREPQRDK PTRYSALVEA STAYSQPSVT GTEKLSEA