Gene ECH74115_1264 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1264 
SymbolpgaC 
ID6967460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1271750 
End bp1272988 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content47% 
IMG OID643385254 
ProductN-glycosyltransferase 
Protein accessionYP_002269749 
Protein GI209395977 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.498109 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGGT TCGTTTTCTT CTGGCCGTTT TTTATGTCCA TTATGTGGAT TGTTGGCGGC 
GTCTATTTCT GGGTCTATCG TGAACGCCGC TGGCCGTGGG GAGAAAACGC ACCAGCTCCC
CAGTTGAAAG ATAATCCGTC TATCTCCATT ATCATTCCCT GTTTTAATGA GGAGAAAAAC
GTTGAGGAAA CCATACACGC CGCTTTAGCA CAGCGTTATG AGAACATTGA AGTTATTGCC
GTAAATGACG GTTCAACAGA TAAAACCCGT GCCATCCTGG ATCGCATGGC TGCACAAATT
CCCCATTTGC GGGTCATTCA TCTGGCGCAA AACCAGGGGA AAGCCATTGC GCTTAAAACC
GGAGCTGCCG CGGCGAAAAG TGAATATCTG GTGTGCATTG ATGGCGATGC GTTATTAGAC
CGCGATGCGG CGGCATATAT TGTGGAACCG ATGTTGTACA ACCCGCGTGT GGGTGCCGTA
ACCGGTAATC CTCGTATTCG AACACGTTCT ACCCTGGTGG GTAAAATTCA GGTTGGCGAG
TATTCCTCAA TTATTGGTTT GATCAAGCGA ACCCAGCGTA TCTATGGAAA CGTATTTACC
GTTTCCGGTG TTATTGCCGC ATTTCGTCGC AGCGCCCTGG CAGAAGTGGG TTACTGGAGT
GACGATATGA TCACCGAAGA TATTGATATT AGCTGGAAGC TGCAGTTGAA TCAGTGGACG
ATTTTTTACG AGCCACGGGC ACTGTGCTGG ATATTAATGC CTGAAACGTT AAAAGGGCTG
TGGAAACAGC GCCTGCGCTG GGCTCAGGGC GGTGCAGAAG TATTCCTCAA AAATATGACA
AGGTTGTGGC GCAAAGAAAA CTTTCGAATG TGGCCGCTGT TTTTTGAATA CAGCCTGACG
ACAATATGGG CCTTCACCTG CCTGGTCGGT TTCATTATTT ACGCAGTCCA ACTTGCCGGT
GTACCGTTAA ATATTGAATT GACACATATC GCTGCGACAC ATACTGCCGG AATATTATTG
TGTACGTTAT GTTTACTGCA ATTTATTGTC AGCCTGATGA TCGAGAATCG CTATGAGCAT
AATCTGACTT CATCGCTTTT CTGGATTATT TGGTTCCCGG TTATTTTCTG GATGCTGAGC
CTGGCAACGA CATTGGTATC ATTTACACGA GTCATGTTGA TGCCTAAAAA GCAACGCGCC
CGTTGGGTAA GTCCCGATCG CGGGATTCTG AGAGGTTAA
 
Protein sequence
MMRFVFFWPF FMSIMWIVGG VYFWVYRERR WPWGENAPAP QLKDNPSISI IIPCFNEEKN 
VEETIHAALA QRYENIEVIA VNDGSTDKTR AILDRMAAQI PHLRVIHLAQ NQGKAIALKT
GAAAAKSEYL VCIDGDALLD RDAAAYIVEP MLYNPRVGAV TGNPRIRTRS TLVGKIQVGE
YSSIIGLIKR TQRIYGNVFT VSGVIAAFRR SALAEVGYWS DDMITEDIDI SWKLQLNQWT
IFYEPRALCW ILMPETLKGL WKQRLRWAQG GAEVFLKNMT RLWRKENFRM WPLFFEYSLT
TIWAFTCLVG FIIYAVQLAG VPLNIELTHI AATHTAGILL CTLCLLQFIV SLMIENRYEH
NLTSSLFWII WFPVIFWMLS LATTLVSFTR VMLMPKKQRA RWVSPDRGIL RG