Gene ECH74115_4999 details

Gene Information

Locus tag: ECH74115_4999
Symbol:
ID: 6968589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4651071
End bp: 4652078
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 31%
IMG OID: 643388680
Product: lipopolysaccharide 1,3-galactosyltransferase
Protein accession: YP_002273107
Protein GI: 209396863
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
TIGRFAM ID:
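
The coordinate and length fields in this record can be cross-checked against each other. The Python sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4651071
end_bp = 4652078
gene_length_bp = 1008
protein_length_aa = 335


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for the values listed in this record.
print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Under these assumptions the listed gene length (1008 bp) and protein length (335 aa) agree with the start and end coordinates.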


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.13851
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCAAC TCAATGATAG TGACATCATC CTTTTTGAGT ATAATTTTCA TTATCAAAAT 
ATAAGATCTA AAAATACTCT TGATATAGCA TTTGGTATTG ACAGAAATTT TCTTTTTGGA
TGTGGTGTAG CCATCGCATC TATTCTATTA AACAATAGAG AAATCTCTTG TGAATTTCAT
GTTTTCACAG ATTATATCAG TGATAAAGAC AAATTATATT TTTCTGATTT AGCAAAACAA
TATAATTCAA GAATTAATAT TTATGTTATC AATTGTGATA AGCTGAAGTC ATTACCAAGC
ACGAAAAACT GGACTTACGC AACATATTTT CGATTTATAA TTGCAGATTA TTTCTATCAT
AAACATGAAA AAATACTATA TCTTGATGCA GATATTGCTT GCAAGGGTAG TATTAAAGAA
CTCTTAGATT ATCAATTTTC TACTAATGAA ATTGCCGCAG TTGTAGCTGA AAGAGATGTT
GAATGGTGGC AAAATCGAGC CTCGGTATTA ACTACACCAC AGTTAGCTTC TGGATATTTC
AATGCTGGTT TTTTACTGAT AAATATTGAT GAGTGGAATC TAAATAACAT TTCGTCAAAA
GCTATTGAAA TGTTGCGTGA CCCAGATTGG GTAAGTAAAA TCACCCACCT TGATCAAGAT
GTACTGAATG TATTATTGAA TGGTAAAGTG AAGTTTATTT CGGAGAAATA TAATACCCGA
TATAGTATTA ACTATGAATT AAAAGACAAA GTTGATAATC CAGTCAATGA TGACACCGTG
TTTATACACT ATGTGGGACC TACAAAACCT TGGCATGAGT GGGCTGACTA TCCGGTGTCA
CGTAGTTTTT TGATCGCCAA AGCAGCTTCT CCGTGGAGTA AAGAAGATTT ACTTAAACCT
GTAAATAGCA ATCAGTATCG GTATTGTGCA AAACATAAAT TTAAACAAAA GCATTATATG
GCAGGCATTT TTAATTATTT AAAGTATTAT AAAGAAAAAT GCTTCTAA
 
Protein sequence
MSQLNDSDII LFEYNFHYQN IRSKNTLDIA FGIDRNFLFG CGVAIASILL NNREISCEFH 
VFTDYISDKD KLYFSDLAKQ YNSRINIYVI NCDKLKSLPS TKNWTYATYF RFIIADYFYH
KHEKILYLDA DIACKGSIKE LLDYQFSTNE IAAVVAERDV EWWQNRASVL TTPQLASGYF
NAGFLLINID EWNLNNISSK AIEMLRDPDW VSKITHLDQD VLNVLLNGKV KFISEKYNTR
YSINYELKDK VDNPVNDDTV FIHYVGPTKP WHEWADYPVS RSFLIAKAAS PWSKEDLLKP
VNSNQYRYCA KHKFKQKHYM AGIFNYLKYY KEKCF
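
The sequence blocks above are printed in groups of ten characters separated by spaces. As a minimal sketch of how they could be processed, the Python below strips the grouping, computes a GC fraction, and translates DNA with the standard codon assignments, which translation table 11 shares with table 1 for elongation (the two tables differ only in the permitted start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence, copied from this record.
first_line = "ATGTCTCAAC TCAATGATAG TGACATCATC CTTTTTGAGT ATAATTTTCA TTATCAAAAT"
dna = clean_sequence(first_line)
print(translate(dna))  # MSQLNDSDIILFEYNFHYQN
```

The 20 residues printed correspond to the first two blocks of the protein sequence above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.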