Gene ECH74115_2981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2981 
Symbol 
ID6969738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2760279 
End bp2761673 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content54% 
IMG OID643386821 
Productputative UDP-glucose lipid carrier transferase 
Protein accessionYP_002271289 
Protein GI209400434 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0000341548 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAAATC TAAAAAAGCG CGAGCGAGCG AAAACCAATG CATCGTTAAT CTCTATGGTG 
CAACGCTTTT CAGATATCAC CATCATGTTT GCCGGACTAT GGCTGGTTTG CGAAGTGAGC
GGACTGTCAT TCCTCTACAT GCACCTGTTG GTGGCGCTGA TTACGCTGGT GGTGTTCCAG
ATGCTGGGCG GCATCACCGA TTTTTATCGC TCATGGCGCG GTGTTCGGGC AGCGACAGAA
TTTGCCCTGT TGCTACAAAA CTGGACCTTA AGCGTGATTT TCAGCGCCGG ACTGGTGGCG
TTCAACAATG ATTTCGACAC GCAACTGAAA ATCTGGCTGG CGTGGTATGG GCTGACCAGT
ATCGGACTGG TGGTTTGCCG TTCATGTATT CGCATTGGGG CGGGCTGGCT GCGTAATCAT
GGCTATAACA AGCGCATGGT CGCGGTGGCG GGGGATTTAG CCGCCGGGCA AATGCTGATG
GAGAGCTTCC GTAACCAGCC GTGGTTAGGG TTTGAAGTGG TGGGCGTTTA CCACGACCCG
AAACTGGGCG GCGTTTCTAA CGACTGGGCG GGTAACCTGC AACAGCTGGT CGAGGATGCT
AAAGCAGGCA AGATTCATAA CGTCTATATC GCGATGCAAA TGTGCGACGG CGCGCGAGTG
AAAAAACTGG TCCATCAACT GGCGGACACC ACCTGTTCGG TGCTGCTGAT CCCCGATGTC
TTTACCTTCA ACATTCTCCA TTCACGTCTT GAAGAGATGA ACGGCGTTCC GGTGGTGCCG
CTTTACGACA CGCCGCTTTC CGGGGTTAAC CGCCTGCTCA AACGTGCGGA AGACATTGTG
CTGGCGACGC TTATCCTGCT GCTGATCTCC CCGGTGCTGT GCTGCATTGC GCTGGCGGTG
AAACTCAGTT CACCTGGGCC GGTTATTTTC CGCCAGACTC GCTACGGCAT GGATGGCAAG
CCGATCAAAG TGTGGAAGTT CCGTTCCATG AAAGTGATGG AGAACGACAA AGTGGTGACC
CAGGCGACGC AGAACGATCC GCGCGTCACC AAAGTGGGGA ACTTTCTGCG CCGCACCTCG
CTGGATGAAT TGCCGCAGTT TATCAATGTG CTGACCGGGG GGATGTCGAT TGTCGGTCCA
CGTCCGCACG CGGTGGCGCA TAACGAACAG TATCGACAGC TCATTGAAGG CTACATGCTG
CGTCATAAGG TGAAACCGGG CATTACCGGC TGGGCGCAGA TTAACGGCTG GCGCGGCGAA
ACCGACACGC TGGAGAAAAT GGAAAAACGC GTCGAGTTCG ACCTTGAGTA CATCCGCGAA
TGGAGCGTCT GGTTCGATAT CAAAATCGTT TTCCTGACGG TATTCAAAGG ATTCGTTAAC
AAAGCGGCAT ATTGA
 
Protein sequence
MTNLKKRERA KTNASLISMV QRFSDITIMF AGLWLVCEVS GLSFLYMHLL VALITLVVFQ 
MLGGITDFYR SWRGVRAATE FALLLQNWTL SVIFSAGLVA FNNDFDTQLK IWLAWYGLTS
IGLVVCRSCI RIGAGWLRNH GYNKRMVAVA GDLAAGQMLM ESFRNQPWLG FEVVGVYHDP
KLGGVSNDWA GNLQQLVEDA KAGKIHNVYI AMQMCDGARV KKLVHQLADT TCSVLLIPDV
FTFNILHSRL EEMNGVPVVP LYDTPLSGVN RLLKRAEDIV LATLILLLIS PVLCCIALAV
KLSSPGPVIF RQTRYGMDGK PIKVWKFRSM KVMENDKVVT QATQNDPRVT KVGNFLRRTS
LDELPQFINV LTGGMSIVGP RPHAVAHNEQ YRQLIEGYML RHKVKPGITG WAQINGWRGE
TDTLEKMEKR VEFDLEYIRE WSVWFDIKIV FLTVFKGFVN KAAY