Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2981 |
Symbol | |
ID | 6969738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2760279 |
End bp | 2761673 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386821 |
Product | putative UDP-glucose lipid carrier transferase |
Protein accession | YP_002271289 |
Protein GI | 209400434 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0000341548 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAAATC TAAAAAAGCG CGAGCGAGCG AAAACCAATG CATCGTTAAT CTCTATGGTG CAACGCTTTT CAGATATCAC CATCATGTTT GCCGGACTAT GGCTGGTTTG CGAAGTGAGC GGACTGTCAT TCCTCTACAT GCACCTGTTG GTGGCGCTGA TTACGCTGGT GGTGTTCCAG ATGCTGGGCG GCATCACCGA TTTTTATCGC TCATGGCGCG GTGTTCGGGC AGCGACAGAA TTTGCCCTGT TGCTACAAAA CTGGACCTTA AGCGTGATTT TCAGCGCCGG ACTGGTGGCG TTCAACAATG ATTTCGACAC GCAACTGAAA ATCTGGCTGG CGTGGTATGG GCTGACCAGT ATCGGACTGG TGGTTTGCCG TTCATGTATT CGCATTGGGG CGGGCTGGCT GCGTAATCAT GGCTATAACA AGCGCATGGT CGCGGTGGCG GGGGATTTAG CCGCCGGGCA AATGCTGATG GAGAGCTTCC GTAACCAGCC GTGGTTAGGG TTTGAAGTGG TGGGCGTTTA CCACGACCCG AAACTGGGCG GCGTTTCTAA CGACTGGGCG GGTAACCTGC AACAGCTGGT CGAGGATGCT AAAGCAGGCA AGATTCATAA CGTCTATATC GCGATGCAAA TGTGCGACGG CGCGCGAGTG AAAAAACTGG TCCATCAACT GGCGGACACC ACCTGTTCGG TGCTGCTGAT CCCCGATGTC TTTACCTTCA ACATTCTCCA TTCACGTCTT GAAGAGATGA ACGGCGTTCC GGTGGTGCCG CTTTACGACA CGCCGCTTTC CGGGGTTAAC CGCCTGCTCA AACGTGCGGA AGACATTGTG CTGGCGACGC TTATCCTGCT GCTGATCTCC CCGGTGCTGT GCTGCATTGC GCTGGCGGTG AAACTCAGTT CACCTGGGCC GGTTATTTTC CGCCAGACTC GCTACGGCAT GGATGGCAAG CCGATCAAAG TGTGGAAGTT CCGTTCCATG AAAGTGATGG AGAACGACAA AGTGGTGACC CAGGCGACGC AGAACGATCC GCGCGTCACC AAAGTGGGGA ACTTTCTGCG CCGCACCTCG CTGGATGAAT TGCCGCAGTT TATCAATGTG CTGACCGGGG GGATGTCGAT TGTCGGTCCA CGTCCGCACG CGGTGGCGCA TAACGAACAG TATCGACAGC TCATTGAAGG CTACATGCTG CGTCATAAGG TGAAACCGGG CATTACCGGC TGGGCGCAGA TTAACGGCTG GCGCGGCGAA ACCGACACGC TGGAGAAAAT GGAAAAACGC GTCGAGTTCG ACCTTGAGTA CATCCGCGAA TGGAGCGTCT GGTTCGATAT CAAAATCGTT TTCCTGACGG TATTCAAAGG ATTCGTTAAC AAAGCGGCAT ATTGA
|
Protein sequence | MTNLKKRERA KTNASLISMV QRFSDITIMF AGLWLVCEVS GLSFLYMHLL VALITLVVFQ MLGGITDFYR SWRGVRAATE FALLLQNWTL SVIFSAGLVA FNNDFDTQLK IWLAWYGLTS IGLVVCRSCI RIGAGWLRNH GYNKRMVAVA GDLAAGQMLM ESFRNQPWLG FEVVGVYHDP KLGGVSNDWA GNLQQLVEDA KAGKIHNVYI AMQMCDGARV KKLVHQLADT TCSVLLIPDV FTFNILHSRL EEMNGVPVVP LYDTPLSGVN RLLKRAEDIV LATLILLLIS PVLCCIALAV KLSSPGPVIF RQTRYGMDGK PIKVWKFRSM KVMENDKVVT QATQNDPRVT KVGNFLRRTS LDELPQFINV LTGGMSIVGP RPHAVAHNEQ YRQLIEGYML RHKVKPGITG WAQINGWRGE TDTLEKMEKR VEFDLEYIRE WSVWFDIKIV FLTVFKGFVN KAAY
|
| |