Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1015 |
Symbol | |
ID | 6147317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1034010 |
End bp | 1035404 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615902 |
Product | putative UDP-glucose lipid carrier transferase |
Protein accession | YP_001743094 |
Protein GI | 170682437 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.742651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAATC TAAAAAAGCG CGAGCGAGCG AAAACCAATG CATCGTTAAT CTCTATGGTG CAACGCTTTT CAGATATCAC CATCATGTTT GCCGGACTAT GGCTGGTTTG CGAAGTCAGC GGACTGTCAT TCCTCTACAT GCACCTGTTG GTGGCACTGA TTACGCTGGT GGTGTTCCAG ATGCTGGGCG GCATCACCGA TTTTTATCGC TCATGGCGCG GTGTTCGGGC AGCGACAGAA TTTGCCCTGT TGCTACAAAA CTGGACCTTA AGCGTGATTT TCAGCGCTGG ACTGGTGGCG TTCAACAATG ATTTCGACAC GCAACTGAAA ATCTGGCTGG CGTGGTATGG GCTGACCAGT ATCGGACTGG TGGTTTGCCG TTCATGTATT CGCATTGGGG CGGGCTGGCT GCGTAATCAT GGCTATAACA AGCGCATGGT CGCGGTGGCG GGGGATTTAG CCGCCGGACA AATGCTGATG GAGAGCTTCC GTAATCAGCC GTGGTTAGGG TTTGAAGTGG TGGGCGTATA CCACGACCCA AAACCGGGCG GCGTTTCTAA CGACTGGGCG GGCAATCTGC AACAACTGGT CGAGGATGCG AAAGCGGGCA AGATTCATAA CGTCTATATC GCGATGCAGA TGTGCGACGG CGCGCGAGTG AAAAAACTGG TCCATCAACT GGCGGACACC ACCTGTTCGG TGCTGCTGAT CCCCGATGTC TTTACCTTCA ACATTCTCCA TTCACGCCTC GAAGAGATGA ATGGCGTACC GGTGGTGCCG CTGTATGACA CGCCACTTTC CGGGGTTAAC CGCCTGATAA AACGTGCGGA AGACATTGTG CTGGCGACGC TTATCCTGCT GCTGATCTCT CCGGTGCTGT GCTGTATTGC GCTGGCGGTG AAACTCAGTT CGCCAGGGCC GGTTATTTTC CGCCAGACTC GCTACGGCAT GGATGGCAAA CCGATCAAAG TGTGGAAGTT CCGTTCGATG AAAGTGATGG AGAACGACAA AGTGGTGACT CAGGCGACGC AGAACGATCC GCGCGTCACC AAAGTGGGGA ACTTTCTGCG CCGCACCTCG CTGGATGAAT TGCCGCAGTT TATCAATGTG CTGACCGGCG GGATGTCGAT TGTCGGGCCA CGTCCACACG CGGTGGCGCA TAACGAACAG TATCGACAGC TCATTGAAGG CTACATGCTG CGCCATAAGG TGAAACCGGG CATTACCGGC TGGGCGCAGA TTAACGGCTG GCGCGGCGAA ACCGACACGC TGGAGAAAAT GGAAAAACGC GTCGAGTTCG ACCTTGAGTA CATCCGCGAA TGGAGCGTCT GGTTCGATAT CAAAATCGTT TTCCTGACGG TATTCAAAGG TTTCGTTAAC AAAGCGGCAT ATTGA
|
Protein sequence | MTNLKKRERA KTNASLISMV QRFSDITIMF AGLWLVCEVS GLSFLYMHLL VALITLVVFQ MLGGITDFYR SWRGVRAATE FALLLQNWTL SVIFSAGLVA FNNDFDTQLK IWLAWYGLTS IGLVVCRSCI RIGAGWLRNH GYNKRMVAVA GDLAAGQMLM ESFRNQPWLG FEVVGVYHDP KPGGVSNDWA GNLQQLVEDA KAGKIHNVYI AMQMCDGARV KKLVHQLADT TCSVLLIPDV FTFNILHSRL EEMNGVPVVP LYDTPLSGVN RLIKRAEDIV LATLILLLIS PVLCCIALAV KLSSPGPVIF RQTRYGMDGK PIKVWKFRSM KVMENDKVVT QATQNDPRVT KVGNFLRRTS LDELPQFINV LTGGMSIVGP RPHAVAHNEQ YRQLIEGYML RHKVKPGITG WAQINGWRGE TDTLEKMEKR VEFDLEYIRE WSVWFDIKIV FLTVFKGFVN KAAY
|
| |