Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2266 |
Symbol | |
ID | 6144263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2287536 |
End bp | 2289428 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641617141 |
Product | glycosyl transferase family protein |
Protein accession | YP_001744314 |
Protein GI | 170679598 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGTA TCAAAATTTA TACATGTCAC CACAAACCCA GTGCTTTTCT AAATGCCTCT ATTATTAAAC CACTGCATGT CGGGAAGGCT AATACTTATA ATGATATTGG CTGTGAAGGG GATGATAGCG GAGACAACAT TTCCTTTAAA AACCCCTTCT ATTGTGAGCT GACGGCCCAC TACTGGGTGT GGAAGAATGA ATCTCTGGCT GACTATGTGG GATTCATGCA TTACCGCAGA CACTTGAACT TTGCAGAACA ACAAAATCAT CCTGAGGATA ACTGGGGGGT AGTCAACTAC CCGCTCATCA ATGCCGAATA CGAAAGCCAG TTTGGATTAA GTGATGAATC CATAAGCACC TGCGTCGATG GATATGATCT GCTGTTACCC AAAAAATGGT CGGTAACATC GGCAGGAAGT AAGAATAACC TTGATCATTA TGCAAAAGGT GAGTTTTTAC ACATTAAAGA CTATCAGTCT GCCCTGGATG TCGTTGAAGA GCTGTATCCA CAATATAAAG CTGCGATTCA ACAATTCAAT AATGCAACTG ATGGTTATTA TACGAACATG TTTGTTATGC GCAAAGACAT GTTCCTGGAT TATTCTGAAT GGCTTTTTGC TATTTTGTCT AATCTTGAAG ATCGGATTTC GATGAATAAC TACAACGCTC AAGAAAAACG AGTTATAGGA CACATTGCTG AGCGTTTATT CAATATCTAT ATCATCAAAT GTCAACAAGA TAAACAGCTA AAAATAAAAG AACTACAGCG CACGTTTGTT ACTGCTGAAA CATTTAATGG TAAATTAAAG CCCGTTTTTG ACGAAAGCGT ACCGGTTGTT ATTAGTTTTG ATAATAACTA TGCATTAAGT GGCGGGGCAT TAATCAATTC AATTGTTCTC CATTCTGATG CGAGCAGAAA CTATGATATC GTTGTTCTGG AAAATAAGGT CAGTCATTTA AATAAACAAC GCCTTATCAA GCTAGTTGCT GGTCATAACA ACATATCATT GCGCTTTTTT GATGTGAATT CATTCACTGA GATGAGTGAT GTTCATACCC GTGCACATTT TAGTGCGTCG ACCTATGCGC GCTTGTTCAT CCCGCAACTT TTCCGCGAGT ATAAAAAAGT TGTGTTTATC GATTCTGACA CCGTGGTGAA GGCTGATTTG GCGACACTTC TGGATGTCGA GATCGGCACT AACCTGGTTG CCGCTGTTAA AGACATCGTC ATGGAAGGGT TTGTGAAGTT TGGTACCATG TCAGAATCAG ATGATGGCAT TATGCCTGCA GAGCAATACC TGAAAAAGAC ATTAGGAATG ACTAATCCTG ACGAATATTT TCAGGCCGGG ATTATTGTTT TTAATGTCGA ACAAATGGTT ACAGAGAATA CCTTTGCTCA ATTGATGTCA GCATTGAAAG CCAAAAAGTA TTGGTTCTTA GATCAGGATA TCATGAACAA AGTCTTCTTT GGCCGAGTCA AATTCTTACC ATTAGAATGG AACGTGTACC ATGGTAATGG TAATACCGAT GATTTCTTCC CGAATCTCAA GTTTTCAACC TATATGCGCT TTTTGCAAGC CAGAAGAAAT CCAAAAATGA TTCACTATGC GGGTGAAAAC AAACCATGGA ATACTGAGAA AGTCGATTTC TATGATGATT TTCTTGAGAA TGTTTTAAGT ACGCCATGGG AGAAAGAAAT CTACTATCGC CAGTTACCTG TGGCCACGGT AGTACCTAAC CAACATACTG AACTGCAGCA AACCGTGTTA CTGCAGACAA AGATTAAAAG AGCTTTAATG CCATATGTTA ACAAATATGC TCCTGTCGGT TCGCCAAGAA GAAATAAGCT CATCAAATAT TATTATAAAG TTCGCCGTTC GATTCTTGGC TAA
|
Protein sequence | MNSIKIYTCH HKPSAFLNAS IIKPLHVGKA NTYNDIGCEG DDSGDNISFK NPFYCELTAH YWVWKNESLA DYVGFMHYRR HLNFAEQQNH PEDNWGVVNY PLINAEYESQ FGLSDESIST CVDGYDLLLP KKWSVTSAGS KNNLDHYAKG EFLHIKDYQS ALDVVEELYP QYKAAIQQFN NATDGYYTNM FVMRKDMFLD YSEWLFAILS NLEDRISMNN YNAQEKRVIG HIAERLFNIY IIKCQQDKQL KIKELQRTFV TAETFNGKLK PVFDESVPVV ISFDNNYALS GGALINSIVL HSDASRNYDI VVLENKVSHL NKQRLIKLVA GHNNISLRFF DVNSFTEMSD VHTRAHFSAS TYARLFIPQL FREYKKVVFI DSDTVVKADL ATLLDVEIGT NLVAAVKDIV MEGFVKFGTM SESDDGIMPA EQYLKKTLGM TNPDEYFQAG IIVFNVEQMV TENTFAQLMS ALKAKKYWFL DQDIMNKVFF GRVKFLPLEW NVYHGNGNTD DFFPNLKFST YMRFLQARRN PKMIHYAGEN KPWNTEKVDF YDDFLENVLS TPWEKEIYYR QLPVATVVPN QHTELQQTVL LQTKIKRALM PYVNKYAPVG SPRRNKLIKY YYKVRRSILG
|
| |