Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2263 |
Symbol | |
ID | 6145742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2284330 |
End bp | 2285463 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641617138 |
Product | glycosyl transferase, group 1 family protein |
Protein accession | YP_001744311 |
Protein GI | 170679591 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TGTGTTATTT CATAAATTCG GATTGGTATT TTGATTTGCA TTGGACCGAT CGTGCAATTG CCGCCCGAGA TGCCGGTTAT GAGATTCACA TTATTAGCCA TTTTGTCGAT GATAAAATAG CGGAAAAATT CAGGACATTA GGTTTTGTTT GCCATAACAT TCCACTTGTC GCCCAATCAT TCAACGTTTT GATTTTCTTT CGGGCCTTTT CTAAGGCTCG GAAAATTATT CAAAACATCA ATCCAGATTT GTTGCACTGC ATTACCATCA AGCCCTGTTT GATCGGCGGT TTTCTGGCTA AAAGTACCCA TCGTCCTGTT ATTCTGAGTT TTGTCGGTTT GGGCCGAGTA TTTTCCGCAG AGTCTGCCTG TCTTAAGCTG CTGCGAAGTT TTACTGTTAT GGCATATAAG TATATCGCCA GTAACAAATG CAGTTTGTTT ATGTTCGAAC ATGATAAAGA CAGAGCTAAA CTTGCCGATC TGGTTGGTAT CGATTACAAA CAGACTATTG TTATTGATGG CGCGGGTATT AATCCAGAGA TTTACAAATA CTCTCTGGAG CAGCAGCGTG ATGTTCCGGT CGTCCTTTTT GCCAGCCGTA TGCTGTGGAG TAAAGGACTG GGTGACCTGA TTGAAGCCAA AAAAATACTG AGTAATAAAA ATATTCACTT TACGCTGAAT GTTGCCGGTA TTTTAGTTGA GAATGATAAA GACGCAATTC CGCTGGCGAC GATACAGAAG TGGCAAAGCG AAGGCGTGAT TAACTGGCTC GGTCATTGCT CTAATGTATT TGATTTAATT GAAGAATCAA ATATCGTTGC TTTGCCGTCG GTCTACGCCG AAGGCGTACC GCGTATCTTG CTGGAAGCTT CCTCTGTCGG GCGCGCTTGT ATCGCTTATG ATGTTGGTGG CTGTGATAGC TTAATTATCA ATAACTATAA TGGGTTGATT GTAAAAAGTA AATCTGTCGA GGAATTAGCG GAGAAACTCG GTTTCCTGTT GGATAACCCA GAAACGCGCG TCGCAATGGG TATCAATGGC AGAAAACGCA TTCAAGATAA ATTCTCGAGT GTGATGATCA TTAATAAAAC ATTAAAAACA TATCGTGATG TTGTTGAAGA GTAA
|
Protein sequence | MKKLCYFINS DWYFDLHWTD RAIAARDAGY EIHIISHFVD DKIAEKFRTL GFVCHNIPLV AQSFNVLIFF RAFSKARKII QNINPDLLHC ITIKPCLIGG FLAKSTHRPV ILSFVGLGRV FSAESACLKL LRSFTVMAYK YIASNKCSLF MFEHDKDRAK LADLVGIDYK QTIVIDGAGI NPEIYKYSLE QQRDVPVVLF ASRMLWSKGL GDLIEAKKIL SNKNIHFTLN VAGILVENDK DAIPLATIQK WQSEGVINWL GHCSNVFDLI EESNIVALPS VYAEGVPRIL LEASSVGRAC IAYDVGGCDS LIINNYNGLI VKSKSVEELA EKLGFLLDNP ETRVAMGING RKRIQDKFSS VMIINKTLKT YRDVVEE
|
| |