Gene Information |
Locus tag | EcSMS35_0391 |
Symbol | |
ID | 6143894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 405115 |
End bp | 406311 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641615287 |
Product | glycosyl transferase, group 2 family protein |
Protein accession | YP_001742494 |
Protein GI | 170680221 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
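The start, end, gene length, and protein length listed above are consistent with one another. A minimal sketch of the arithmetic follows, assuming the coordinates are 1-based and inclusive (the record does not state the convention; the variable names are illustrative, not database fields):

```python
# Values copied from the Gene Information table above.
start_bp = 405115
end_bp = 406311

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in codons of 3 bp; the final codon is the stop
# codon and contributes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1197 0 398
```

Under that assumption the result matches the reported 1197 bp and 398 aa.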
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.105387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACCT GGATATTTAT CTGTATGGCC GTAGCAATAT TGCTATGGTT CCTGAGTACG TTAAGGCGTA AGCCCAGCCA AAAAAAAGGC TGTATTGACG CCATTATACC TGCTTATAAC GAAGGCCCGT GTCTGGCGCA GTCACTGGAT AATCTGCTGC GTAACCCTTA TTTTTGCCGG GTAATTTGCG TTAACGACGG CTCCACGGAC AATACCGAAG CGGTCATGGC GGAAGTCAAA CGCAAATGGG GCGACCGCTT TATTGCCGTC ACGCAAAAAA ATACCGGTAA AGGTGGTGCG CTGATGAATG GCCTCAATTA CGCCACCTGC GACCAGGTTT TTTTAAGTGA TGCCGACACC TATGTTCCGC CCGATCAAGA CGGAATGGGC TATATGCTGG CAGAAATAGA GCGCGGTGCT GATGCCGTAG GCGGCATTCC CTCTACTGCG TTGAAAGGCG CGGGTCTGTT ACCGCACATC CGCGCGACCG TAAAGTTGCC GATGATTGTT ATGAAGCGCA CGCTACAGCA GCTCCTGGGC GGCGCACCGT TTATTATCAG CGGTGCCTGC GGGATGTTCC GTACTGATGT ATTGCGTAAG TTCGGTTTCT CTGATCGTAC TAAAGTCGAA GACCTTGATC TCACCTGGAC ATTGGTGGCA AACGGCTATC GTATTCGGCA GGCGAATCGC TGCATCGTAT ACCCACAGGA ATGCAACAGC CCGCGTGAGG AATGGCGTCG CTGGCGGCGT TGGATTGTGG GCTACGCGGT CTGTATGCGC CTGCATAAAA GACTTTTATT TAGCCGCTTC GGTATCTTCA GTATATTTCC TATGCTGTTG GTTGTGCTTT ATGGCGTTGG GATTTATCTC ACTACCTGGT TTAATGAATT CATCACCACC GGGCCGCATA GTGTGGTGTT GGCAATGTTT CCGCTTATCT GGGTCGGCGT AGTTTGTGTT ATTGGTGCTT TTAGCGCCTG GTTTCATCGT TGCTGGTTGT TGGTGCCTTT AGCGCCGCTT TCCGTTGTGT ATGTATTATT AGCTTATGCC ATCTGGATTA TTTATGGACT TATTGCCTTT TTTACTGGAC GCGAACCTCA GCGCGACAAA CCCACCCGCT ATTCCGCACT GGTGGAAGCG TCAACCGCTT ATTCCCAACC TTCTGTCACA GGAACTGAAA AACTTTCTGA AGCTTAA
Protein sequence | MKTWIFICMA VAILLWFLST LRRKPSQKKG CIDAIIPAYN EGPCLAQSLD NLLRNPYFCR VICVNDGSTD NTEAVMAEVK RKWGDRFIAV TQKNTGKGGA LMNGLNYATC DQVFLSDADT YVPPDQDGMG YMLAEIERGA DAVGGIPSTA LKGAGLLPHI RATVKLPMIV MKRTLQQLLG GAPFIISGAC GMFRTDVLRK FGFSDRTKVE DLDLTWTLVA NGYRIRQANR CIVYPQECNS PREEWRRWRR WIVGYAVCMR LHKRLLFSRF GIFSIFPMLL VVLYGVGIYL TTWFNEFITT GPHSVVLAMF PLIWVGVVCV IGAFSAWFHR CWLLVPLAPL SVVYVLLAYA IWIIYGLIAF FTGREPQRDK PTRYSALVEA STAYSQPSVT GTEKLSEA
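The two sequences above can be cross-checked. Although the gene lies on the minus strand, the gene sequence as listed already reads in the coding direction (it begins with ATG and ends with TAA), so no reverse complement is needed. The sketch below is illustrative only: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is the standard NCBI layout, which translation table 11 shares for codon-to-amino-acid assignments (the two tables differ only in permitted start codons).

```python
from itertools import product

# Standard NCBI codon ordering (bases T, C, A, G). Assumption: table 11 uses
# these same amino-acid assignments and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()

def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence above, used here for illustration.
fragment = "ATGAAAACCT GGATATTTAT CTGTATGGCC"
print(translate(fragment))  # prints: MKTWIFICMA
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would let the whole record be checked the same way.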