Gene Information |
Locus tag | EcSMS35_2255 |
Symbol | |
ID | 6144538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2274063 |
End bp | 2276285 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641617131 |
Product | glycosyl transferase, group 1/glycosyl transferase, group 2 |
Protein accession | YP_001744304 |
Protein GI | 170682772 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
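
The length fields in this record follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein, the arithmetic can be checked like this (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information rows above.
start_bp = 2274063
end_bp = 2276285

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not
# translated, so the protein has one residue fewer than
# the number of codons.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2223, matching "Gene Length"
print(protein_length)  # 740, matching "Protein Length"
```

Both results agree with the Gene Length and Protein Length rows, which suggests the record uses inclusive coordinates.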
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA TTCTTATAAT GACGCCGGAC ATTGAGGGGC CTGTCCGTAA CGGCGGTATT GGTACTGCTT TCACTGCTCT TGCCACTACT TTGGCAAAAA AGGGGTATGA TGTTGATGTA TTGTATACAT GTGGCGACTA TTCTGAATCA TCTGTATCGA AATTTAGCGA CTGGTCACGT ATTTATAGTA CCTTTGGTAT CAATCTGCTA AGAACCGGAC TGATAAAAGA GATTAATATT GATGCACCGT ATTTTAGAAG GAAAAGTTAT TCAATTTATC TCTGGTTGAA AGAAAATAAC ATCTATGACA CTGTTATTTC TTGTGAGTGG CAGGCAGATC TTTATTACAC TTTATTAAGC AAAAAGAATG GAACGGATTT TGAAAATACA AAGTTCATTG TAAATACTCA CAGTTCAACG TTATGGGCTG ATGAAGGTAA TTACCAGCTT CCATATGATC AGAACCATCT TGAACTCTAT TATATGGAGA AAATGGTGGT TGAAATGGCG GATGAAGTTG TTAGTCCGTC TCAGTATTTA ATTGATTGGA TGTTGAGTAA GCACTGGAAT GTTCCTGAAG AACGTCATGT AATTTTAAAT TGCGAGCCAT TTCAAGGGTT TGTGACGAGA GATGATGTTA CAGTTAAAAT AAATGAAAAG CCAGCTTCTG GCGTTGAGCT TGTATTTTTC GGCCGCCTTG AAACCCGTAA AGGACTTGAC ATATTCCTGC GTGCATTAAG AAAACTATCT GATGAAGATA AAGAGAGCAT TTCTGGAGTA ACCTTCCTCG GAAAAAATGT CACCATGGGG AAAACTGATT CATTTACTTA TATTATGAAT CAGACTAAAA ATTTGGGACT CGCAGTTAAT GTCATCTGCG ACTATGATCG TACCAACGCT AATGAATATA TAAAAAGAAA AAATGTATTA GTCATCATTC CATCACTTGT AGAAAACTCA CCCTATACTG TTTATGAATG CTTGATTAAT AACGTTAATT TCCTCGCTTC AAACGTTGGT GGAATTCCAG AGCTTATTCA GCAGGAGCAT CATGCGGAAG TTCTATTTAT TCCTACACCT GTCGATTTAT ACTGGAAAAT CCACTATCGC TTAAAAAATA TAAATATAAA ACCAGGGCTT GCTGAATCAC AAGACAATAT TAAAGAAGCT TGGTTTGTCG CAGTTGAACG AAAAAACAAC CGCGCATTCA AGAAAATCGA TGAAGCTAAC AGCCCGTTAG TTAGCGTGTG TATAACTCAC TTCGAACGTC ACCATTTGCT TCAGCAAGCA CTCGCATCAA TAAAATCTCA GACGTACCAA AATATTGAGG TCATCTTGGT TGATGATGGA AGTACGACAG AAGATTCTCA TCGTTATTTA AATCTCATCG AGAATGATTT TAACTCTCGA GGCTGGAAAA TTGTCCGTAG TTCTAATAAC TATCTGGGTG CTGCAAGGAA TTTGGCTGCG CGACACGCCT CTGGCGAATA TCTGATGTTT ATGGACGATG ATAATGTTGC TAAGCCTTTT GAGGTAGAAA CGTTTGTTAC TGCAGCATTA AACTCTGGGG CCGATGTGTT AACCACACCA AGCGATCTTA TTTTTGGTGA GGAGTTCCCT TCTCCGTTCC GTAAAATGAC GCACTGCTGG CTTCCGTTAG GGCCTGATTT AAATATCGCC AGCTTTAGTA ACTGCTTTGG CGATGCTAAT GCGCTGATCA GAAAAGAGGT TTTTGAAAAA GTAGGCGGAT TTACTGAAGA TTACGGTTTA GGTCATGAAG ACTGGGAGTT TTTTGCCAAA 
ATATCATTAC AGGGATATAA ATTGCAAATC GTCCCGGAAC CTCTATTTTG GTATAGAGTT GCAAACTCCG GCATGTTGTT AAGTGGAAAT AAGAGTAAAA ATAACTACCG CAGTTTCCGT CCTTTTATGG ATGAGAATGT TAAATATAAC TATGCAATGG GGTTGATACC TTCCTACCTC GAGAAGATTC AAGAACTTGA GAGTGAAGTG AATCGCTTGC GGAGCATCAA TGGTGGTCAT TCTGTCAGTA ACGAGTTACA ACTTTTAAAT AATAAGGTTG ATGGTCTTAT TTCTCAGCAA AGAGATGGCT GGGCCCATGA CCGTTTTAAT GCTCTGTATG AAGCAATTCA TGTCCAAGGC GCAAAACGAG GCACCAGCCT GGTTCGCCGG GTTGCCCGGA AAGTGAAATC AATGTTAAAA TAA
|
Protein sequence | MKKILIMTPD IEGPVRNGGI GTAFTALATT LAKKGYDVDV LYTCGDYSES SVSKFSDWSR IYSTFGINLL RTGLIKEINI DAPYFRRKSY SIYLWLKENN IYDTVISCEW QADLYYTLLS KKNGTDFENT KFIVNTHSST LWADEGNYQL PYDQNHLELY YMEKMVVEMA DEVVSPSQYL IDWMLSKHWN VPEERHVILN CEPFQGFVTR DDVTVKINEK PASGVELVFF GRLETRKGLD IFLRALRKLS DEDKESISGV TFLGKNVTMG KTDSFTYIMN QTKNLGLAVN VICDYDRTNA NEYIKRKNVL VIIPSLVENS PYTVYECLIN NVNFLASNVG GIPELIQQEH HAEVLFIPTP VDLYWKIHYR LKNINIKPGL AESQDNIKEA WFVAVERKNN RAFKKIDEAN SPLVSVCITH FERHHLLQQA LASIKSQTYQ NIEVILVDDG STTEDSHRYL NLIENDFNSR GWKIVRSSNN YLGAARNLAA RHASGEYLMF MDDDNVAKPF EVETFVTAAL NSGADVLTTP SDLIFGEEFP SPFRKMTHCW LPLGPDLNIA SFSNCFGDAN ALIRKEVFEK VGGFTEDYGL GHEDWEFFAK ISLQGYKLQI VPEPLFWYRV ANSGMLLSGN KSKNNYRSFR PFMDENVKYN YAMGLIPSYL EKIQELESEV NRLRSINGGH SVSNELQLLN NKVDGLISQQ RDGWAHDRFN ALYEAIHVQG AKRGTSLVRR VARKVKSMLK
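
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a small sketch of that step together with a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample is only the first three blocks of the gene sequence, so the result is not expected to match the 39% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, used only as a demonstration.
sample = "ATGAAGAAAA TTCTTATAAT GACGCCGGAC"
print(len(clean_sequence(sample)))
print(round(gc_percent(sample), 1))
```

To check the full gene, paste the complete Gene sequence value into `gc_percent`. The same `clean_sequence` helper also works for the protein sequence before counting residues.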