Gene EcSMS35_2255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2255 
Symbol 
ID6144538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2274063 
End bp2276285 
Gene Length2223 bp 
Protein Length740 aa 
Translation table11 
GC content39% 
IMG OID641617131 
Productglycosyl transferase, group 1/glycosyl transferase, group 2 
Protein accessionYP_001744304 
Protein GI170682772 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA TTCTTATAAT GACGCCGGAC ATTGAGGGGC CTGTCCGTAA CGGCGGTATT 
GGTACTGCTT TCACTGCTCT TGCCACTACT TTGGCAAAAA AGGGGTATGA TGTTGATGTA
TTGTATACAT GTGGCGACTA TTCTGAATCA TCTGTATCGA AATTTAGCGA CTGGTCACGT
ATTTATAGTA CCTTTGGTAT CAATCTGCTA AGAACCGGAC TGATAAAAGA GATTAATATT
GATGCACCGT ATTTTAGAAG GAAAAGTTAT TCAATTTATC TCTGGTTGAA AGAAAATAAC
ATCTATGACA CTGTTATTTC TTGTGAGTGG CAGGCAGATC TTTATTACAC TTTATTAAGC
AAAAAGAATG GAACGGATTT TGAAAATACA AAGTTCATTG TAAATACTCA CAGTTCAACG
TTATGGGCTG ATGAAGGTAA TTACCAGCTT CCATATGATC AGAACCATCT TGAACTCTAT
TATATGGAGA AAATGGTGGT TGAAATGGCG GATGAAGTTG TTAGTCCGTC TCAGTATTTA
ATTGATTGGA TGTTGAGTAA GCACTGGAAT GTTCCTGAAG AACGTCATGT AATTTTAAAT
TGCGAGCCAT TTCAAGGGTT TGTGACGAGA GATGATGTTA CAGTTAAAAT AAATGAAAAG
CCAGCTTCTG GCGTTGAGCT TGTATTTTTC GGCCGCCTTG AAACCCGTAA AGGACTTGAC
ATATTCCTGC GTGCATTAAG AAAACTATCT GATGAAGATA AAGAGAGCAT TTCTGGAGTA
ACCTTCCTCG GAAAAAATGT CACCATGGGG AAAACTGATT CATTTACTTA TATTATGAAT
CAGACTAAAA ATTTGGGACT CGCAGTTAAT GTCATCTGCG ACTATGATCG TACCAACGCT
AATGAATATA TAAAAAGAAA AAATGTATTA GTCATCATTC CATCACTTGT AGAAAACTCA
CCCTATACTG TTTATGAATG CTTGATTAAT AACGTTAATT TCCTCGCTTC AAACGTTGGT
GGAATTCCAG AGCTTATTCA GCAGGAGCAT CATGCGGAAG TTCTATTTAT TCCTACACCT
GTCGATTTAT ACTGGAAAAT CCACTATCGC TTAAAAAATA TAAATATAAA ACCAGGGCTT
GCTGAATCAC AAGACAATAT TAAAGAAGCT TGGTTTGTCG CAGTTGAACG AAAAAACAAC
CGCGCATTCA AGAAAATCGA TGAAGCTAAC AGCCCGTTAG TTAGCGTGTG TATAACTCAC
TTCGAACGTC ACCATTTGCT TCAGCAAGCA CTCGCATCAA TAAAATCTCA GACGTACCAA
AATATTGAGG TCATCTTGGT TGATGATGGA AGTACGACAG AAGATTCTCA TCGTTATTTA
AATCTCATCG AGAATGATTT TAACTCTCGA GGCTGGAAAA TTGTCCGTAG TTCTAATAAC
TATCTGGGTG CTGCAAGGAA TTTGGCTGCG CGACACGCCT CTGGCGAATA TCTGATGTTT
ATGGACGATG ATAATGTTGC TAAGCCTTTT GAGGTAGAAA CGTTTGTTAC TGCAGCATTA
AACTCTGGGG CCGATGTGTT AACCACACCA AGCGATCTTA TTTTTGGTGA GGAGTTCCCT
TCTCCGTTCC GTAAAATGAC GCACTGCTGG CTTCCGTTAG GGCCTGATTT AAATATCGCC
AGCTTTAGTA ACTGCTTTGG CGATGCTAAT GCGCTGATCA GAAAAGAGGT TTTTGAAAAA
GTAGGCGGAT TTACTGAAGA TTACGGTTTA GGTCATGAAG ACTGGGAGTT TTTTGCCAAA
ATATCATTAC AGGGATATAA ATTGCAAATC GTCCCGGAAC CTCTATTTTG GTATAGAGTT
GCAAACTCCG GCATGTTGTT AAGTGGAAAT AAGAGTAAAA ATAACTACCG CAGTTTCCGT
CCTTTTATGG ATGAGAATGT TAAATATAAC TATGCAATGG GGTTGATACC TTCCTACCTC
GAGAAGATTC AAGAACTTGA GAGTGAAGTG AATCGCTTGC GGAGCATCAA TGGTGGTCAT
TCTGTCAGTA ACGAGTTACA ACTTTTAAAT AATAAGGTTG ATGGTCTTAT TTCTCAGCAA
AGAGATGGCT GGGCCCATGA CCGTTTTAAT GCTCTGTATG AAGCAATTCA TGTCCAAGGC
GCAAAACGAG GCACCAGCCT GGTTCGCCGG GTTGCCCGGA AAGTGAAATC AATGTTAAAA
TAA
 
Protein sequence
MKKILIMTPD IEGPVRNGGI GTAFTALATT LAKKGYDVDV LYTCGDYSES SVSKFSDWSR 
IYSTFGINLL RTGLIKEINI DAPYFRRKSY SIYLWLKENN IYDTVISCEW QADLYYTLLS
KKNGTDFENT KFIVNTHSST LWADEGNYQL PYDQNHLELY YMEKMVVEMA DEVVSPSQYL
IDWMLSKHWN VPEERHVILN CEPFQGFVTR DDVTVKINEK PASGVELVFF GRLETRKGLD
IFLRALRKLS DEDKESISGV TFLGKNVTMG KTDSFTYIMN QTKNLGLAVN VICDYDRTNA
NEYIKRKNVL VIIPSLVENS PYTVYECLIN NVNFLASNVG GIPELIQQEH HAEVLFIPTP
VDLYWKIHYR LKNINIKPGL AESQDNIKEA WFVAVERKNN RAFKKIDEAN SPLVSVCITH
FERHHLLQQA LASIKSQTYQ NIEVILVDDG STTEDSHRYL NLIENDFNSR GWKIVRSSNN
YLGAARNLAA RHASGEYLMF MDDDNVAKPF EVETFVTAAL NSGADVLTTP SDLIFGEEFP
SPFRKMTHCW LPLGPDLNIA SFSNCFGDAN ALIRKEVFEK VGGFTEDYGL GHEDWEFFAK
ISLQGYKLQI VPEPLFWYRV ANSGMLLSGN KSKNNYRSFR PFMDENVKYN YAMGLIPSYL
EKIQELESEV NRLRSINGGH SVSNELQLLN NKVDGLISQQ RDGWAHDRFN ALYEAIHVQG
AKRGTSLVRR VARKVKSMLK