Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2264 |
Symbol | |
ID | 6143748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2285476 |
End bp | 2286366 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641617139 |
Product | glycosyl transferase, group 2 family protein |
Protein accession | YP_001744312 |
Protein GI | 170680732 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATA CTGCACTAAT AGTCACTTTC AACCGTCTTG AGAAGCTAAA AAAGACGGTT GCTGAAACTG TTAAGCTGCA CTTTACCTCT ATCGTTATTG TCAATAATGG GTCCACGGAT GGGACCTCTG ACTGGCTGCA GACAATCACT GATCCCCGAG TCATTGTCTT AAATCTGGCC AGTAATAACG GGGGGGCCGG CGGGTTTAAG GCCGGTAGTC AATATATATG CAGCTCTGTT GATAGCGATT GGGTCTTCTT TTATGACGAT GATGCTTATC CGCAAAGTGA TATTCTTGAT AAGTTCGCGA CGATAAATAA AGAAGATTGT CGTGTTTTTA CGGCTTTAGT TAAAGATTTA CAGGGCCGGA CATGCGCGAT GAATGTTCCA TTTGTAAAAG TCCCGACCTC GTTTAGTGAT ACCTTGCAAT ATATCAAACA TCCCCAACGA TATGTACCAG ATAATAAGAA AATGCTGGTG CAAACCGTTT CATTTGTTGG CATGATCATT AAGCGTGAAG TCTTGAATGA GCACCTTCAT CATATTTATG ATGAACTTTT CCTTTATTTT GATGATCTCT ATTTTGGTTA TCAATTAACT TTAAGTGGCG AGAAAATCAC GTATCAACCG GAGCTTGTTT TTACTCATGA CGTGAGTATT CAGGGAAAGG TTATATCGCC AGAGTGGAAG GTTTACTATC TTTGTCGGAA CTTGATTTTG GCTAAGAGAA TATTCACGGA AGTGGAAATT TTTAGTAGTT CATCAATCCT TTTGCGCCTG TGTAAGTATA TCACCATATT CCCTGTGCAG CGCCGGAAAT GGGTATATCT AAAGTTTTTA TGCCGTGGGA TTGTACATGG TGTAAAAGGA ATCAGCGGTA AGTTTCACTA A
|
Protein sequence | MKHTALIVTF NRLEKLKKTV AETVKLHFTS IVIVNNGSTD GTSDWLQTIT DPRVIVLNLA SNNGGAGGFK AGSQYICSSV DSDWVFFYDD DAYPQSDILD KFATINKEDC RVFTALVKDL QGRTCAMNVP FVKVPTSFSD TLQYIKHPQR YVPDNKKMLV QTVSFVGMII KREVLNEHLH HIYDELFLYF DDLYFGYQLT LSGEKITYQP ELVFTHDVSI QGKVISPEWK VYYLCRNLIL AKRIFTEVEI FSSSSILLRL CKYITIFPVQ RRKWVYLKFL CRGIVHGVKG ISGKFH
|
| |