Gene Information |
Locus tag | EcSMS35_2262 |
Symbol | |
ID | 6145675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2283048 |
End bp | 2284208 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641617137 |
Product | glycosyl transferase, group 2 |
Protein accession | YP_001744310 |
Protein GI | 170681557 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
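The length fields in this table can be cross-checked against the coordinates. The sketch below is a minimal sanity check, assuming the start and end positions are 1-based and inclusive (the listed gene length is consistent with that reading) and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table; names are illustrative.
start_bp = 2283048
end_bp = 2284208

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final stop codon not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed 1161 bp and 386 aa.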
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAA AAGAGAGTAT AGCTGTTTCC GTTATTATCC CTGTGCACAA TGCGGCGGGA TATATTTCCG ACACATTGAG CACTGTTTTG TCGCAAACAT TAAATGATAT TGAAATCATT ATTGTCAACG ATTGTTCTGA CGATAATACC TTAGAGATTG TCTCGGCGCT GGCTGAAACT GATTCACGAA TTAGAGTCAT TAACAACACG ACAAATCTTG GTGGGGGAGG CTCAAGAAAC ATCGGGTTGG ATGCTGCTAC CGGTGAGTAC GTTATTTTTC TTGATGACGA CGATTATGTC GATAACATGA TGTTGGAGCG CATGTACGCA CGTGCTAGCG ATATACAAGC GGATGTCGTT ATCTGTCGCA GCCAGTCTTT TGATCCCGCC TTGCAGGTTT ATGCTCCGAT GCCGTGGTCA GTGCGCCAGG AGCTGTTACC TGATCTCAAG TTTTTTTCAT CCCAGGATAT TCCCTCTGAC TTTTTCCGCA CCTTTGTCTG GTGGCCATGG GACAAGCTTC TCAGGCGTCA ATTTATTACT AGTCATCAGC TTCGCTTTCA GGAAATCAGG ACGACCAACG ATCTTTTCTT CGTCTGCGCA TTTATGCTCA TGGCGAACAG AATTTCGGTC TTAAATGAAA CGTTAATTTC TCATACCATC AACCGTAGTG AATCCCTATC CGCCACGCGA GCAGAGTCGC ACCGCTGTGC TGTTGAAGCA TTGGTGGCGC TTAAAGCGTT TATCTGTCAG CAGGGGATGA TGGAACACCG TCTCAGAGAT TATAAAAACT ATGTTGTCGT GTTCCTTGAG TGGCATTTAA ACACGATTTC TGGTCCGGCA TTTCACCCGT TTTATCAGCA AGTGAAAGAG TTTGTTGTTG CCCTGGATGC AAAGAGTGAT GATTTTTATG ATGAATTTAT CGCCGCCGCC CATCATAGGA TCACCACGCT GTCAGCTGAA GAGTACCTAT TCTCTCTTAA AGATCGGGTT CTGAAGGAAC TTGAGTTTTT CCAGGCCAGG AGCTCCGCAT TACAGCAAGA AGTTGAGACG CTTACCCACT CATTAGCCGG GCAAAAGGAT GAAAATGCGA TATTGCATAA TCAATTGCAT GAGATTGAAG AGCGGGTAAC GGCTTTGTTG AATAAATCGA ACTTTTGCTG A
|
Protein sequence | MSEKESIAVS VIIPVHNAAG YISDTLSTVL SQTLNDIEII IVNDCSDDNT LEIVSALAET DSRIRVINNT TNLGGGGSRN IGLDAATGEY VIFLDDDDYV DNMMLERMYA RASDIQADVV ICRSQSFDPA LQVYAPMPWS VRQELLPDLK FFSSQDIPSD FFRTFVWWPW DKLLRRQFIT SHQLRFQEIR TTNDLFFVCA FMLMANRISV LNETLISHTI NRSESLSATR AESHRCAVEA LVALKAFICQ QGMMEHRLRD YKNYVVVFLE WHLNTISGPA FHPFYQQVKE FVVALDAKSD DFYDEFIAAA HHRITTLSAE EYLFSLKDRV LKELEFFQAR SSALQQEVET LTHSLAGQKD ENAILHNQLH EIEERVTALL NKSNFC
|
| |
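The sequences above are printed in space-separated blocks of ten. As a rough sketch of how the two fields relate, the snippet below strips the spacing and translates the first and last few codons of the gene sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares with the standard code. The helper names `clean` and `translate` are hypothetical.

```python
# Build a codon table from the standard amino acid assignments
# (translation table 11 uses the same assignments; only start codons differ).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(blocked: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and the last 21 nucleotides, copied from the record.
head = clean("ATGTCTGAAA AAGAGAGTAT AGCTGTTTCC")
tail = clean("AATAAATCGA ACTTTTGCTG A")

print(translate(head))  # MSEKESIAVS
print(translate(tail))  # NKSNFC*
```

The translated head matches the first block of the protein sequence, and the tail matches its final residues followed by the stop codon.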