Gene Information |
Locus tag | EcSMS35_1026 |
Symbol | |
ID | 6147290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1048051 |
End bp | 1049076 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641615913 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001743105 |
Protein GI | 170682395 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAG CGATGATAAT ACCATCTTTA AAAAATACCG CGCCAAATAA TATAGCCCTT TCAATATGTG AAGGGGCAAG GAAGCATGGT GTCAATATAG ATATTTATTA TCTTGATGAA ATTGTAGAGT TGGATATAGA CGAGTCAGAA CTGAATATAA ATAGGCTAAA TGATAGAACC ATTTTAAACA GTTATGATAT TGTTCACTCT CATGGCTTAA GACCCGATGT TATTAATTGT AAGACCAGCA AAAATGTCAT TAATGTAAGT ACGCTTCATA GTTTTATTTT AGATGATCTT AAGAATAAAT ATGGAATAAA AGGATATCTG ATCGCTTTTG CTTGGCTTAA ATTATTAAAG CGATTTGATT CATGTATTGC AATTAGTGAG GCGACTAAAC GGTTTTATGT TAAACATGGT CTTAAATGCC ACCATATATA TAATGGAATC GACTTTCCTG ACATCGAATT ATATCAAGAT AACTCAAGGC TAAATAACGG CAAAATAAAT ATAGTATCGA TCGCGCATCT GGAAAAAATA AAAGGCTTAG AGCAACTCCT ATATCTAGCC GCGGAGCGAG AAGAGTTCCA TGTTCATATA ATTGGTGAAG GAACTTACCG TGGGAAATTA ACAAATATTA TTGATAAGTT TGATTTATCT CAGCGTGTAA CTTTGCATGG ATATATATCA AATGCGAGTT CTATGTTGGG GAAATTTGAT GTTTATGTGC AACCATCTAA AAGTGAGGGG TTTGGCATTG CAGTCATTGA AGCTCTTGTA AATAAAATCC CAACAGTTTG CTCAGACATT GAAGTGTTCA GAGAGTTATT TGGCTCAGGA GAAGTTGAAT TTTTCAAGTT AGACGATATT AATTCACTTT ACGATGCAAT TTCAAGTGCA TTAACCAAGA CAGATATGTC TCGAGATGCA AGTGCCACAG CGATAAGCCA AAAATTTTCT TCAGAAGTGA TGTCATTGAA CTATTTAAGT TGGTATAAAA AATTATATGA AGAAAGAAAT TTATAA
|
Protein sequence | MKIAMIIPSL KNTAPNNIAL SICEGARKHG VNIDIYYLDE IVELDIDESE LNINRLNDRT ILNSYDIVHS HGLRPDVINC KTSKNVINVS TLHSFILDDL KNKYGIKGYL IAFAWLKLLK RFDSCIAISE ATKRFYVKHG LKCHHIYNGI DFPDIELYQD NSRLNNGKIN IVSIAHLEKI KGLEQLLYLA AEREEFHVHI IGEGTYRGKL TNIIDKFDLS QRVTLHGYIS NASSMLGKFD VYVQPSKSEG FGIAVIEALV NKIPTVCSDI EVFRELFGSG EVEFFKLDDI NSLYDAISSA LTKTDMSRDA SATAISQKFS SEVMSLNYLS WYKKLYEERN L
|
| |
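The coordinates and lengths in the Gene Information table can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends with a stop codon (the gene sequence above ends in TAA).

```python
# Values copied from the Gene Information table above.
START_BP = 1048051
END_BP = 1049076
GENE_LENGTH_BP = 1026
PROTEIN_LENGTH_AA = 341


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring the spaces that group the sequence."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    # Both checks pass for the values recorded in the table.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Passing the Gene sequence string from the Sequence table to `gc_fraction` gives a value that can be compared with the GC content row.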