Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3884 |
Symbol | |
ID | 6143397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3952045 |
End bp | 3953040 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641618710 |
Product | acyltransferase family protein |
Protein accession | YP_001745849 |
Protein GI | 170680005 |
COG category | [S] Function unknown |
COG ID | [COG3274] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCCA AAATTTACTG GATTGATAAC CTGCGAGGGA TAGCGTGTTT AATGGTGGTG ATGATTCACA CCACTACCTG GTATGTGACC AATGCTCATA GTGTTAGCCC CGTCACATGG GATATCGCCA ATGTTCTGAA TTCTGCCTCT CGTGTCAGCG TGCCGCTATT TTTCATGATT TCCGGCTATC TCTTTTTTGG CGAACGCAGC GCCCAGCCGC GCCATTTCTT GCGCATCGGC TTATGCCTGC TGTTTTATAG CACTGTTGCG CTGCTCTACA TTGCGCTGTT TACCTCCATC AATATGGAGT TAGCGCTGAA AAACCTGCTG CAAAAGCCAG TGTTTTACCA CTTGTGGTTT TTCTTCGCGA TTGCGGTGAT TTATCTGGTT TCACCGCTGA TTCAGGTGAA GAACGTCGGC GGAAAAATGT TGCTGGTGCT AATGGTGGTG ATTGGCATTA TCGCTAACCC AAACACAGTG CCGCAGAAAA TTGACGGTTT TGAATGGCTG CCAATTAACT TATATATCAA TGGCGATACT TTTTACTACA TCCTGTATGG CATGTTGGGC CGCGCTATAG GGATGATGGA CACACAGCAT AAAGCACTGT CGTGGGTGTG CGCCGCACTG TTTGCGACGG GGGTATTTAT TATCTCTCGC GGGACATTAT ATGAATTGCA GTGGCGCGGA AATTTTGCCG ATACCTGGTA TCTTTACTGT GGGCCGATGG TTTTTATCTG CGCAATCACG CTATTGACTC TGGTTAAAAA CACGCTGGAT ACACACACTG TTCCCGGGCT TGGGCTGATA TCACGCCATT CTTTAGGTAT ATACGGGTTC CATGCATTGA TTATCCATGC GCTGCGTACC CGGGGGATTG AGCTTAAAAA CTGGCCAATA CTCGATATTA TTTGGATCTT TTGCGCGACG TTGGCAGCGA GTTTGTTACT TTCTATGCTG GTACAACGAA TCGACAGAAA CAGACTAGTG AGTTAA
|
Protein sequence | MQPKIYWIDN LRGIACLMVV MIHTTTWYVT NAHSVSPVTW DIANVLNSAS RVSVPLFFMI SGYLFFGERS AQPRHFLRIG LCLLFYSTVA LLYIALFTSI NMELALKNLL QKPVFYHLWF FFAIAVIYLV SPLIQVKNVG GKMLLVLMVV IGIIANPNTV PQKIDGFEWL PINLYINGDT FYYILYGMLG RAIGMMDTQH KALSWVCAAL FATGVFIISR GTLYELQWRG NFADTWYLYC GPMVFICAIT LLTLVKNTLD THTVPGLGLI SRHSLGIYGF HALIIHALRT RGIELKNWPI LDIIWIFCAT LAASLLLSML VQRIDRNRLV S
|
| |