Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3951 |
Symbol | |
ID | 6147352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4030264 |
End bp | 4031298 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641618778 |
Product | putative glycosyl transferase |
Protein accession | YP_001745917 |
Protein GI | 170679583 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.174357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.660654 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAACA GCACTAATAA ACTAAGTGTT ATTATCCCGT TATATAATGC GGGCGATGAT TTCAGCGCTT GCATGGAATC TTTAATCGCG CAAACCTGGA CTGCTCTGGA AATCATTATT ATTAACGATG GTTCAACGGA TAATTCTGTT GAAATAGCAA AGCATTACGC AGAAAACTAT CCGCACGTTC GTTTGTTGCA TCAGGCGAAT GCTGGCGCAT CGGTGGCGCG TAATCGTGGG ATCGAGGTGG CGACGGGCAA ATATGTCGCT TTTGTCGATG CTGACGATGA GGTGTATCCC ACCATGTACG AAACGCTGAT GACCATGGCG TTAGAGGACG ACCTCGACGT GGCGCAGTGC AACGCTGATT GGTGTTTTCG TGAAACGGGT GAAACCTGGC AATCCATCCC TACCGATCGC CTTCGCTCAA CCGGTGTATT AACCGGTCCG GACTGGCTGC GGATGGGGCT TTCTTCCCGC CGTTGGACAC ACGTGGTCTG GATGGGGGTT TATCGCCGCG ATGTTATTGT TAAAAATAAC ATTAAATTTA TTGCCGGATT ACATCATCAG GATATTGTCT GGACAACAGA ATTTATGTTT AACGCGCTGC GTGCGCGATA TACCGAGCAA TCATTATATA AATATTATCT GCATAATACG TCAGTGAGTC GGTTGAACAG ACAAGGGAAC AAAAACCTTA ATTATCAACG TCACTATATT AAGATTACCC GCCTGCTGGA GAAATTAAAT CGAAATTATG CCGACAAAAT TACGATTTAT CCGGAATTTC ATCAGCAAAT AACCTACGAA GCATTGCGTG TTTGCCATGC GGTGCGCAAA GAGCCGGATA TTATTACCCG CCAACGGATG ATTGCCGAGA TATTTACTTC CGGTATGTAT AAGCGCCTGA TTACCAATGT GCGCAGCGTG AAGGTGGGTT ATCAGGCGTT ACTGTGGTCT TTCCGCTTAT GGCAATGGCG CGACAAAACG CGGTCGCACC ATCGCATTAC GCGTAGCGCC TTTAATTTGC GCTAG
|
Protein sequence | MMNSTNKLSV IIPLYNAGDD FSACMESLIA QTWTALEIII INDGSTDNSV EIAKHYAENY PHVRLLHQAN AGASVARNRG IEVATGKYVA FVDADDEVYP TMYETLMTMA LEDDLDVAQC NADWCFRETG ETWQSIPTDR LRSTGVLTGP DWLRMGLSSR RWTHVVWMGV YRRDVIVKNN IKFIAGLHHQ DIVWTTEFMF NALRARYTEQ SLYKYYLHNT SVSRLNRQGN KNLNYQRHYI KITRLLEKLN RNYADKITIY PEFHQQITYE ALRVCHAVRK EPDIITRQRM IAEIFTSGMY KRLITNVRSV KVGYQALLWS FRLWQWRDKT RSHHRITRSA FNLR
|
| |