Gene Information |
Locus tag | EcSMS35_3847 |
Symbol | |
ID | 6146221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3922751 |
End bp | 3924430 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618673 |
Product | hypothetical protein |
Protein accession | YP_001745813 |
Protein GI | 170681114 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03368] cellulose synthase operon protein YhjU |
|
|
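The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are hypothetical.

```python
# Sanity check of the coordinate and length fields in this record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the gene length counts the terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(3922751, 3924430)
print(length_bp)                  # 1680
print(protein_length(length_bp))  # 559
```

Both results agree with the Gene Length (1680 bp) and Protein Length (559 aa) fields.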
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAAT TTACGCAAAA TACCGCCATG CCTTCTTCCC TCTGGCAATA CTGGCGCGGC CTTTCCGGCT GGAACTTCTA TTTTCTGGTT AAGTTCGGCC TGTTGTGGGC AGGATATCTT AACTTCCATC CGCTCCTCAA TCTGGTGTTT GCCGCGTTTC TGCTGATGCC CATTCCGCGC TACAGCCTGC ATCGCTTACG CCACTGGATT GCCCTGCCGA TCGGCTTTGC TTTGTTCTGG CATGACACCT GGTTGCCTGG CCCGGAAAGC ATAATGAGCC AGGGTTCGCA GGTGGCGGGG TTCAGTACCG ATTATTTAAT CGACCTTGTC ACACGCTTTA TTAACTGGCA GATGATTGGG GCCATTTTTG TTTTATTAGT GGCCTGGTTA TTCCTGTCAC AATGGATTCG CATTACCGTT TTTGTGGTTG CCATTCTGCT ATGGCTGAAC GTACTTACCC TGGCGGGACC AAGTTTCTCC TTGTGGCCTG CCGGTCAACC GACGACCACT GTAACAACGA CGGGTGGTAA CGCAGCGGCA ACCGTTGCGG CGACGGGTGG CGCACCGGTA GTGGGTGATA TGCCCGCACA AACTGCACCG CCAACAACGG CGAACCTTAA CGCCTGGCTG AATAATTTCT ATAACGCGGA GGCGAAACGT AAATCGACCT TCCCGTCTTC GCTGCCCGCT GATGCTCAGC CATTTGAACT ACTGGTGATT AATATCTGTT CGCTTTCCTG GTCGGATATT GAAGCCGCCG GGTTGATGTC GCATCCGCTG TGGTCGCATT TCGATATTGA GTTCAAGAAC TTTAACTCCG CCACCTCCCA CAGTGGCCCG GCGGCGATCC GTTTACTGCG CGCCAGCTGC GGGCAGACTT CGCACACTAA TCTGTATCAA CCGGCAAATA ATGACTGCTA TCTGTTTGAT AACCTGTCGA AACTGGGCTT TACCCAGCAC CTGATGATGG GACATAACGG CCAGTTCGGC GGTTTTTTGA AAGAAGTTCG CGAAAATGGT GGCATGCAGA CCGAATTGAT GGATCAAACA AATCTGCCGG TTATTTTGCT GGGCTTTGAT GGTTCGCCGG TTTATGACGA TACCGCCGTG CTTAACCGCT GGCTGGACGT TACCGAAAAA GATAAAAACA GCCGTAGTGC CACGTTCTAC AACACGCTTC CACTGCATGA CGGCAACCAT TATCCGGGCG TCAGCAAAAC AGCGGATTAC AAAGCGCGGG CGCAGAAATT CTTTGATGAA CTGGACGCCT TCTTTACTGA ACTGGAGAAA TCGGGTCGTA AAGTGATGGT GGTCGTGGTG CCAGAACACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGG TATCTGGCCT ACGTGATATC CCTAGCCCGT CTATCACAGA CGTCCCCGTT GGGGTGAAAT TCTTCGGCAT GAAGGCACCA CATCAGGGGG CACCGATTGT CATCGACCAA CCGAGCAGCT TCCTGGCTAT CTCCGATCTG GTGGTTCGGG TTCTTGATGG CAAGATTTTC ACCGAAGACA ATGTTGACTG GAAAAAACTC ACCAGTGGGT TGCCACAAAC AGCACCGGTC TCCGAGAACT CAAATGCAGT AGTTATTCAA TACCAGGATA AACCGTACGT TCGCCTGAAC GGCGGCGACT GGGTGCCTTA CCCGCAATAA
|
Protein sequence | MTQFTQNTAM PSSLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF AAFLLMPIPR YSLHRLRHWI ALPIGFALFW HDTWLPGPES IMSQGSQVAG FSTDYLIDLV TRFINWQMIG AIFVLLVAWL FLSQWIRITV FVVAILLWLN VLTLAGPSFS LWPAGQPTTT VTTTGGNAAA TVAATGGAPV VGDMPAQTAP PTTANLNAWL NNFYNAEAKR KSTFPSSLPA DAQPFELLVI NICSLSWSDI EAAGLMSHPL WSHFDIEFKN FNSATSHSGP AAIRLLRASC GQTSHTNLYQ PANNDCYLFD NLSKLGFTQH LMMGHNGQFG GFLKEVRENG GMQTELMDQT NLPVILLGFD GSPVYDDTAV LNRWLDVTEK DKNSRSATFY NTLPLHDGNH YPGVSKTADY KARAQKFFDE LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQVSGLRDI PSPSITDVPV GVKFFGMKAP HQGAPIVIDQ PSSFLAISDL VVRVLDGKIF TEDNVDWKKL TSGLPQTAPV SENSNAVVIQ YQDKPYVRLN GGDWVPYPQ
|
| |
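The sequences above are printed in blocks of 10 characters separated by spaces. A minimal sketch of how such a record could be cleaned, translated, and checked for GC content follows. It uses only the Python standard library; the helper names are hypothetical, and the codon table is the standard genetic code, whose codon-to-amino-acid assignments are shared by translation table 11 (table 11 differs in the set of permitted start codons, which this sketch does not model). Only the first 30 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; unknown codons become 'X', stops '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First three blocks of the gene sequence from this record.
prefix = clean_sequence("ATGACTCAAT TTACGCAAAA TACCGCCATG")
print(translate(prefix))  # MTQFTQNTAM
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would allow the GC content field to be checked in the same way.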