Gene Information |
Locus tag | EcSMS35_3511 |
Symbol | |
ID | 6143105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3588780 |
End bp | 3590384 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618340 |
Product | hypothetical protein |
Protein accession | YP_001745487 |
Protein GI | 170683178 |
COG category | |
COG ID | |
TIGRFAM ID | |
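The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for EcSMS35_3511.
length = gene_length_bp(3588780, 3590384)
print(length)                     # 1605
print(protein_length_aa(length))  # 534
```

Both results match the Gene Length (1605 bp) and Protein Length (534 aa) fields listed in the record.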
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACGAAT GGCAAAATTT AACCCGCACA GCGCTACTCG GCACAGACAA AAAAGCGTTT ACACCGTCGG CGTCGCAAAG TGAGATTGGG CTGTTACTGT CTGAACTGGC GAATAACACA TCCGCAGAGC AACAGCTACT GCGCTCGGCG GGCGTTCTGG CGCTCTGCCA TCTGGCTGGC TGGGTGCCGC AGGCGATTGA CCCCGCACCG CTGCCGGAAA TCGATAAAGA AAATGCCGAG CCAATAAACA ACCCGGCTTT TGCGGGACTG CTGCATCTGC TGCTGTCAGA AGGACCGCCG CGGTTGCTGT GGTCGGCGCT TGATCTATTG GTCCGTCGTC AACTTCTTCC TCCGCCCCTG CTGTTACCAG CATTGCTGAA TTATGGAGCG CAAAAGCCAT CTCTACGCGG CATCCTGTCC AATGTTCTGG GCGCTCGCGG GTGCTGGCTG GCGCAAATCA ACGCCGATTG GGGCTATGTT CTCGCCTCCA CTGACGCTCC GCTGGATGAA GAAATGTGGC TGCACGGCAC GTCCGAACAG CGTCTGCAAT ATCTGACGCA AATGCGCATT CGCGCCCCGG AACAGGGTCG CGATCGGCTG GCGCAGGAGA TGAGTTCGCT GGATGCCCGC GAACGCGCAC AGCTGCTGGC TGTGCTGGAA GTGGGGATCT CCCAACACGA CGAAGCCTTT CTGGAACAGA CATTATCCGA TCGCAGCAAA GAGGTGCGGC AAACTGCGGC ACGGTTACTG TGTTGTCTGC CGGAAAGCGC CTGGATCTCG CGCATGAAGT CTCGGTTAGC GCCGTTGCTC AGTTCATCCC CCGCTCCTGA GACCTTACTT GAAAGGCTAA AAGGCTTATC CGGCAAAGAA AAAATGCTGC GCGTCGCCCT TGATGCCCCG GAGGCGTTCT TACCAGAGTG GAAGGCCGAT GCGCTGGAAG AAACTAAGCC CAAAGGCGAA AAGTTGGGCC AGCGCGCATG GTGGCTGTAT CAGATAGTTG CAGCGGTTCC GCTCGACTGG TGGCAAACGC AACTGCAGGC TACCCCCGTT GAACTGCTGC GCTGGGCCGG GAAAACCGAC TGGCAGGAGG CGCTACTGCG CGCCTGGTAT CACGCCGTCC TGCGCGAAAA AAATCCACAG TGGGCGCAGG CATTCCTCGC CCTACTCCCC GTCGGAATAA GCATTCATTC CCCTGCTGGC GTGAAGATTA ACGCCTTTGA GCTTCTGCAA TGCCTGCCCG TAGAAGAACA CGAGCCAGTG TTGACGTTAC TGTTCTCGGT AATGGGCGGC GAACATTTAC AACGCTACAT CGCCCTGTTA CCGCTGGATG CGTCGCTCTT TAGCCTGCGC CTGAGCCAGC AAATGGTCAC GGAGCTGCAC CGTTGGGTGA AACATGATTC CGCGCGCTTT GACTACGCCC TGCGCCATGT GATGAGTGAT TTCGCCTGCC TGCTGGCCCC CGAAGTCCTC AATGATGTCA TTGACAAGTG GCCGCATGAC GCACAACAAA CGCCTTATTG CGAGGCAGCC TTTACTGCCC TGAGCGCCAC GCTGGCGCAC CGAATTCAAC TTCATTCGTT TTTTGTCGGA GAAACTACTT TATGA
Protein sequence | MNEWQNLTRT ALLGTDKKAF TPSASQSEIG LLLSELANNT SAEQQLLRSA GVLALCHLAG WVPQAIDPAP LPEIDKENAE PINNPAFAGL LHLLLSEGPP RLLWSALDLL VRRQLLPPPL LLPALLNYGA QKPSLRGILS NVLGARGCWL AQINADWGYV LASTDAPLDE EMWLHGTSEQ RLQYLTQMRI RAPEQGRDRL AQEMSSLDAR ERAQLLAVLE VGISQHDEAF LEQTLSDRSK EVRQTAARLL CCLPESAWIS RMKSRLAPLL SSSPAPETLL ERLKGLSGKE KMLRVALDAP EAFLPEWKAD ALEETKPKGE KLGQRAWWLY QIVAAVPLDW WQTQLQATPV ELLRWAGKTD WQEALLRAWY HAVLREKNPQ WAQAFLALLP VGISIHSPAG VKINAFELLQ CLPVEEHEPV LTLLFSVMGG EHLQRYIALL PLDASLFSLR LSQQMVTELH RWVKHDSARF DYALRHVMSD FACLLAPEVL NDVIDKWPHD AQQTPYCEAA FTALSATLAH RIQLHSFFVG ETTL
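The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any length or composition check. The following is a minimal sketch, assuming the blocks are separated only by whitespace; `normalize` and `gc_fraction` are hypothetical helper names, and the sample input is just the first three blocks of the gene sequence rather than the full record.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = normalize(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    return (cleaned.count("G") + cleaned.count("C")) / len(cleaned)


# First three ten-base blocks of the gene sequence, used only as sample input.
first_blocks = "ATGAACGAAT GGCAAAATTT AACCCGCACA"
print(len(normalize(first_blocks)))  # 30
print(gc_fraction(first_blocks))
```

Applying the same helpers to the full gene sequence should give a length equal to the Gene Length field and a GC fraction close to the reported GC content, provided the sequence was copied without loss.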