Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1575 |
Symbol | |
ID | 6143838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1558354 |
End bp | 1559394 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616452 |
Product | putative oxidoreductase |
Protein accession | YP_001743630 |
Protein GI | 170680179 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.544728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA ACATCCGTGT TGGGTTGATT GGGTATGGTT ATGCGAGCAA AACCTTCCAT GCGCCCCTGA TTGCGGGCAC GCCCGGGCTG GAACTGGCGG TAATCTCCAG CAGCGATGAA ACAAAAGTAA AAGCCGACTG GCCAACGGTT GCGGTTGTCT CTGAGCCGAA GCATCTGTTT AACGATCCCA ACATAGACCT GATTGTCATT CCTACACCCA ACGATACCCA TTTCCCGTTA GCCAAAGCGG CGCTTGAGGC GGGTAAACAT GTGGTCGTTG ATAAACCCTT TACCGTGACA CTGTCACAAG CGCGAGAGCT GGAAGCGCTG GCAAAAAGCC TGGGGCGTGT GCTGTCTGTA TTCCATAACC GTCGCTGGGA TAGCGATTTC CTGACGCTAA AAGGTTTGCT CGTGGAAGGC GTACTGGGTG AAGTTGCTTA CTTTGAGTCT CATTTTGACC GCTTCCGTCC GCAGGTGCGC GATCGTTGGC GTGAACAGGG CGGTCCTGGC AGCGGTATCT GGTACGATTT AGCACCGCAT CTTCTTGATC AGGCCATTAC GCTATTTGGT TTACCGGTCA GCATGACGGT TGATTTGGCA CAGTTACGGC CCGGAGCGCA GTCGACCGAT TATTTCCACG CCATCTTGTC CTATCCGCAG CGGCGAGTCA TTTTACACGG TACCATGCTG GCAGCTGCTG AGTCAGCACG TTATATCGTG CATGGATCCC GAGGCAGTTA TGTGAAATAT GGCCTCGATC CACAGGAAGA ACGTCTGAAA AATGGCGAGC GTCTGCCGCA GGAAGACTGG GGCTACGATA TGCGTGATGG CGTACTTACC CGCGTGGAAG GTGAGGAACG TGTCGAAGAA ACGCTGTTGA CAGTACCAGG GAATTATCCG GCTTACTATG CGGCTATTCG TGATGCGTTA AATGGCGATG GTGAAAATCC GGTTCCGGCA AGTCAGGCAA TCCAGGTAAT GGAGTTGATT GAGCAGGGCA TCGAATCCGC CAAACATCGC GCGACGCTGT GCCTTGCGTG A
|
Protein sequence | MSDNIRVGLI GYGYASKTFH APLIAGTPGL ELAVISSSDE TKVKADWPTV AVVSEPKHLF NDPNIDLIVI PTPNDTHFPL AKAALEAGKH VVVDKPFTVT LSQARELEAL AKSLGRVLSV FHNRRWDSDF LTLKGLLVEG VLGEVAYFES HFDRFRPQVR DRWREQGGPG SGIWYDLAPH LLDQAITLFG LPVSMTVDLA QLRPGAQSTD YFHAILSYPQ RRVILHGTML AAAESARYIV HGSRGSYVKY GLDPQEERLK NGERLPQEDW GYDMRDGVLT RVEGEERVEE TLLTVPGNYP AYYAAIRDAL NGDGENPVPA SQAIQVMELI EQGIESAKHR ATLCLA
|
| |