Gene Information |
Locus tag | EcSMS35_1723 |
Symbol | |
ID | 6143118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1731172 |
End bp | 1732209 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616600 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001743778 |
Protein GI | 170682763 |
COG category | [R] General function prediction only |
COG ID | [COG2130] Putative NADP-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.00000837186 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGCAAC AAAAGCAGCG TAACCGCCGT TGGGTTCTGG CCTCGCGTCC GCATGGCGCG CCGGTTCCGG AGAATTTCCG TCTTGAAGAA GATGATGTCG CCACACCGGG TGAAGGTCAG GTGTTACTGC GCACAGTTTA TTTGTCACTG GACCCGTATA TGCGTGGACG TATGAGCGAT GAGCCATCTT ATTCACCGCC GGTTGATATT GGCGGCGTGA TGGTCGGCGG TACGGTGAGC CGTGTCGTGG AGTCGAATCA TCCTGATTAT CAGCCTGGCG ACTGGGTGCT GGGCTACAGT GGATGGCAGG ACTATGACAT ATCCAGTGGT GATGATCTGG TGAAACTTGG CGATCATCCG CAAAATCCAT CGTGGTCGCT GGGTGTGCTG GGGATGCCAG GCTTTACCGC TTATATGGGG CTGCTGGATA TCGGTCAGCC TAAAGAGGGC GAAACGCTGG TGGTAGCTGC GGCGACAGGA CCAGTGGGGG CGACGGTGGG GCAAATCGGC AAACTTAAAG GTTGCAGGGT GGTGGGGGTT GCCGGTGGCG CGGAAAAATG CCGCCATGCC ACCGAAGTGC TGGGTTTTGA TGTTTGCCTT GACCATCACG CGGATGATTT TGCCGAACAA CTGGCGAAAG CGTGCCCAAA AGGTATTGAT ATCTATTATG AAAACGTTGG TGGTAAGGTA TTTGATGCGG TGCTACCGTT ACTCAATACA TCTGCGCGCA TTCCCGTCTG CGGCTTAGTG AGCAGCTATA ACGCTACAGA GCTACCACCC GGTCCGGATC GTTTACCCCT GTTGATGGCT ACGGTGCTGA AAAAACGCAT TCGATTACAA GGGTTTATTA TTGGTCAGGA TTATGGTCAC CGTATCCATG AGTTTCAGCA GGAGATGGGA CAATGGGTGA AAGAGGGGAA AATTCACTAT CGTGAACAAA TTACCAACGG TTTGGAGAAC GCCCCACAGA CGTTTATTGG CCTGCTGAAC GGCAAAAACT TCGGTAAAGT TGTTATTCGC GTCGCGGATG ATGATTAA
|
Protein sequence | MGQQKQRNRR WVLASRPHGA PVPENFRLEE DDVATPGEGQ VLLRTVYLSL DPYMRGRMSD EPSYSPPVDI GGVMVGGTVS RVVESNHPDY QPGDWVLGYS GWQDYDISSG DDLVKLGDHP QNPSWSLGVL GMPGFTAYMG LLDIGQPKEG ETLVVAAATG PVGATVGQIG KLKGCRVVGV AGGAEKCRHA TEVLGFDVCL DHHADDFAEQ LAKACPKGID IYYENVGGKV FDAVLPLLNT SARIPVCGLV SSYNATELPP GPDRLPLLMA TVLKKRIRLQ GFIIGQDYGH RIHEFQQEMG QWVKEGKIHY REQITNGLEN APQTFIGLLN GKNFGKVVIR VADDD
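The coordinate and length fields in this record are related by simple arithmetic, which the minimal sketch below checks. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function and variable names are illustrative only and do not belong to any database API.

```python
# Values copied from the record above.
start_bp = 1731172
end_bp = 1732209
gene_length_bp = 1038
protein_length_aa = 345


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the fields listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the listed Gene Length and Protein Length. The strand field does not affect the length calculation, only the orientation of the sequence relative to the replicon.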