Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1860 |
Symbol | |
ID | 6146680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1883797 |
End bp | 1884555 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616736 |
Product | short chain dehydrogenase |
Protein accession | YP_001743914 |
Protein GI | 170681806 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000000000000976812 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATTACC AGCCAAAACA AGATTTACTC AATGATCGCA TTATCCTGGT GACGGGAGCC AGCAATGGTA TTGGTCGTGA AGCCGCGATG ACGTATGCAC GCTATGGTGC GACAGTGATT CTGTTGGGCC GTAATGAAGA AAAATTACGT CAGGTAGCCA GCCACATAAA CGAAGAAACT GGGCGTCAGC CACAGTGGTT TATTCTCGAT TTGCTGACCT GCACGTCCGA AGATTGCCAA CAACTGGCAC AGCGCATTGC CGTTAATTAT CCGCGTCTGG ATGGTGTTTT GCATAATGCC GGATTGCTCG GCGATGTTTG TCCAATGAGC GAACAAAATC CACAGGTCTG GCAGGACGTC ATACAGGTCA ACGTTAATGC CACCTTTATG CTCACCCAGG CACTGCTTCC TTTATTACTC AAATCGGACG CCGGTTCACT GGTCTTTACT TCATCAAGCG TTGGACGTCA GGGGCGTGCA AACTGGGGCG CTTATGCAGC GTCGAAATTT GCCACCGAAG GGATGATGCA GGTACTGGCC GATGAATATC AGCAGCGCCT GCGCGTCAAC TGCATTAACC CAGGCGGTAC GCGCACCGCA ATGCGTGCCA GCGCCTTCCC GACCGAAGAT CCACAGAAAC TTAAAACACC CGCTGATATC ATGCCGCTCT ACCTTTGGCT GATGGGCGAT GACAGCCGCC GTAAAACCGG CATGACCTTT GACGCCCAAC CGGGCCGTAA ACCAGGAATT TCCCAATGA
|
Protein sequence | MHYQPKQDLL NDRIILVTGA SNGIGREAAM TYARYGATVI LLGRNEEKLR QVASHINEET GRQPQWFILD LLTCTSEDCQ QLAQRIAVNY PRLDGVLHNA GLLGDVCPMS EQNPQVWQDV IQVNVNATFM LTQALLPLLL KSDAGSLVFT SSSVGRQGRA NWGAYAASKF ATEGMMQVLA DEYQQRLRVN CINPGGTRTA MRASAFPTED PQKLKTPADI MPLYLWLMGD DSRRKTGMTF DAQPGRKPGI SQ
|
| |