Gene Information |
Locus tag | Ent638_2039 |
Symbol | |
ID | 5113455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 2212809 |
End bp | 2213558 |
Gene Length | 750 bp |
Protein Length | 249 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640492227 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001176766 |
Protein GI | 146311692 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
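The coordinate and length fields in the Gene Information table are related by simple arithmetic. The sketch below checks that relationship, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon (neither convention is stated in the record). The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2212809
END_BP = 2213558


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 750
print(protein_length(length_bp))  # 249
```

Under these assumptions the computed values agree with the Gene Length (750 bp) and Protein Length (249 aa) fields.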
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.292922 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.566372 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAAC CACTTTCAGG CAAGATTGCC CTGGTCACGG GCGGGAGTAC GGGCATCGGT CTTGCCACCG TGCAGGAACT GGCGGCGCAA GGGGCAAAAG TCTACCTCAC CGGACGTCGC CAGCAGGAGC TGGATGATGC TGTCGCGCTA GTTGGTGCGT CGGCAACGGG TATCCAGGCG GATGCCTCCC GTCTGGACGA CCTGGATAAG GTCTACGCGC AGATCGCAGA GGAGTCGGGG CGTCTGGACA TTTTGTTTGC CAACGCCGGT GGCGGGGACA TGCTGCCGCT GGGTGCTATC ACCGAAGAGC ATTTTGACCG GATATTCGGA ACTAACGTTC GTGGCGTACT GTTTACTGTC CAGAAGGCGT TACCGCTTCT GAGTGCTGGC TCGTCCATCA TCCTGACCGC GTCTACCGTT TCGGTAAAAG GCACCGCCAA CTTTAGCGTT TATAGCGCCA GCAAGGCGGC AGTGAGAAAT TTTGCGCGTT CATGGGCGCT GGATTTGCAG GGGCGTGGTA TTCGGGTCAA TGTGGTGAGT CCGGGCCCGG TCAAAACGCC TGGATTGGGC GGATTGGTTC CGGAGGAGCA GCGTCAGGGT CTTTATGATG GACTGGCGGC GCAGGTTCCG CTGGGACGGA TTGGTGAGCC AGCGGAAGTC GGGAAAGCCG TTGCATTTCT GGCCTCCGAC GCCGCCAGCT TTATCAATGC TGTTGAGCTG TTTGTCGACG GTGGTATGGC GCAGATCTAA
|
Protein sequence | MSQPLSGKIA LVTGGSTGIG LATVQELAAQ GAKVYLTGRR QQELDDAVAL VGASATGIQA DASRLDDLDK VYAQIAEESG RLDILFANAG GGDMLPLGAI TEEHFDRIFG TNVRGVLFTV QKALPLLSAG SSIILTASTV SVKGTANFSV YSASKAAVRN FARSWALDLQ GRGIRVNVVS PGPVKTPGLG GLVPEEQRQG LYDGLAAQVP LGRIGEPAEV GKAVAFLASD AASFINAVEL FVDGGMAQI
|
| |
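The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from the gene sequence. It assumes an unambiguous A/C/G/T sequence given in coding orientation. The amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ only in the permitted start codons), so the standard 64-codon layout is used. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed in the Sequence table.
prefix = "ATGAGTCAAC CACTTTCAGG CAAGATTGCC"
print(translate(prefix))  # MSQPLSGKIA
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.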