Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1766 |
Symbol | |
ID | 6146590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1775847 |
End bp | 1776707 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616642 |
Product | putative oxidoreductase |
Protein accession | YP_001743820 |
Protein GI | 170683747 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.00951362 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAGCA ATACATTTAC TCTCGGTACA AAATCCGTTA ACCGTCTTGG TTATGGCGCG ATGCAACTGG CAGGTCCTGG GGTTTTTGGC CCCCCAAGAG ATCGCCACGT CGCTATAACT GTGCTGCGCG AGGCGCTGGC ATTGGGCGTC AATCACATTG ATACCAGCGA CTTTTATGGT CCGCACGTCA CCAATCAGAT TATCCGCGAA GCGCTTTATC CTTACTCTGA CGACCTGACA ATTGTCACTA AAATTGGTGC GCGGCGTGGA GAGGACGCTT CCTGGTTGCC CGCCTTTTCT CCGGCAGAGT TGCAAAAAGC GGTGCACGAT AATCTACGTA ATCTCGGCCT GGACGTACTG GATGTGGTTA ACCTGCGCGT TATGATGGGG GATGGTCATG GCCCAGCGGA AGGATCGATT GAGGCCAGCC TGACCGTGCT GGCAGAGATG CAACAACAAG GCCTGGTAAA ACATATTGGC CTGAGCAACG TCACACCGAC GCAGGTTGCA GAGGCGCGCA AGATTGCCGA AATTGTCTGT GTGCAAAACG AATACAACAT CGCGCACCGT GCTGATGATG CAATGATTGA TGCTTTGGCC CACGATGGCA TTGCCTACGT GCCGTTCTTC CCGCTCGGGG GCTTTACACC GCTGCAATCC TCTACGCTGT CGGATGTTGC TGCGAGCCTG GGTGCAACAC CCATGCAGGT GGCGCTGGCG TGGCTGTTAC AACGTTCACC GAATATTTTG CTGATCCCAG GGACGTCTTC TGTTGCGCAT TTACGGGAGA ATATGGCTGC TGAAAAATTG CATCTTTCTG AGGAAGTGTT GTCTACGTTG GATGGTATTT CGCGAGAATA A
|
Protein sequence | MSSNTFTLGT KSVNRLGYGA MQLAGPGVFG PPRDRHVAIT VLREALALGV NHIDTSDFYG PHVTNQIIRE ALYPYSDDLT IVTKIGARRG EDASWLPAFS PAELQKAVHD NLRNLGLDVL DVVNLRVMMG DGHGPAEGSI EASLTVLAEM QQQGLVKHIG LSNVTPTQVA EARKIAEIVC VQNEYNIAHR ADDAMIDALA HDGIAYVPFF PLGGFTPLQS STLSDVAASL GATPMQVALA WLLQRSPNIL LIPGTSSVAH LRENMAAEKL HLSEEVLSTL DGISRE
|
| |