Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3781 |
Symbol | |
ID | 6146209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3847807 |
End bp | 3849009 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618607 |
Product | pyridine nucleotide-disulfide oxidoreductase family protein |
Protein accession | YP_001745747 |
Protein GI | 170683730 |
COG category | [R] General function prediction only |
COG ID | [COG2081] Predicted flavoproteins |
TIGRFAM ID | [TIGR00275] flavoprotein, HI0933 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAAGGT TTGATGCCAT TATTATAGGC GCTGGTGCGG CAGGTATGTT CTGTTCTGCG CTGGCAGGTC AGGCAGGACG CCGGGTTCTG CTGATCGATA ATGGTAAAAA ACCAGGGCGC AAAATCCTCA TGTCTGGCGG TGGGCGCTGC AACTTTACCA ACCTTTATGT CGAACCGGGC GCTTATCTGA GCCAGAATCC GCATTTTTGT AAGTCTGCAC TCGCGCGTTT TACCCAGTGG GATTTCATTG ATCTGGTCAA TAAACACGGC ATCACCTGGC ACGAGAAAAC GTTAGGGCAA CTCTTCTGCG ATGACTCCGC GCAGCAGATT GTCGACATGC TGGTGGATGA GTGCGAGAAG GGCAATGTGA CCTTCAGATT GCGTAGCGAA GTGCTGAGTG TGGCGAAGGA TGAAACAGGC TTCACGCTTG AACTGAACGG CATGACTGTC GGTTGCGAAA AGCTGGTCAT CGCGACCGGT GGGCTGTCAA TGCCGGGGCT GGGCGCATCG CCGTTTGGTT ATAAGATTGC CGAACAATTT GGCCTCAACG TGCTGCCGAC CCGCGCGGGC CTGGTGCCAT TCACTCTGCA TAAACCGTTG CTCGAAGAGT TACAGGTGCT GGCGGGCGTG GCGGTGCCTT CCGTGATTAC CGCTGAAAAC GGCACCGTTT TCCGTGAGAA CTTACTCTTC ACCCACCGCG GCTTATCTGG ACCGGCGGTG TTGCAAATTT CCAGCTACTG GCAACCGGGT GAATTTGTCA GTATCAATCT GCTACCGGAT GTGGACCTCG AAACCTTCCT GAATGAGCAG CGTAATGCAC ATCCGAATCA GAGCCTGAAA AACACACTGG CGGTTCATCT ACCGAAGCGG TTGGTTGAAC GTTTACAGCA ACTCGGGCAA ATCCCGGATG TTTCGCTAAA ACAGCTCAAC GTGCGTGACC AACAGGCACT GATTAGCACA TTGACCGACT GGCGCGTACA ACCCAACGGC ACTGAAGGCT ATCGTACTGC TGAAGTGACG CTCGGCGGCG TGGACACCAA CGAACTCTCT TCACGGACGA TGGAAGCGCG CAAAGTGCCA GGGCTGTACT TCATCGGCGA AGTGATGGAC GTCACCGGTT GGCTGGGGGG CTATAATTTC CAGTGGGCGT GGTCGAGTGC GTGGGCTTGT GCGCAGGATT TGATTGCAGC GAAGTCGTCC TGA
|
Protein sequence | MERFDAIIIG AGAAGMFCSA LAGQAGRRVL LIDNGKKPGR KILMSGGGRC NFTNLYVEPG AYLSQNPHFC KSALARFTQW DFIDLVNKHG ITWHEKTLGQ LFCDDSAQQI VDMLVDECEK GNVTFRLRSE VLSVAKDETG FTLELNGMTV GCEKLVIATG GLSMPGLGAS PFGYKIAEQF GLNVLPTRAG LVPFTLHKPL LEELQVLAGV AVPSVITAEN GTVFRENLLF THRGLSGPAV LQISSYWQPG EFVSINLLPD VDLETFLNEQ RNAHPNQSLK NTLAVHLPKR LVERLQQLGQ IPDVSLKQLN VRDQQALIST LTDWRVQPNG TEGYRTAEVT LGGVDTNELS SRTMEARKVP GLYFIGEVMD VTGWLGGYNF QWAWSSAWAC AQDLIAAKSS
|
| |