Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1509 |
Symbol | |
ID | 6145016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1494178 |
End bp | 1497234 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616387 |
Product | oxidoreductase, FAD-binding |
Protein accession | YP_001743567 |
Protein GI | 170681522 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.784999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0000155929 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTCCAC AGATTTCCCA GGCACCTGGC GTCGTTCAAT TGGTGCTTAA TTTTTTGCAA GAGCTGGAGC AACAGGGTTT TACTGGCGAT ACGGCGACAA GTTATGCCGA TCGTCTGACA ATGTCGACCG ACAACAGTAT TTACCAACTT CTCCCCGATG CGGTGGTATT TCCGCGTTCA ACCGCAGATG TAGCGCTGAT CGCCCGTCTT GCCGCGCAGG AACGCTATTC ATCGTTGATC TTTACCCCCC GCGGCGGCGG CACCGGCACT AACGGTCAGG CGCTCAACCA GGGGATTATT GTTGATATGT CCCGCCATAT GAACCGCATC ATCGAAATTA ACCCTGCAGA GGGCTGGGTG CGCGTTGAGG CCGGGGTGAT AAAAGACCAA CTCAATCAGT ACCTGAAACC GTTCGGCTAC TTTTTTGCGC CGGAGCTTTC GACCAGCAAC CGGGCAACGC TCGGCGGGAT GATCAATACC GATGCATCCG GTCAGGGATC GCTGGTCTAT GGCAAAACGT CAGATCACGT ACTTGGCGTA CGCGCGGTGT TGTTGGGGGG CGATATTCTC GATACGCAAC CTTTACCCGT CGAACTGGCG GAAACGCTGG GTAAATCCAA TACCACAATC GGGCGAATTT ATAACACGGT TTATCAACGT TGCCGTCAGC AACGCCAGTT AATCATCAAC AACTTCCCCA AACTTAACCG CTTTCTTACC GGTTACGATC TGCGTCATGT CTTTAACGAT GAGATGACCG AGTTCGATCT GACGCGCATT CTGACGGGTT CAGAAGGGAC GCTGGCCTTT ATTACCGAAG CGCGGCTGGA TATTACGCGC TTGCCTAAAG TGCGGCGTCT GGTGAACGTC AAATATGACT CTTTTGACTC CGCGCTGCGT AACGCGCCGT TTATGGTTGA GGCGCGGGCG CTTTCGGTAG AGACGGTGGA CTCAAAAGTG CTGAATCTGG CGCGGGAAGA TATTGTCTGG CATTCCGTCA GCGAGTTGAT TACCGATGTA CCTGACAAAG AAATGCTCGG GCTGAACATT GTGGAATTTG CTGGTGATGA TGAGGCGCTG ATTGACGAGC GAGTTAATGC ACTCTGTGTG CGGCTTGATG AGCTGATTGC CAGCCAACAG GCAGGTGTGA TTGGCTGGCA GGTGTGCCGC GAACTGGCGG GCGTTGAACG TATCTATGCG ATGCGCAAAA AAGCCGTTGG TCTGCTTGGC AATGCCAAAG GTGCCGCTAA GCCAATTCCG TTTGCTGAGG ATACCTGCGT ACCGCCGGAA CATCTGGCCG ATTATATTGC TGAATTTCGC GCGCTGCTCG ACAGCCACGG CTTAAGCTAC GGTATGTTCG GTCACGTCGA CGCAGGTGTC TTGCACGTCC GTCCGGCACT GGATATGTGC GATCCACAAC AAGAGATTTT GATGAAGCAA ATCTCTGATG ACGTGGTGGC GCTGACTGCG AAATACGGTG GTTTGTTGTG GGGCGAGCAC GGCAAAGGTT TTCGCGCTGA ATACAGCCCG GCGTTTTTCG GTGAGGAACT TTTTGCAGAA CTGCGCAAAG TGAAAGCGGC ATTTGACCCG CATAACCGAC TCAACCCAGG GAAGATTTGC CCGCCAGAAG GTCTCGATGC GCCGATGATG AAAGTGGACG CGGTGAAGCG CGGTACATTT GATCGGCAGA TCCCCATTGC GGTACGCCAG CAGTGGCGCG GTGCGATGGA GTGTAACGGC AACGGTTTAT GCTTCAACTT TGATGCCCGT AGTCCGATGT GTCCGTCGAT GAAGATCACC CAGAACCGGA TTCATTCACC GAAAGGGCGC GCAACGCTGG TGCGTGAATG GCTGCGCTTG TTGGCGGATC GCGGCGTTGA CCCACTCAAA CTGGAACAAG AACTGCCTGA ATCCGGCGTC AGTTTACGGA CGTTAATTGC CCGCACGCGC AATAGCTGGC ATGCGAATAA AGGCGAATAC GACTTCTCAC ACGAAGTCAA AGAGGCTATG TCGGGCTGTC TGGCCTGTAA AGCGTGTTCG ACCCAGTGCC CCATCAAAAT TGATGTGCCG GAGTTTCGTT CTCGTTTTCT GCAGCTCTAT CACACCCGTT ATTTACGCCC GCTTCGTGAC CACCTCGTTG CTACGGTCGA GAGCTACGCA CCGCTGATGG CACGCGCGCC GAAAACCTTT AACTTCTTCA TTAACCAGCC GCTGGTGCGC AAACTCTCGG AAAAACATAT CGGCATGGTT GATTTGCCAC TGCTGTCAGT CCCCTCGCTA CAACAACAAA TGGTGGGGCA TCGCTCGGCG AATATGACGC TGGAACAGCT TGAAGCTCTC AATACAGAGC AGAAAGCGCG CACGGTGTTG GTGGTGCAGG ACCCCTTTAC CAGCTATTAC GATGCGCAAG TGGTGGCGGA TTTTGTCCGT CTGGTTGAAA AATTAGGTTT CCAGCCTGTG TTACTGCCAT TTTCGCCAAA TGGCAAAGCC CAGCATATCA AAGGCTTTCT TAATCGTTTT GCGAAGACGG CGAAAAAGAC GGCGGATTTC CTCAACCGTA TGGCGAAGCT GGGGATGCCA ATGGTGGGCG TCGATCCGGC GCTGGTACTT TGTTATCGCG ATGAATATAA ACTGGCGCTG GGCGATGAAC GTGGCGAGTT TAACGTCTTA CTGGCGAATG AATGGCTGGC AAGCGCACTT GACTCACAGC CAGTGGCTAC AGTCAGCGGT GAATCATGGT ATTTCTTTGG TCACTGTACC GAAGTTACCG CTTTGCCGGG CGCGCCAGCA CAATGGGCCG CGATATTTGC CCGTTTTGGC GCGAAACTGG AAAATGTCAG CGTGGGTTGC TGCGGCATGG CAGGGACTTA CGGACATGAA GCGAAAAACC ATAAAAATTC GCTCGGGATC TATGAGTTAT CCTGGCATCA GGCGATGCAG CGACTGCCGC GTAATCGCTG TCTGGCGACC GGATATTCCT GCCGTAGCCA GGTAAAACGG GTCGAAGGCA CGGGGGTACG CCATCCTGTG CAGGCTTTAC TGGAGATTAT TAAATGA
|
Protein sequence | MIPQISQAPG VVQLVLNFLQ ELEQQGFTGD TATSYADRLT MSTDNSIYQL LPDAVVFPRS TADVALIARL AAQERYSSLI FTPRGGGTGT NGQALNQGII VDMSRHMNRI IEINPAEGWV RVEAGVIKDQ LNQYLKPFGY FFAPELSTSN RATLGGMINT DASGQGSLVY GKTSDHVLGV RAVLLGGDIL DTQPLPVELA ETLGKSNTTI GRIYNTVYQR CRQQRQLIIN NFPKLNRFLT GYDLRHVFND EMTEFDLTRI LTGSEGTLAF ITEARLDITR LPKVRRLVNV KYDSFDSALR NAPFMVEARA LSVETVDSKV LNLAREDIVW HSVSELITDV PDKEMLGLNI VEFAGDDEAL IDERVNALCV RLDELIASQQ AGVIGWQVCR ELAGVERIYA MRKKAVGLLG NAKGAAKPIP FAEDTCVPPE HLADYIAEFR ALLDSHGLSY GMFGHVDAGV LHVRPALDMC DPQQEILMKQ ISDDVVALTA KYGGLLWGEH GKGFRAEYSP AFFGEELFAE LRKVKAAFDP HNRLNPGKIC PPEGLDAPMM KVDAVKRGTF DRQIPIAVRQ QWRGAMECNG NGLCFNFDAR SPMCPSMKIT QNRIHSPKGR ATLVREWLRL LADRGVDPLK LEQELPESGV SLRTLIARTR NSWHANKGEY DFSHEVKEAM SGCLACKACS TQCPIKIDVP EFRSRFLQLY HTRYLRPLRD HLVATVESYA PLMARAPKTF NFFINQPLVR KLSEKHIGMV DLPLLSVPSL QQQMVGHRSA NMTLEQLEAL NTEQKARTVL VVQDPFTSYY DAQVVADFVR LVEKLGFQPV LLPFSPNGKA QHIKGFLNRF AKTAKKTADF LNRMAKLGMP MVGVDPALVL CYRDEYKLAL GDERGEFNVL LANEWLASAL DSQPVATVSG ESWYFFGHCT EVTALPGAPA QWAAIFARFG AKLENVSVGC CGMAGTYGHE AKNHKNSLGI YELSWHQAMQ RLPRNRCLAT GYSCRSQVKR VEGTGVRHPV QALLEIIK
|
| |