Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1496 |
Symbol | |
ID | 6142960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1480177 |
End bp | 1481466 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616374 |
Product | hypothetical protein |
Protein accession | YP_001743554 |
Protein GI | 170684211 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.403861 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGATG ACAAATTTGA TGCCATTGTG GTCGGTGCGG GCGTTGCCGG TAGCGTTGCC GCACTGGTCA TGGCGCAAGC CGGGCTGGAT GTCCTGGTGA TAGAACGCGG CGACAGTGCC GGATGTAAAA ACATGACCGG CGGGCGTCTT TATGCCCACA CACTTGAAGC GATTATTCCA GGCTTTGCAG CATCAGCGCC GGTAGAACGC AAGGTCACAC GCGAGAAAAT CTCCTTCTTA ACCGATGAGA GTGCCGTTAC CCTCGATTTT CACCGCGAGC AACCAGATGT TCCGCAACAC GCGTCTTATA CCGTATTGCG TAATCGTCTG GACCCGTGGT TGATGGAACA AGCCGAACAG GCTGGCGCGC AGTTTATCCC GGGCGTTCGT GTCGATGCGC TGGTTCGTGA AGGAAATAAG GTCACTGGCG TCCAGGCCGG GGATGATATT CTCGAAGCGA ATGTGGTGAT TCTGGCTGAT GGCGTTAACT CGATGCTTGG CCGCTCGCTG GGAATGGTTC CCGCTTCCGA TCCGCATCAT TACGCTGTTG GTGTTAAAGA GGTTATTGGC CTCACACCAG AACAGATCAA GGATCGCTTT AATATTACGG GCGAGGAAGG TGCCGCCTGG CTGTTTGCCG GTTCCCCTTC TGACGGCCTG ATGGGCGGCG GATTCCTCTA TACCAATAAG GATTCCATAT CGTTGGGGCT GGTTTGTGGA TTGGGTGATA TCGCCCATGC GCAAAAAAGC GTGCCGCAAA TGCTGGAAGA TTTTAAACAA CACCCCGCCA TTCACCCGCT GATTAGCGGC GGCAAACTGC TTGAATATTC CGCGCATATG GTGCCGGAGG GCGGTCTGGC AATGGTGCCG CAGCTGGTTA ACGATGGCGT GATGATCGTT GGTGACGCCG CAGGCTTCTG CCTGAATTTG GGTTTTACAG TTCGCGGCAT GGATTTAGCC ATTGCATCAG CTCAGGCTGC CGCCACAACA GTGATTGCAG CCAAAGAACG CGCGGATTTC TCCGCCAGCA GTCTGGCGCA ATACAAACGT GAGCTGGAAC AAAGCTGCGT CATGCGTGAT ATGCAGCATT TTCGCAAGAT CCCGGCGCTG ATGGAAAATC CGCGCCTGTT TAGTCAGTAT CCGCGCATGG TCGCCGACAT CATGAACGAG ATGTTCACCA TTGACGGCAA ACCAAACCAG CCGGTACGCA AAATGATCAT GGGACACGCG AAGAAAATTG GGCTGATCAA CTTGCTGAAA GATGGCATTA AGGGAGCAAC CGCGCTATGA
|
Protein sequence | MSDDKFDAIV VGAGVAGSVA ALVMAQAGLD VLVIERGDSA GCKNMTGGRL YAHTLEAIIP GFAASAPVER KVTREKISFL TDESAVTLDF HREQPDVPQH ASYTVLRNRL DPWLMEQAEQ AGAQFIPGVR VDALVREGNK VTGVQAGDDI LEANVVILAD GVNSMLGRSL GMVPASDPHH YAVGVKEVIG LTPEQIKDRF NITGEEGAAW LFAGSPSDGL MGGGFLYTNK DSISLGLVCG LGDIAHAQKS VPQMLEDFKQ HPAIHPLISG GKLLEYSAHM VPEGGLAMVP QLVNDGVMIV GDAAGFCLNL GFTVRGMDLA IASAQAAATT VIAAKERADF SASSLAQYKR ELEQSCVMRD MQHFRKIPAL MENPRLFSQY PRMVADIMNE MFTIDGKPNQ PVRKMIMGHA KKIGLINLLK DGIKGATAL
|
| |