Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2173 |
Symbol | |
ID | 6142636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2179848 |
End bp | 2180957 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617048 |
Product | MOSC domain-containing protein |
Protein accession | YP_001744222 |
Protein GI | 170682018 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0633] Ferredoxin [COG3217] Uncharacterized Fe-S protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000104481 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.521589 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGACAT TAACCCGGCT TTTTATTCAT CCTGTTAAAT CGATGCGCGG CATTGGTCTT ACACATACCC TGGCAGATGT CAGTGGTCTG GCCTTCGATC GCATCTTTAT GATCACGGAA CCTGACGGTA CGTTTATTAC CGCTCGCCAG TTTCCGCTAA TGGTACGGTT TACTCCTTCA CCCGTGCATG ATGGCTTGCA TCTCACCGCA CCAGATGGCA GTAGTGCATA TGTTCGTTTT GCTGATTTCG CCACACAAGA CGCACCAACC GAAGTTTGGG GCACACATTT TACCGCGCGA ATTGCGCCAG ACGCGATCAA CAAATGGCTA AGTGGATTTT TCTCCCGCGA AGTGCAATTA CGCTGGGTGG GGCCACAAAT GACCCGGCGC GTGAAACGCC ACAACACTGT ACCCCTGTCA TTTGCTGATG GTTATCCTTA CCTTCTTGCT AACGAGGCCT CGTTACGTGA TCTCCAACAA CGTTGTCCGG CCAGTGTAAA AATGGAGCAA TTCCGCCCCA ATCTGGTGGT TTCCGGTGCA TCGGCCTGGG AAGAAGATAG CTGGAAAGTG ATTCGCATTG GTGATGTCGT GTTTGATGTG GTTAAGCCTT GTAGCCGCTG TATTTTCACC ACCGTCAGCC CAGAAAAAGG GCAAAAACAT CCGGCAGGCG AACCATTAAA AACGCTGCAA TCTTTCCGTA CTGCCCAGGA TAACGGCGAT GTCGATTTTG GTCAGAATTT AATTGCCCGT AATAGCGGCG TGATTCGCGT TGGCGATGAG GTGGAAATTC TGGCAACGGC TCCAGCTAAA ATTTACGGCG CAGGTGCCGC CGATGATACT GCCAACATCA CGCAACAACC GGACGCAAAC GTAGATATTG ACTGGCAGGG ACAGGCATTT CGTGGAAATA ACCAACAGGT GCTGCTGGAG CAATTAGAAA ATCAGGGAAT TCGTATCCCT TATTCTTGCC GCGCGGGCAT TTGTGGAAGT TGCCGTGTTC AGCTTTTAGA AGGCGAAGTC ACGCCGCTGA AAAAATCAGC AATGGGTGAT GATGGCACTA TTCTTTGCTG TAGTTGTGTA CCGAAGACTG CACTTAAGCT GACGCGTTAA
|
Protein sequence | MATLTRLFIH PVKSMRGIGL THTLADVSGL AFDRIFMITE PDGTFITARQ FPLMVRFTPS PVHDGLHLTA PDGSSAYVRF ADFATQDAPT EVWGTHFTAR IAPDAINKWL SGFFSREVQL RWVGPQMTRR VKRHNTVPLS FADGYPYLLA NEASLRDLQQ RCPASVKMEQ FRPNLVVSGA SAWEEDSWKV IRIGDVVFDV VKPCSRCIFT TVSPEKGQKH PAGEPLKTLQ SFRTAQDNGD VDFGQNLIAR NSGVIRVGDE VEILATAPAK IYGAGAADDT ANITQQPDAN VDIDWQGQAF RGNNQQVLLE QLENQGIRIP YSCRAGICGS CRVQLLEGEV TPLKKSAMGD DGTILCCSCV PKTALKLTR
|
| |