Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0843 |
Symbol | ybiS |
ID | 6142612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 846503 |
End bp | 847423 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615731 |
Product | hypothetical protein |
Protein accession | YP_001742923 |
Protein GI | 170683699 |
COG category | [S] Function unknown |
COG ID | [COG1376] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATGA AATTGAAAAC ATTATTCGCA GCGGCCTTCG CTGTTGTCGG CTTTTGCAGT ACCGCCTCTG CGGTAACTTA TCCTCTGCCA ACCGACGGGA GTCGCCTGGT TGGTCAGAAT CAGGTGATCA CCATTCCTGA AGGTAACACT CAGCCGCTGG AGTATTTTGC CGCCGAGTAC CAGATGGGGC TTTCCAATAT GATGGAAGCG AACCCGGGTG TGGATACCTT CCTGCCGAAA GGCGGTACTG TACTGAACAT TCCGCAGCAG CTGATCCTGC CGGATACCGT TCATGAAGGC ATCGTCATTA ACAGTGCAGA GATGCGTCTG TATTACTATC CGAAAGGGAC CAACACCGTT ATCGTGCTGC CGATCGGCAT TGGTCAGTTA GGCAAAGATA CGCCTATCAA CTGGACCACC AAAGTTGAGC GTAAGAAAGC AGGCCCGACC TGGACGCCGA CCGCCAAAAT GCACGCAGAG TACCGCGCTG CGGGCGAACC GCTTCCGGCT GTCGTTCCGG CAGGTCCGGA TAACCCGATG GGGCTGTATG CACTCTACAT CGGTCGCCTG TACGCTATCC ATGGCACCAA CGCCAACTTC GGTATCGGCC TGCGTGTAAG TCATGGTTGT GTGCGTCTGC GTAACGAAGA CATAAAATTC CTGTTCGAGA AAGTACCGGT CGGTACCCGC GTACAGTTTA TTGATGAGCC GGTAAAAGCG ACCACCGAGC CAGACGGCAG CCGTTATATT GAAGTCCATA ACCCGCTGTC TACCACCGAA GCCCAGTTTG AAGGTCAGGA AATTGTGCCA ATTACCCTGA CCAAGAGCGT GCAGACAGTA ACTGGTCAGC CTGATGTGGA TCAGGTCGTG CTGGATGAAG CGATCAAAAA TCGTTCCGGG ATGCCGGTTC GTCTGAATTA A
|
Protein sequence | MNMKLKTLFA AAFAVVGFCS TASAVTYPLP TDGSRLVGQN QVITIPEGNT QPLEYFAAEY QMGLSNMMEA NPGVDTFLPK GGTVLNIPQQ LILPDTVHEG IVINSAEMRL YYYPKGTNTV IVLPIGIGQL GKDTPINWTT KVERKKAGPT WTPTAKMHAE YRAAGEPLPA VVPAGPDNPM GLYALYIGRL YAIHGTNANF GIGLRVSHGC VRLRNEDIKF LFEKVPVGTR VQFIDEPVKA TTEPDGSRYI EVHNPLSTTE AQFEGQEIVP ITLTKSVQTV TGQPDVDQVV LDEAIKNRSG MPVRLN
|
| |