Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3996 |
Symbol | |
ID | 6272573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3726096 |
End bp | 3727424 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641727842 |
Product | hypothetical protein |
Protein accession | YP_001882274 |
Protein GI | 187731127 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03368] cellulose synthase operon protein YhjU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGGGG CCATTTTTGT TTTATTAGTG GCCTGGTTAT TCCTGTCACA ATGGATTCGC ATTACCGTTT TTGTGGTTGC CATACTGCTA TGGCTGAACG TACTTACCCT GGCGGGACCA AGTTTCTCCT TGTGGCCAGC CGGACAACCG ACGACCACTG TAACAACGAC GGGTGGTAAC GCAGCGGCAA CCGTTGCGAC GACGGGTGGC GCACCGGTAG TGGGTGATAT GCCCGCACAA ACTACACCGC CAACAACGGC GAACCTTAAC GCCTGGCTGA ATAATTTCTA TAACGCGGAG GCGAAACGTA AATCGACCTT CCCGTCTTCG CTGCCCGCTG ATGCTCAGCC ATTTGAACTA CTGGTGATTA ACATCTGTTC GCTTTCCTGG TCGGATATAG AAGCCGCCGG GTTGATGTCG CATCCACTGT GGTCGCATTT CGATATTGAG TTCAAGAACT TTAACTCCGC CACCTCCTAC AGTGGCCCGG CGGCGATCCG TTTACTGCGC GCCAGCTGCG GGCAGACTTC GCACACTAAT CTGTATCAAC CGGCAAATAA CGACTGCTAT CTGTTTGATA ACCTTTCGAA ACTGGGCTTT ACCCAGCACC TGATGATGGG ACATAACGGC CAGTTCGGCG GTTTTTTGAA AGAAGTTCGC GAAAATGGCG GCATGCAGAC TGAATTGATG GATCAAACAA ATCTGCCGGT TATTTTGCTG GGCTTTGATG GTTCGCCGGT TTATGACGAT ACCGCCGTGC TTAACCGCTG GCTGGACGTT ACCGAAAAAG ATAAAAACAG CCGTAGTGCC ACGTTCTACA ACACGCTTCC ACTGCATGAC GGCAACCATT ATCCGGGGGT CAGCAAAACA GCGGATTACA AAGCGCGGGC GCAGAAATTC TTTGATGAAC TGGACGCCTT CTTTACTGAA CTGGAGAAAT CGGGTCGTAA AGTGATGGTG GTCGTGGTGC CGGAACACGG CGGCGCGCTG AAGGGCGACA GAATGCAGGT ATCTGGCCTA CGTGATATCC CTAGCCCGTC TATCACAGAC GTCCCCGTTG GGGTGAAATT CTTCGGCATG AAGGCACCAC ATCAGGGGGC ACCGATTGTC ATCGACCAAC CGAGCAGCTT CCTGGCTATC TCCGATCTGG TGGTTCGCGT TCTTGATGGC AAGATTTTCA CCGAAGACAA TGTTGACTGG AAAAAACTCA CCAGTGGGTT GCCACAAACA GCACCGGTCT CCGAGAACTC AAATGCAGTA GTTATTCAAT ACCAGGATAA ACCGTACGTT CGCCTGAACG GCGGCGACTG GGTGCCTTAC CCGCAGTAA
|
Protein sequence | MIGAIFVLLV AWLFLSQWIR ITVFVVAILL WLNVLTLAGP SFSLWPAGQP TTTVTTTGGN AAATVATTGG APVVGDMPAQ TTPPTTANLN AWLNNFYNAE AKRKSTFPSS LPADAQPFEL LVINICSLSW SDIEAAGLMS HPLWSHFDIE FKNFNSATSY SGPAAIRLLR ASCGQTSHTN LYQPANNDCY LFDNLSKLGF TQHLMMGHNG QFGGFLKEVR ENGGMQTELM DQTNLPVILL GFDGSPVYDD TAVLNRWLDV TEKDKNSRSA TFYNTLPLHD GNHYPGVSKT ADYKARAQKF FDELDAFFTE LEKSGRKVMV VVVPEHGGAL KGDRMQVSGL RDIPSPSITD VPVGVKFFGM KAPHQGAPIV IDQPSSFLAI SDLVVRVLDG KIFTEDNVDW KKLTSGLPQT APVSENSNAV VIQYQDKPYV RLNGGDWVPY PQ
|
| |