Gene Information |
Locus tag | SbBS512_E4635 |
Symbol | |
ID | 6273256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4332185 |
End bp | 4333378 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641728404 |
Product | ShiF protein |
Protein accession | YP_001882802 |
Protein GI | 187732469 |
COG category | |
COG ID | |
TIGRFAM ID | |
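The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that does not appear in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4332185
end_bp = 4333378

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an unspliced open reading frame whose last codon
# is a stop codon, which is not translated into an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1194 bp) and Protein Length (397 aa).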
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCACCAG GGATCGAAGA TACGCCCCAA AAAACACTGT CCTGCTGGCC ACTTGCGTTC AGTGCCGGGC TTCTCGGTAT CGGACAGAAC GGTCTGCTGG TTGTGCTCCC TGTTCTGGTC ATACAGACAA ATCTGAGTCT GTCTGTATGG GCTGCCCTGC TGATGCTTGG CTCAATGCTG TTTCTGCCAT CGTCCCCGTG GTGGGGAAAG CAAATTTCCC GTACTGGCAG TAAACCTGTG GTGTTGTGGG CTCTGGGAGG ATATGGCATA AGCTTTACCC TGCTAGGGCT GGGAAGCGTG CTGATGGCTA CCAGCGCCAT AACAACAGCG GTGGGACTGG GAATATTAAT CATCGCCCGG ATCGCCTACG GGCTGACCGT GTCGGCAATG GTGCCAGCCT GTCAGGTCTG GGCATTGCAG AGAGCTGGAG AAGGGAATCG CATGGCCGCT CTGGCAACCA TCAGCTCCGG CCTGAGTTGC GGCAGGCTAT TCGGGCCGCT GTGCGCGGCA GCAATGCTGG CCATTCATCC TCTGGCGCCA CTGGGGCTAC TGATGGCAGC ACCAGTGCTG GCGCTGCTGA TGCTGCTGCG GTTGCCCGGC ACACCACCAC AGCCCACACC GGAGTGCAAG AGCGTCAGTC TGAAGCGGGA TTGTCTGCCT TATCTGCTTT GCGCAATATT ACTGGCTGCG GCGGTGAGCA TGATGCAGCT TGGACTTTCG CCTGCCCTTA CTCGCCAGTT CGTCACGGAT ACCACCGCCA TTAGCCAACA GGTGGCCTGG CTGTTGGGTC TGTCCGCAGT AGCTGCGCTT ATCGCGCAGT TCGGGGTAGT CCGTCCGCAG CGCCTGACTC CGGTGGCCCT GCTCCTGAGT GCCGGAGTGT TGATGAGTGG TGGTCTGGCT ATCATGCTCT CCGAACAGCT ATGGTTGTTT TACCCGGGCT GTGCAGTGCT GTCATTTGGA GCAGCTCTGG CAACACCCGC TTATCAACTT CTACTGAATG ATAAGCTGGC TGATGGCGCA GGCGCGGGCT GGCTCGCTAC CAGTCACACA CTTGGCTATG GACTGTGCGC TTTGCTGGTG CCGCTGGTAT CGAAAACAGG TGTCGCAATC GCGCTGATTA TGGCAGCATT ATTTGCAACT ATATTATTTA CCATTGTGTC TGTATTTATC TGGCATTACC GTACTATCAA ATAA
Protein sequence | MSPGIEDTPQ KTLSCWPLAF SAGLLGIGQN GLLVVLPVLV IQTNLSLSVW AALLMLGSML FLPSSPWWGK QISRTGSKPV VLWALGGYGI SFTLLGLGSV LMATSAITTA VGLGILIIAR IAYGLTVSAM VPACQVWALQ RAGEGNRMAA LATISSGLSC GRLFGPLCAA AMLAIHPLAP LGLLMAAPVL ALLMLLRLPG TPPQPTPECK SVSLKRDCLP YLLCAILLAA AVSMMQLGLS PALTRQFVTD TTAISQQVAW LLGLSAVAAL IAQFGVVRPQ RLTPVALLLS AGVLMSGGLA IMLSEQLWLF YPGCAVLSFG AALATPAYQL LLNDKLADGA GAGWLATSHT LGYGLCALLV PLVSKTGVAI ALIMAALFAT ILFTIVSVFI WHYRTIK
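The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any calculation. The sketch below shows one way to strip the spacing, compute a GC percentage comparable to the GC content field, and check that a coding sequence begins with a start codon and ends with a stop codon. The function names are hypothetical, and the start codon set is an assumed common subset of those allowed under translation table 11.

```python
def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


def has_start_and_stop(
    seq: str,
    starts=("ATG", "GTG", "TTG"),  # assumption: common table 11 start codons
    stops=("TAA", "TAG", "TGA"),   # stop codons under translation table 11
) -> bool:
    """Check that the length is a multiple of 3 with start and stop codons."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in starts and s[-3:] in stops


# Toy input only; paste the full Gene sequence field to check the real record.
example = "ATGTCACCAG GGATCGAAGA TAA"
print(len(clean(example)), round(gc_percent(example), 1))
```

Passing the full Gene sequence field to `gc_percent` should give a value that can be compared with the reported GC content, subject to however the source database rounds it.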