Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0110 |
Symbol | |
ID | 6273265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 121033 |
End bp | 122886 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641724366 |
Product | hypothetical protein |
Protein accession | YP_001878925 |
Protein GI | 187731189 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0222206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATGA CTTTGCCGTT TAAACCCCAT GTGCTGGCAC TAATTTGCAG TGCCGGGCTT TGTGCCGCCT CTGCCGGGCT ATATATAAAA AGCCGCACAG TGGAAGCGGC TGTGGAAACG CAATCGACAC AACTGGCTGT GTCTGACGCT GCCGCGGTTA CGCTTCCTGC AACGGTTTCC GCGCCTCCCG TAACACCCGC CGTCGTTAAA TCCGCATTCA GCACTGCACA AATAGATCAA TGGGTCGCGC CCGTCGCGCT GTATCCCGAC GCCCTACTTT CGCAGGTGCT GATGGCATCA ACCTATCCGG CAAACGTTGC TCAAGCAGTG CAATGGTCGC ACGATAATCC ACTTAAACAA GGCGATGCTG CTATTCAGGC GGTATCTGAC CAGCCGTGGG ACGCCAGCGT TAAATCACTG GTGGCCTTTC CACAATTGAT GGCATTGATG GGCGAAAACC CGCAATGGGT GCAAAACCTG GGCGATGCTT TTCTGGCGCA GCCGCAGGAC GTGATGGACT CGGTACAGCG ATTGCGGCAA CTGGCGCAAC AAACCGGTTC GCTGAAGTCA TCAAACGAAC AAAAAGTTAT TACCACAACG AAGAAAGCTG TACCGGTAAA ACAGACAGTC ACGGCACCCG TCATACCATC CAATACCGTT TTAACTGCCA GCCCCGTCAT TACAGAGCCT GCAACAACCG TCATTTCCAT TGAGCCCGCC AATCCTGATG TGGTCTATAT TCCCAACTAC AACCCAACCG TGGTTTACGG GAACTGGGCC AATACTGCGT ATCCGCCGGT TTATCTGCCA CCACCAGCCG GAGAACCGTT TATTGACAGC TTTGTGCGCG GATTCGGCTA TAGCATGGGT GTTGCTACCA CGTACGCACT ATTCAGCAGC ATCGACTGGG ATGACGACGA TCATGACCAT CATCATCATG ACGATGATGA TTATCATCAC CACGATGGCG GTCATCGTGA CGGTAATGGC TGGCAACACA ACGGCGACAA CATCAATATC GACGTCAACA ATTTCAACCG TATCACCGGT GAGCATCTTA CTGATAAGAA TATGGCATGG CGGCACAATC CAAACTACCG TAATGGTGTG CCCTATCATG ATCAGGATAT GGCAAAGCGG TTTCATCAAA CCGATGTCAA CGGCGGAATG AGTGCCACGC AGCTACCTGC TCCAACACGC GACAGCCAGC GTCAGGCGGC AGCAAGTCAG TTTCAGCAAC GAACACACGC CGCCCCCGTC ATTACACGAG ATACCCAACG TCAGGCAGCG GCACAGCGGT TTAATGAAGC TGAACACTAT GGGAGCTATG ACGACTTCCG CGACTTCAGC CGTCGCCAAC CCCTGACCCA GCAACAAAAG GACGCCGCTC GTCAGCGTTA TCAGTCAGCT TCTCCTGAGC AGCGCCAGGC AGTTCGCGAG AGAATGCAGA CTAACCCGCA GATCCAGCAG CGAAGAGAGG CAGCGCGTGA GCGCATTCAG CCCGCCTCGC CTGAGCAGCG CCAGGCAGTC CGCGAGAAAA TGCAGACTAA CCCACAGATC CAGCAGCGAA GAGACGCAGC GCGTGAGCGT ATTCAGTCAG CCTCGCCTGA GCAGCGCCAG GTGTTTAAGG AAAAAGTACA GCAGCGCCCA CTGAACCAAC AGCAACGTGA TAACGCCCGC CAGCGTGTTC AATCAGCATC ACCTGAACAA CGTCAGGTTT TTCGGGAGAG AGTTCAGGAG AGCCGCCCAC AACGTCTAAA CGACAGTAAC CGTACTGCCA GATTGAATAA CGATCAACGG TCAGCAGTAC GCGAACGTCT CTCTGAGCGC GGAGCAAGGC GACTGGAAAG GTAA
|
Protein sequence | MKMTLPFKPH VLALICSAGL CAASAGLYIK SRTVEAAVET QSTQLAVSDA AAVTLPATVS APPVTPAVVK SAFSTAQIDQ WVAPVALYPD ALLSQVLMAS TYPANVAQAV QWSHDNPLKQ GDAAIQAVSD QPWDASVKSL VAFPQLMALM GENPQWVQNL GDAFLAQPQD VMDSVQRLRQ LAQQTGSLKS SNEQKVITTT KKAVPVKQTV TAPVIPSNTV LTASPVITEP ATTVISIEPA NPDVVYIPNY NPTVVYGNWA NTAYPPVYLP PPAGEPFIDS FVRGFGYSMG VATTYALFSS IDWDDDDHDH HHHDDDDYHH HDGGHRDGNG WQHNGDNINI DVNNFNRITG EHLTDKNMAW RHNPNYRNGV PYHDQDMAKR FHQTDVNGGM SATQLPAPTR DSQRQAAASQ FQQRTHAAPV ITRDTQRQAA AQRFNEAEHY GSYDDFRDFS RRQPLTQQQK DAARQRYQSA SPEQRQAVRE RMQTNPQIQQ RREAARERIQ PASPEQRQAV REKMQTNPQI QQRRDAARER IQSASPEQRQ VFKEKVQQRP LNQQQRDNAR QRVQSASPEQ RQVFRERVQE SRPQRLNDSN RTARLNNDQR SAVRERLSER GARRLER
|
| |