Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1014 |
Symbol | |
ID | 6268463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 936922 |
End bp | 937980 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641725158 |
Product | putative prophage tail fiber protein |
Protein accession | YP_001879680 |
Protein GI | 187730611 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTAG TGATATCAGG TGCGCTGACT GATGGCGCAG GCATCCCCAT GTCCGGATGC CAGATTATTC TGAAATCCCG TGTAAACACC TCAGAAGTGG TGATGCGTAC CAGGGCTGAT GTAGTGACCG GAAATAACGG TGAATATTCG TTTGAGGCAC AGGTCGGAAA ATATTGCGTG TATCTGAAAC GGGACTGGCG CGACGAGTAC TGTGTTGGCG ACATTGCTGT ATACGACGAC TCAAAGCCCG GCACTCTGAA AACGGCTAAC CATCTGTCTG AAATCGCAGC AGCAGGCGAA AAGGCACAAC AGAAGTCCCG GGATAATCTG GGGCTGAAAA GTGCGGCCAC GATGGAAGCA CAGAGCGACA TTTACGACCG GACAAAAGGC CGTCTGGCGA TACCCGGCGC ATTCGGCTTT GGGTGTGCTT TTCTGCCTGA AGATGTTATC CGTTTTGACA CTAAGAGTGA TTTCCAGGCC TGGGTAAGGA ATGCGCTGCC AGGTGAATAT TCCGTTGCTG GCCCCTACGA CATCATCATA CCCGACACAC GGTTTGAAGG GGTGCTCAGC ATCCGGTGGA CTGATGCACG CCCTGAGACA ACAGAACCGC GGTACAGAGC CAAATCCCTT ACTTTTTACG GCATTAACGG CCCCATTTAT CACACCCGCT ACTGCTACTG GCCCATATCC AGACTGACTG ACTGGGTGAA AATAAATATA ACCACAGAAG ATATTATTTA CAGAATCGTG GCGAGCTCTG TCCGCAACAG ATGGGGAGAC CCTGACATTG GCGGGCTGAT TATTGCTGCG TACCAGGGAG AAGCTGACGG TGATAAAGTC ATCAGACTTG TCAGGGGGCA GTCATACAGA GGCTCACGAC TGGGACCGGT GGGGATTTCA GTGCCCAGTA CTCCCACCGG AACGTATATA GCATCCCCAC AATTTTTCAT TACGGGATGT TCAGAGCATT CATTACCGGG GTCATATTGC GCCCTGTCCG GGGTGCCGGA TGCTCATGTC TCTGGCGCAA TGCCCGGGCT TTTTATTCGC ACATCGTGA
|
Protein sequence | MSVVISGALT DGAGIPMSGC QIILKSRVNT SEVVMRTRAD VVTGNNGEYS FEAQVGKYCV YLKRDWRDEY CVGDIAVYDD SKPGTLKTAN HLSEIAAAGE KAQQKSRDNL GLKSAATMEA QSDIYDRTKG RLAIPGAFGF GCAFLPEDVI RFDTKSDFQA WVRNALPGEY SVAGPYDIII PDTRFEGVLS IRWTDARPET TEPRYRAKSL TFYGINGPIY HTRYCYWPIS RLTDWVKINI TTEDIIYRIV ASSVRNRWGD PDIGGLIIAA YQGEADGDKV IRLVRGQSYR GSRLGPVGIS VPSTPTGTYI ASPQFFITGC SEHSLPGSYC ALSGVPDAHV SGAMPGLFIR TS
|
| |