Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1478 |
Symbol | |
ID | 6273226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1348276 |
End bp | 1349886 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641725578 |
Product | phage protein |
Protein accession | YP_001880084 |
Protein GI | 187732421 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGCTTG AGGATGCGAG CACGACGAAA AAGGGGATAG TACAGCTCAG CAGTGCGACT AACAGCACTT CCGAGTCACT GGCGGCAACG CCAAAAGCCG TTAAGGCCGC GTATGAGCTG GCTAACGGGA AATACACCGC ACAGGATGCA ACGACAGCAC AGAAAGGGAT AGTTCAGCTT AGCAACGCGA CCAACAGCAC ATCTGAAATG CTGGCGGCAA CGCCAAAGTC GGTAAAGGCA GCCTATGACC TTGCTAACGG GAAATATACT GCTCAGGACG CTACGACAGC ACAAAAAGGA ATTGTCCAGC TCAGTAGTGC AACCAACAGC GCATCTGAAA CGCTTGCCGC GACACCGAAA GCAGTGAAAG CAGCTAATGA TAATGCGAAT GGTCGGGTAC CTTCTGCCCG TAAGGTGAAT GGTAAGGCGC TTTCAGCGGA TATAACACTG ACGCCGAAAG ATATTGGTAC GCTTAACTCA ACAACAATGT CATTCAGCGG TGGTGCTGGT TGGTTCAAAT TAGCAACGGT AACCATGCCA CAGGCGAGTT CTGTTGTTTC AATTACGTTG ATTGGTGGTG CGGGATTTAA CGTGGGGTCA CCTCAACAGG CAGGTATATC TGAACTTGTT TTGCGTGCAG GTAATGGTAA TCCGAAGGGG ATTACTGGTG CTTTATGGCA GCGCACATCG ACAGGGTTTA CAAATTTTGC CTGGGTCAAT ACATCTGGTG ATACTTACGA TATTTACGTT GCAATCGGAA ATTATGCGAC TGGTGTAAAT ATTCAATGGG ATTATACCAG TAATGCCAGC GTGACGATTC ATACGTCACC AGCATATTCT GCTAATAAGC CGGAAGGGTT AACGGACGGT ACAGTTTATT CACTCTATAC GCCATCAGAG CAGTTTTATC CTCCTGGCGC ACCAATCCCG TGGCCATCAG ATACCGTTCC GTCTGGCTAT GCCCTGATGC AGGGGCAGAC TTTTGACAAA TCTGCATACC CGAAACTTGC AGCCGCTTAT CCGTCAGGCG TGATCCCTGA TATGCGTGGC TGGACGATTA AGGGCAAACC CGCCAGTGGT CGTGCCGTAT TGTCTCAGGA ACAGGACGGC ATTAAATCGC ACACCCACAG CGCCAGCGCA TCCAGTACGG ATTTGGGGAC GAAAAACACA TCGTCGTTTG ATTACGGAAC CAAATCCACG AATAACACCG GGGCGCATAC GCACAGTCTG AGTGGCTCTA CGGGGTCTGC CGGTGATCAT ACTCATGGTA ATGGTATTCG TTGGCCAGGA GGCGGCGGTT CTGCGTTAGC ATTTTATGAT GGCGGTGGGT TCACTTATGT CCAGGATTCA CAGTATCAAG TAAGCCCGGG GACTTCTTCC CGTAGATTGT ATTATCAACG TATTCAGACA CAGTCAGCAG GTGCTCATAC CCACTCGCTG TCTGGTACTG CAGCAAGTTC TGGCGCACAT GCACATACTG TAGGTATTGG TGCGCATACG CACTCCGTTG CGATTGGTTC ACATGGACAC ACCATCACCG TTAACGCTGC TGGTAACGCG GAAAACACCG TCAAAAACAT CGCATTTAAC TATATTGTGA GGCTTGCATA A
|
Protein sequence | MALEDASTTK KGIVQLSSAT NSTSESLAAT PKAVKAAYEL ANGKYTAQDA TTAQKGIVQL SNATNSTSEM LAATPKSVKA AYDLANGKYT AQDATTAQKG IVQLSSATNS ASETLAATPK AVKAANDNAN GRVPSARKVN GKALSADITL TPKDIGTLNS TTMSFSGGAG WFKLATVTMP QASSVVSITL IGGAGFNVGS PQQAGISELV LRAGNGNPKG ITGALWQRTS TGFTNFAWVN TSGDTYDIYV AIGNYATGVN IQWDYTSNAS VTIHTSPAYS ANKPEGLTDG TVYSLYTPSE QFYPPGAPIP WPSDTVPSGY ALMQGQTFDK SAYPKLAAAY PSGVIPDMRG WTIKGKPASG RAVLSQEQDG IKSHTHSASA SSTDLGTKNT SSFDYGTKST NNTGAHTHSL SGSTGSAGDH THGNGIRWPG GGGSALAFYD GGGFTYVQDS QYQVSPGTSS RRLYYQRIQT QSAGAHTHSL SGTAASSGAH AHTVGIGAHT HSVAIGSHGH TITVNAAGNA ENTVKNIAFN YIVRLA
|
| |