Gene Information |
Locus tag | SbBS512_E1286 |
Symbol | |
ID | 6273229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1175200 |
End bp | 1177128 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641725407 |
Product | phage terminase large subunit (GpA) |
Protein accession | YP_001879918 |
Protein GI | 187731821 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
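The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (which is consistent with the 1929 bp reported above) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the record above.
length = gene_length(1175200, 1177128)
print(length)                  # 1929
print(protein_length(length))  # 642
```

Both results agree with the Gene Length and Protein Length rows of the record.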
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATATAT CAGAGCAACA ACTGAATAAT ATGATGAGTG CTGTCACAAC AGCATTACAG CCCCTGATAA GGGCATTGCC GGTGACGCCA GTTGAATGGG CTGATCAAAA TTATTATCTG CCTAAAGAAT CTTCATATGG TGAGGGAGAA TGGAAAACGC TGCCGTTCCA GGTCGCCATT ATGAACTGTA TGGGTAACGA CCAGGTTCGC ACGGTTAACC TGATTAAATC TGCCCGTGTT GGCTATACAA AGATGTTGCT GGGGGTGGTC GGGTATTTTA TTGAGCATAA ATCACGAAAC AGTCTGCTTT TTCAGCCCAC GGATTCTGCC GCTGAAGATT TTATGAAGTC TCACGTGGAG GCGACGATTC GGGATGTGCC ATGCCTGAAA GACCTTTCCC CATGGCTGGG TCGTAAACAT CGTGATAATA CCCTCACGCT GAAACGTTTT TCATCGGGCG TGGGCTTCTG GTGCCTGGGC GGGGCTGCCG CTAAAAACTA CCGTGAAAAA TCCGTGGACG TGGTCTGCTA TGACGAACTT TCCTCGTTCG AACCGGATGT CGAAAAAGAG GGTTCGCCAA CCCTGCTGGG GGATAAACGT ATTGAGGGCT CTGTATGGCC CAAATCCATT CGCGGCTCGA CGCCTAAAAT CAAAGGCACC TGCCAGATCG AAAAAGCGGC CAACGAGTCG GCGCATTTTA TGCGTTTTTA TGTGCCCTGC CCACACTGTG GGGAGGAGCA GTATCTGAAA TTTGGCGATG AGTCCACGCC TTTTGGGCTT AAATGGGAGA AGGACAGTCC CGAAAGTGTT TTCTACCTCT GTGAACATCA TGGCTGCGTG ATCCATCAGT CTGAACTGGA CCAGAGCAAC GGGCGGTGGA TCTGTGAAAA CACGGGCATG TGGACCCGTG ACGGTCTGAC GTTTTTCAGC GCTGCGGGTA ATGAAATTCC GCCGCCGCGC TCCATCACGT TCCATATCTG GACGGCGTAC AGTCCGTTTA CCACCTGGGT ACAGATAGTC TATGACTGGC TGGATGCACT GAAAGATCCC AACGGCCTGA AAACCTTTGT GAACACCACG CTGGGCGAGA CCTGGGAAGA AGCCGTGGGC GAAAAACTCG ATCACCAGGT ACTGATGGAT AAGGTCGTGC ATTACACGGC GGCGGTACCT GCCCGGGTGG TTTATCTGAC GGCGGGCATT GACTCGCAGC GAAACCGTTT TGAGATGTAT GTCTGGGGAT GGGCACCGGG AGAGGAAGCT TTTCTGGTGG ATAAAATCAT CATTATGGGC CGTCCCGATG AGGAAGAGAC GCTGTTACGT GTGGATGCGG CGATCAACAA AAAATACTGC CATGCAGACG GAACCGAAAT GACCATTTCC CGTGTCTGCT GGGACACCGG GGGGATCGAT GGTGAAATTG TCTATCAGAG GTCAAAAAAA CACGGTGTTT TCCGGGTGCT GCCGGTAAAA GGCGCATCTG TCTATGGCAA GCCGGTGATC ACCATGCCGA AAACCCGCAA TCAGCGGGGC GTGTATCTGT GTGAAGTGGG GACGGACACC GCAAAAGAAA TTCTCTATGC CCGTATGAAA GCCGATCCCA CGCCTGCGGA TGAAGCCACG TCGTATGCCA TCCGTTTTCC TGATGATCCG GAGATTTTTT CGCAGACAGA GGCGCAGCAA CTGGTCGCGG AAGAGCTTGT GGAGAAGTGG GAAAAAGGAA AGATGCGTCT GCTGTGGGAT AACAAAAAGC GGCGTAACGA AGCGCTGGAC TGCCTGGTGT ATGCCTACGC GGCATTACGT 
GTGTCCGTGC AACGCTGGCA GCTTGATCTG GCTGTACTGG CAAAATCCCG GGAAGAAGAG ACGACCCGGC CAACCCTTAA AGAACTGGCA GCGAAGCTGT CCGGAGGAGT GAATGGTTAC AGTCGCTGA
Protein sequence | MNISEQQLNN MMSAVTTALQ PLIRALPVTP VEWADQNYYL PKESSYGEGE WKTLPFQVAI MNCMGNDQVR TVNLIKSARV GYTKMLLGVV GYFIEHKSRN SLLFQPTDSA AEDFMKSHVE ATIRDVPCLK DLSPWLGRKH RDNTLTLKRF SSGVGFWCLG GAAAKNYREK SVDVVCYDEL SSFEPDVEKE GSPTLLGDKR IEGSVWPKSI RGSTPKIKGT CQIEKAANES AHFMRFYVPC PHCGEEQYLK FGDESTPFGL KWEKDSPESV FYLCEHHGCV IHQSELDQSN GRWICENTGM WTRDGLTFFS AAGNEIPPPR SITFHIWTAY SPFTTWVQIV YDWLDALKDP NGLKTFVNTT LGETWEEAVG EKLDHQVLMD KVVHYTAAVP ARVVYLTAGI DSQRNRFEMY VWGWAPGEEA FLVDKIIIMG RPDEEETLLR VDAAINKKYC HADGTEMTIS RVCWDTGGID GEIVYQRSKK HGVFRVLPVK GASVYGKPVI TMPKTRNQRG VYLCEVGTDT AKEILYARMK ADPTPADEAT SYAIRFPDDP EIFSQTEAQQ LVAEELVEKW EKGKMRLLWD NKKRRNEALD CLVYAYAALR VSVQRWQLDL AVLAKSREEE TTRPTLKELA AKLSGGVNGY SR
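The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table from the standard compact amino-acid string (translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons), strips the spacing used in the 10-base blocks above, and stops at the first stop codon. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical. Alternative start codons are not handled, which is acceptable here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGAATATAT CAGAGCAACA ACTGAATAAT"))  # MNISEQQLNN
```

The output matches the first block of the protein sequence. Passing the full gene sequence (both lines joined) through the same function should yield the full protein listed in the record.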