Gene Information |
Locus tag | SbBS512_E1454 |
Symbol | |
ID | 6269814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1327230 |
End bp | 1329158 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641725555 |
Product | phage terminase large subunit (GpA) |
Protein accession | YP_001880061 |
Protein GI | 187730353 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
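The coordinate and length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
length_bp = gene_length(1327230, 1329158)
print(length_bp, protein_length(length_bp))
```

With the values in this record the sketch gives 1929 bp and 642 aa, matching the Gene Length and Protein Length fields.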
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTCAG ACGCACAGAA GGCAGCTAAT GCAGCCGGTG CGATAGCTAC AGGGCTTTTA TCTCTCATTA TTCCTGTTCC ACTGACGACA GTTCAGTGGG CCAATAAACA TTATTACCTT CCTAAAGAGT CGTCTTATAC CCCGGGGCGG TGGGAAACAC TGCCGTTTCA GGTTGGCATC ATGAACTGTA TGGGCAACGA TTTGATTCGC ACTGTTAACC TGATTAAATC TGCCCGTGTT GGTTATACAA AGATGTTGCT GGGAGTGGAG GCTTATTTTA TTGAGCATAA ATCACGCAAC AGCCTTCTTT TTCAGCCCAC GGACTCAGCT GCTGAAGATT TTATGAAATC TCATGTTGAG CCAACGATAA GGGATGTTCC TGCATTGCTG GAGCTGGCTC CATGGTTCGG AAGAAAACAC CGCGATAATA CGCTCACCCT GAAGCGTTTT TCCTCCGGTG TGGGTTTCTG GTGTCTGGGG GGAGCGGCAG CAAAAAACTA CCGTGAAAAA TCCGTGGATG TGGTTTGTTA TGACGAGCTT TCCTCGTTCG AACCGGATGT TGAAAAAGAG GGTTCGCCAA CCCTGCTGGG GGATAAACGT ATTGAGGGCT CTGTATGGCC AAAATCCATT CGCGGCTCGA CGCCTAAAAT CAAAGGCTCC TGCCAGATCG AAAAAGCCGC TAACGAGTCG GCACATTTCA TGCGTTTTTA TGTGCCCTGT CCGCACTGTG GGGAGGAGCA GTATCTGAAA TTTGGCGATG ATGCCTCGCC TTTCGGTCTT AAGTGGGAGA AGAATAAGCC AGAAAGTGTT TTCTACCTTT GTGAGCATCA TGGCTGTGTG ATCCATCAGT CTGAGCTTGA CCAGAGTAAC GGGCGGTGGA TCTGTGAAAA CACGGGCATG TGGACTCGTG ACGGTCTGAC ATTTTTCAGC GCCGCGGATA ATGAAATTCC GCCGCCGCGC TCCATCACAT TCCATATCTG GACGGCGTAC AGTCCGTTCA CCACCTGGGT ACAGATTGTC TATGACTGGC TGGATGCACT GAAAGATCCC AACGGCCTGA AAACCTTTGT GAACACCACG CTGGGCGAGA CCTGGGAAGA GGCCGTGGGC GAAAAACTCG ATCACCAGGT GCTGATGGAT AAGGTTGTGC GTTACACGGC TGCGGTGCCT TCCCGGGTGG TTTATCTGAC GGCGGGCATT GACTCGCAGC GAAACCGTTT TGAGATGTAT GTCTGGGGAT GGGCTCCGGG AGAGGAAGCC TTTCTGGTGG ATAAAATCAT CATTATGGGG CGTCCCGATG AGGAAGAGAC GCTGTTACGT GTGGATGTGG CGATCAACAA AAAATACCGC CATGCAGACG GAACCGAAAT GACCATTTCC CGTGTCTGCT GGGACACCGG GGGGATCGAT GGCGAAATTG TCTATCAGAG GTCAAAAAAA CACGGTGTTT TCCGGGTGCT GCCGGTAAAA GGTGCATCTG TTTATGGCAA GCCGGTGATC ACCATGCCAA AAACCCGCAA TCAGCGGGGC GTGTATCTGT GCGAAGTGGG GACGGACACC GCAAAAGAAA TTCTCTATGC CCGTATGAAA GCCGATCCCA CGCCTGCGGA TGAAGCCACG TCGTATGCCA TCCGTTTTCC TGATGATCCG GAGATTTTTT CGCAGACAGA GGCGCAGCAA CTGGTGGCGG AAGAGCTTGT GGAGAAGTGG GAAAAAGGAA AGATGCGTCT GCTGTGGGAT AACAAAAAGC GGCGTAACGA AGCGCTGGAC TGCCTGGTGT ATGCCTACGC GGCATTACGT 
GTGTCCGTGC AACGCTGGCA GCTTGATCTG GCTGTACTGG CAAAATCCCG GGAAGAAGAG ACGACCCGGC CAACCCTTAA AGAACTGGCA GCGAAGCTGT CCGGAGGAGT GAATGGTTAC AGTCGCTGA
|
Protein sequence | MISDAQKAAN AAGAIATGLL SLIIPVPLTT VQWANKHYYL PKESSYTPGR WETLPFQVGI MNCMGNDLIR TVNLIKSARV GYTKMLLGVE AYFIEHKSRN SLLFQPTDSA AEDFMKSHVE PTIRDVPALL ELAPWFGRKH RDNTLTLKRF SSGVGFWCLG GAAAKNYREK SVDVVCYDEL SSFEPDVEKE GSPTLLGDKR IEGSVWPKSI RGSTPKIKGS CQIEKAANES AHFMRFYVPC PHCGEEQYLK FGDDASPFGL KWEKNKPESV FYLCEHHGCV IHQSELDQSN GRWICENTGM WTRDGLTFFS AADNEIPPPR SITFHIWTAY SPFTTWVQIV YDWLDALKDP NGLKTFVNTT LGETWEEAVG EKLDHQVLMD KVVRYTAAVP SRVVYLTAGI DSQRNRFEMY VWGWAPGEEA FLVDKIIIMG RPDEEETLLR VDVAINKKYR HADGTEMTIS RVCWDTGGID GEIVYQRSKK HGVFRVLPVK GASVYGKPVI TMPKTRNQRG VYLCEVGTDT AKEILYARMK ADPTPADEAT SYAIRFPDDP EIFSQTEAQQ LVAEELVEKW EKGKMRLLWD NKKRRNEALD CLVYAYAALR VSVQRWQLDL AVLAKSREEE TTRPTLKELA AKLSGGVNGY SR
|
| |
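The relationship between the gene sequence and the protein sequence can be spot-checked by translating the nucleotide blocks. The following is a minimal sketch, not the database's own method: it builds the codon table from the standard NCBI amino acid string, whose assignments translation table 11 shares with the standard code (table 11 differs in its set of permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first position slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGATTTCAG ACGCACAGAA GGCAGCTAAT"))  # MISDAQKAAN
```

The output matches the first block of the protein sequence, and the final nine bases (AGTCGCTGA) translate to SR followed by a stop, matching the end of the protein.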