Gene Information |
Locus tag | SbBS512_E1467 |
Symbol | |
ID | 6268753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1337250 |
End bp | 1339823 |
Gene Length | 2574 bp |
Protein Length | 857 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641725568 |
Product | tail length tape measure protein |
Protein accession | YP_001880074 |
Protein GI | 187732238 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type; [TIGR01541] phage tail tape measure protein, lambda family |
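The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1337250
end_bp = 1339823

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced | No")
# and ends in a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the results agree with the Gene Length and Protein Length rows of the record.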
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCAGC CGGTTGGTGA TCTTGTTATT GACCTGAGTC TGGATGCTGT CCGTTTCGAT GAGCAGATGA GCCGGGTAAG GCGTCATTTT TCAGGTCTGG ATACCGACGC CAGAAAAACC GCCAGTGCTG TTGAACAGGG CCTGAGCCGC CAGGCGCTGG CTGCACAAAA AGCCGGGATT TCCGTCGGGC AGTATAAAGC GGCCATGCGA ACCCTGCCCG CACAGTTTAC GGATATCGCC ACGCAGCTTG CCGGTGGTCA GAATCCCTGG TTGATCCTGC TGCAACAGGG CGGTCAGGTG AAGGACTCCT TCGGCGGGAT GATCCCCATG TTCAGGGGGC TTGCCGGTGC GATCACCCTG CCGATGGTCG GGGTCACCTC GCTGGCGGTG GCGACAGGTG CGCTGGCGTA CGCCTGGTAC CAGGGCGACG CCACGCTTTC AGAATTTAAT AAAACGCTGG TCCTTTCCGG CAATCAGGCC GGACTGACTG CCGATCGTAT GCTGACGCTC TCAAGAGCCG GGCAGGCAGC AGGGCTGACG TTTAACCAGG CGAGAGAGTC ACTGGCAGCC CTGGTGAATG CCGGTGTGCG TGGTGGTGAA CAGTTTGATG CCATCAACCA GAGTGTCGCG CGTTTTGCGT CTGCATCCGG TGTGGAGGTG GATAAAGTCG CTGAAGCCTT CGGGAAGCTG ACCACTGACC CGACGTCGGG ACTGATGGCG ATGGCGCGCC AGTTCCGTAA CGTGACGGCA GAGCAGATTG CGTATGTTGC ACAGCTGCAG CGTTCCGGAG ACGAGGCCGG GGCATTGCAG GCGGCGAACG ATATCGCCAC GAAAGGCTTT GATGAGCAGA CCCGTCGCCT GAAAGAAAAC ATGGGAACAC TGGAGACCTG GGCGGATAAA ACAGGGAAGG CATTCAAATC GATGTGGGAT GCCATTCTGG ATATCGGTCG TCCGGAATCC TCAGCGGATA TGCTCGCCAG TGCGCAGAAG GCATTTGATG AGGCGGATAA AAAATGGCAG TGGTACCAGA GCCGGAGCCA GCGCCGGGGA AAGACCTCCT CTTTTCGTGC GAACCTTCAG GGGGCATGGG ATGACCGGGA AAATGCCCGT CTGGGTCTGG CAGCGGCCAC GCTGCAGTCG GATATGGAAA AAGCCGGTGA ACTGGCGGCA AGGGACAGGG CTGAGCGTGA GTCGTCACAG CTGAAGTATA CCGGAGAGGC GCAGAAGGCG TATGAGCGCC TGCAGACGCC GCTGGATAAA TATACCGCCC GTCAGAAAGA GCTGAATAAG GCCCTGAAAG ACGGAAAAAT CCTGCAGGCG GATTACAACA CGCTGATGGC GTCGGCAAAA AAGGATTATG AATCGACGCT GAAAAAGCCG TCAGGTGTGA AGGTGTCTGC CGGTGAGCGC CAGGAAGACC GGGCGCATGC AGCCCTGCTG GCGCTTGAAA CCGAGCTCAG GACGCTGGAA AAACACAGCG GTGTGAATGA GAAAATCAGC CAGCAGCGCC GGGATTTATG GGAAGCGGAA AGTCAATATG TGGTCCTGAA AGAGGCCGCC ACGAAACGGC AGTTATCTGA GCAGGAAAAA TCCCTGCTGG CTCATGAGAA AGAGACGCTG GAGTACAAAC GCCAGCTGGC TGAGCTGGGA GACAAGATTG AACACCAGAA GCGGCTGAAT GAGCTGGCAC AGCAGGCGGC GCGGTTTGAA CAGCAGCAAA GCGCGAAGCA GGCGGCAATC AGCGCAAAAG CCCGCGGCCT CACCGACCGT CAGGCGCAGC GGGAGTCGGA AGAGCAGCGC 
CTTCGTGAGG TGTACGGTGA TAATCCGGCT GCGCTGGCGA AGGCCACATC GGCACTGAAG AACACCTGGT CTGCGGAGGA GCAGCTTCGT GGAAGCTGGA TGGCCGGGAT GAAGTCCGGC TGGAGTGAGT GGGAAGAGAG CGCCACGGAC AGTATGTCGC AGGTTAAAAG TGCTGCCACG CAGACCTTTG ATGGTATTGC ACAGAATATG GCGGCGATGC TGACCGGCAG TGAGCAGAAC TGGCGCAGCT TCACCCGTTC CGTGCTGTCC ATGATGACAG AAATTCTGCT TAAGCAGGCA ATGGTGGGGA TTGTCGGGAG TATCGGCAGC GCCATTGGCG GGGCTGTTGG TGGCGGCGCA TCCGCGTCAG GCGGTACAGC CATTCAGGCC GCTGCGGCGA AATTCCATTT TGCAACCGGA GGATTTACGG GAACCGGCGG CAAATATGAG CCAGCGGGGA TTGTTCACCG TGGTGAATTT GTCTTCACAA AGGAGGCAAC CAGCCGGATT GGCGTGGGGA ATCTCTACCG GCTGATGCGC GGCTATGCCA CCGGTGGTTA TGTCGGTACA CCGGGCAGTC TGGCTGACAG CCGGTCGCAG GCGTCCGGGA CGTTTGAGCA GAATAACCAT GTGGTGATTA ACAACGACGG CACGAACGGG CAGATAGGTC CGGCTGCTCT GAAGGCGGTG TATGACATGG CCCGCAAGGG TGCCCGTGAT GAAATTCAGA CACAGATGCG TGATGGTGGA CTGTTCTCCG GAGGTGGACG ATGA
|
Protein sequence | MSQPVGDLVI DLSLDAVRFD EQMSRVRRHF SGLDTDARKT ASAVEQGLSR QALAAQKAGI SVGQYKAAMR TLPAQFTDIA TQLAGGQNPW LILLQQGGQV KDSFGGMIPM FRGLAGAITL PMVGVTSLAV ATGALAYAWY QGDATLSEFN KTLVLSGNQA GLTADRMLTL SRAGQAAGLT FNQARESLAA LVNAGVRGGE QFDAINQSVA RFASASGVEV DKVAEAFGKL TTDPTSGLMA MARQFRNVTA EQIAYVAQLQ RSGDEAGALQ AANDIATKGF DEQTRRLKEN MGTLETWADK TGKAFKSMWD AILDIGRPES SADMLASAQK AFDEADKKWQ WYQSRSQRRG KTSSFRANLQ GAWDDRENAR LGLAAATLQS DMEKAGELAA RDRAERESSQ LKYTGEAQKA YERLQTPLDK YTARQKELNK ALKDGKILQA DYNTLMASAK KDYESTLKKP SGVKVSAGER QEDRAHAALL ALETELRTLE KHSGVNEKIS QQRRDLWEAE SQYVVLKEAA TKRQLSEQEK SLLAHEKETL EYKRQLAELG DKIEHQKRLN ELAQQAARFE QQQSAKQAAI SAKARGLTDR QAQRESEEQR LREVYGDNPA ALAKATSALK NTWSAEEQLR GSWMAGMKSG WSEWEESATD SMSQVKSAAT QTFDGIAQNM AAMLTGSEQN WRSFTRSVLS MMTEILLKQA MVGIVGSIGS AIGGAVGGGA SASGGTAIQA AAAKFHFATG GFTGTGGKYE PAGIVHRGEF VFTKEATSRI GVGNLYRLMR GYATGGYVGT PGSLADSRSQ ASGTFEQNNH VVINNDGTNG QIGPAALKAV YDMARKGARD EIQTQMRDGG LFSGGGR
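The gene and protein sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how one might strip that formatting, compute GC content, and translate the gene for comparison with the listed protein follows. The function names are hypothetical, and the codon table is the standard genetic code, whose sense-codon assignments translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model).

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two display blocks of the gene sequence from the record.
prefix = "ATGTCCCAGC CGGTTGGTGA"
print(translate(prefix))
```

Applied to the first two blocks of the gene sequence, this yields the same leading residues as the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` would give values to compare against the GC content and Protein sequence rows.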
|
| |