Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0885 |
Symbol | |
ID | 6268549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 826445 |
End bp | 829057 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641725047 |
Product | minor tail protein H |
Protein accession | YP_001879574 |
Protein GI | 187731601 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCAGC CAGCGGGTGA TCTGGTTATT GATTTGAGTC TGGATGCGGC CCGGTTTGAT GAACAGATGG CCCGGGTACG CCGTCATTTT TCCAGTCTGG AGGCGGATGC CAGAAAAACC GCCAGTACTG TTGAACAGGG GCTGAGCCGA CAGGCGCTGG CTGCACAAAA AGCCGGGATA TCAGTCGGAC AGTATAAGGC TGCCATGCGC ACACTGCCCG CACAGTTCAC GGATATTGTC ACTCAGCTTG CCGGTGGTCA GAATCCCTTC CTTATCATGC TGCAGCAGGG GGGGCAGATC AGCGATTCAT TCGGTGAACC GCTCAGCCTG CTTACCCTGC TGAAGGAGGA ACTTCTCGGG ATCAGGGATG CCTCTGAATC ATCAGAGGAG TCGCTGTCAG ATACGGCAAA TGCACTGGCT GAAAATGCCC GGAATGCCGG TGAGCTGGGA CGATTTATGT CGGTGGTCCG TGTGGCGGCA GGTGGCGGGG TTGCCGTACT GGCCGCGCTT GCTGCCGCCG CCTGGCAGGC AGAGCAGGCT GACCGGGCCT TATTGCGTTC ACTGATCCTG ACCGGAGGGG CGGCTGCCAC CACAACGGCA GAATTGTGGA AAATGGCCGG GGTGATCAGC GATGAAGCCG GTGGTGGTAT CAGACAGGCG GCAGAAAATC TGGCCCGTCT GGCAGAAAGC GGGAAATATA CCGCCGGGCA GCTACGGATC ATGGGGGAAA CCTCTCAGAG ATGGCTGCAG ACGGTGGGGG ACGATGCCGG GAAGGTGGAA AAAGCCTTTG AAGGGATTGC AGCAGATCCG GTGAAGGCGC TGGCCTCCCT GAATCAGCAG TATAACTTCC TGAGCGTTTC CCAGTTACGC CATATTGATG AGCTTGAGCG CACGAAAGGT AAACAGGCTG CGGTGACGGA GGCGATGTCC CTGTTTGCGG ATGTCATGAA TGCACGTCTG GAGCAACTTG ATAAAGCGGC CACGCCGGTG GAAAAAATCT GGGACGATGT TAAAACCTGG ACTTCTGACG CATGGGCATG GATAGGTGAT CATACACTGG GGGCACTCAG TCTGATCACT GACGTGGTGG CCGGAACCGT TGAACAGGTG AAGCTGCTGC TTGTGCAGGG GGATCTGGCG CTGGCTGAAT TTATTCAGTC AGCCTGGGAA ACGACAAAGA ATGTGCCCGG CGTTGGTGCG TTGTTTGGTG AACTGGCAGA AGAGAACCGC GTATTTATTG AGAAAACAAA ACGCGATGAA CTGGCGCTGA GAAAATCCAT TGCGGAACGG GATGCGCGTA TACGCCAGGG GGAAATGGGG TACATCAACC GCTCGCGTGC AACAGGCGTC AGCAAAGGTC TTGGGCAGCA GGAAGCCGTC AGCCGTCTGG CTGAAGAGCT GACAGGTAAA AAGCATACAT CACCGAAAAC GCGCTCTGCC GGGGAGAGGG AAGAGGAGCA GGCAAGAGAG GCTCTGCTTG CCCTTGAAGC TGAGCTCAGG ACGCTGGAAA AACACAGCGG TGCGAATGAG AAAATCAGCC GGCAGCGCCG TGATTTATGG AAGGCGGAAA GTCAGTATGC GGTCCTGAAA GAGGCTGCCA CGAAACGGCA GTTATCCTGG CAGGAAAAAT CCCTGCTGGC TCATGAGAAA GAGACGCTGG AGTACAAACG CCAGCTGGCT GAGCTGGGAG ACAAGATTGA ACACCAGAAG CGGCTGAATG AGCTGGCACA GCAGGCGGCG CGGTTTGAAC AGCAGCAGAG CGCGAAGCAG GCGGCAATCA GCGCAAAAGC CCGCGGACTC ACCGACCGTC AGGCGCAGCG GGAGTCGGAA GAGCAGCGCC TTCGTGAGGT GTACGGTGAT AATCCGGCTG CGCTGGCGAA GGCCACATCT GCACTGAAGA ACACCTGGTC TGCGGAGGAG CAGCTTCGTG GAAGCTGGAT GGCCGGGATG AAGTCCGGCT GGGGCGAGTG GGCGGAAAGT GCGACGGACA GTTTTTCGCA GGTAAAAAGT GTGGCCACGC AGACCTTTGA CGGTATTGCA CAGAATATGG CGGCGATGCT GACCGGCAGC GAACAGAACT GGCGTGGTTT CACCCGTTCC GTGCTCTCCA TGCTGACAGA GATTTTTCTG AAGCAGGCGA TGGTGGGGAT TGTCGGGCGT ATCGGCAGCG CCATTGGTGG TGCTTTCGGT GGTGGCGCAT CCGCTTCCAC GGGGACGGCC ATTCAGGCTG CGGCGGCGAA CTTCCATTTC GCGACCGGAG GATTTACGGG GACGGGCGGC AAATATGAGC CTGCGGGGAT TGTCCATCGC GGGGAGTTTG TCTTCACGAA GGAGGCGACC AGCCGGATTG GTGTCGGCAA CCTGTATCGT CTGATGCGCG GGTATGCGGA AGGTGGTTAT GTCGGCGGTG CCGGAAGTCC GGCGCAGATG CGGCGGGCCG AAGGCATTAA TTTTAATCAG AACAATCACG TGGTGATTCA GAACGACGGC CCCAACGGGC AGGCAGGGCC GCAGCTGATG AAAGCGGTGT ATGAGATGGC CCGTAAAGGT GCGCAGGATG AGCTCCGGCT GCAGTTGCGT GATGGCGGTC TGTTATCGGG GAGCGGGCGA TGA
|
Protein sequence | MSQPAGDLVI DLSLDAARFD EQMARVRRHF SSLEADARKT ASTVEQGLSR QALAAQKAGI SVGQYKAAMR TLPAQFTDIV TQLAGGQNPF LIMLQQGGQI SDSFGEPLSL LTLLKEELLG IRDASESSEE SLSDTANALA ENARNAGELG RFMSVVRVAA GGGVAVLAAL AAAAWQAEQA DRALLRSLIL TGGAAATTTA ELWKMAGVIS DEAGGGIRQA AENLARLAES GKYTAGQLRI MGETSQRWLQ TVGDDAGKVE KAFEGIAADP VKALASLNQQ YNFLSVSQLR HIDELERTKG KQAAVTEAMS LFADVMNARL EQLDKAATPV EKIWDDVKTW TSDAWAWIGD HTLGALSLIT DVVAGTVEQV KLLLVQGDLA LAEFIQSAWE TTKNVPGVGA LFGELAEENR VFIEKTKRDE LALRKSIAER DARIRQGEMG YINRSRATGV SKGLGQQEAV SRLAEELTGK KHTSPKTRSA GEREEEQARE ALLALEAELR TLEKHSGANE KISRQRRDLW KAESQYAVLK EAATKRQLSW QEKSLLAHEK ETLEYKRQLA ELGDKIEHQK RLNELAQQAA RFEQQQSAKQ AAISAKARGL TDRQAQRESE EQRLREVYGD NPAALAKATS ALKNTWSAEE QLRGSWMAGM KSGWGEWAES ATDSFSQVKS VATQTFDGIA QNMAAMLTGS EQNWRGFTRS VLSMLTEIFL KQAMVGIVGR IGSAIGGAFG GGASASTGTA IQAAAANFHF ATGGFTGTGG KYEPAGIVHR GEFVFTKEAT SRIGVGNLYR LMRGYAEGGY VGGAGSPAQM RRAEGINFNQ NNHVVIQNDG PNGQAGPQLM KAVYEMARKG AQDELRLQLR DGGLLSGSGR
|
| |