Gene Information |
Locus tag | SbBS512_E0892 |
Symbol | |
ID | 6272727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 832407 |
End bp | 833435 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641725054 |
Product | phage major capsid protein E |
Protein accession | YP_001879581 |
Protein GI | 187733048 |
COG category | |
COG ID | |
TIGRFAM ID | |
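The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 832407
END_BP = 833435
GENE_LENGTH_BP = 1029
PROTEIN_LENGTH_AA = 342


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 832407 to 833435 gives 1029 bp, which corresponds to 343 codons, or 342 amino acids plus one stop codon.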
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGATTGT TTACGACCCG CCAGTTACTC GGTTATACCG AACAAAAAGT GAAATTCCGT GCGCTGTTTC TGGAGCTGTT TTTCCGCCGT ACGGTGAATT TCCACACCGA AGAGGTGATG CTGGACAAAA TTACCGGAAA AACGCCGGTG GCGGCCTATG TCTCCCCGAT CGTTGAAGGA AAAGTGCTTC GCCATCGCGG TGGTGAAACC CGCGTGTTAC GTCCGGGCTA CGTCAAGCCC AAACACGAAT TTAATTACCA GCAGGCGGTT GAGCGCCTTC CTGGTGAAGA TCCGGCTCAG CTGAACGACC CGGCCTACCG TCGTCTGCGT ATCATTACCG ATAACCTCAA ACAGGAAGAG CACGCCATTG TCCAGGTGGA AGAAATGCAG GCGGTGAATG CCGTGCTGTA TGGCAAATAC ACCATGGAAG GGGATCAGTT TGATACTGTC GAGGTGGATT TCGGGCGCTC TGAAGGAAAT AACATTGAGC AGGCTGACGG TAAAAAATGG TCTGAGCAGG ACCGTGATAC GTTTGATCCG ACGCATGATA TTGACCTCTA CTGCGATCAG ACCAGCGGCC TTGTGAATAT CGCCATTATG GACGGTACGG TCTGGCGTCT GCTGAATGGC TTTAAGCTGT TCCGCGAAAA ACTGGATACC CGTCGCGGCT CAAATTCACA ACTCGAAACG GCAGTGAAAG ACCTGGGGGC GGTGGTGTCC TTCAAGGGGT ATTACGGCGA TCTGGCCATT GTGGTGGCGA AAACGTCTTA TGTGGCAGAG GACGGTACCG AAAAACGTTA TCTGCCGGAG GGCACACTGG TCCTGGGGAA TACGGCAGCA GAGGGCATTC GTTGCTATGG TGCCATTCAG GATGCGCAGG CGTTGTCCGA AGGTGTGGTG GCCTCTTCCC GTTATCCGAA ACACTGGCTG ACTGTGGGCG ATCCGGCCCG TGAATTCACC ATGACGCAGT CCGCACCGCT GATGGTGCTG CCGGATCCGG ATGAGTTTGT GGTGGTACAG GTGAAATAA
Protein sequence | MGLFTTRQLL GYTEQKVKFR ALFLELFFRR TVNFHTEEVM LDKITGKTPV AAYVSPIVEG KVLRHRGGET RVLRPGYVKP KHEFNYQQAV ERLPGEDPAQ LNDPAYRRLR IITDNLKQEE HAIVQVEEMQ AVNAVLYGKY TMEGDQFDTV EVDFGRSEGN NIEQADGKKW SEQDRDTFDP THDIDLYCDQ TSGLVNIAIM DGTVWRLLNG FKLFREKLDT RRGSNSQLET AVKDLGAVVS FKGYYGDLAI VVAKTSYVAE DGTEKRYLPE GTLVLGNTAA EGIRCYGAIQ DAQALSEGVV ASSRYPKHWL TVGDPAREFT MTQSAPLMVL PDPDEFVVVQ VK
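The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand, and it uses the standard codon assignments, which translation table 11 shares for internal codons. Alternative start codons that table 11 permits are not handled here, which does not matter for this gene because it starts with ATG. All function names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence; stop codons are shown as '*'."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First three blocks and the final block of the gene sequence above.
    print(translate("ATGGGATTGT TTACGACCCG CCAGTTACTC"))  # MGLFTTRQLL
    print(translate("GTGAAATAA"))  # VK*
```

The first 30 bases translate to MGLFTTRQLL, the first block of the protein sequence, and the final nine bases translate to VK followed by a stop codon, matching its end. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.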