Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1459 |
Symbol | |
ID | 6272440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1332874 |
End bp | 1333902 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641725560 |
Product | phage major capsid protein E |
Protein accession | YP_001880066 |
Protein GI | 187734000 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATTGT TTACGACCCG CCAGTTACTC GGTTATACCG AACAAAAAGT GAAATTCCGT GCGCTGTTTC TGGAGCTGTT TTTCCGCCGT ACGGTGAATT TCCACACCGA AGAGGTGATG CTGGACAAAA TTACCGGAAA AACGCCGGTG GCGGCCTATG TCTCCCCGAT CGTTGAAGGA AAAGTGCTTC GCCATCGCGG TGGTGAAACC CGCGTGTTAC GTCCGGGCTA CGTCAAGCCG AAACACGAAT TTAATTACCA GCAGGCGGTT GAGCGCCTTC TTGGTGAAGA TCCGGCTCAG CTGAACGACC CGGCCTACCG TCGTCTGCGT ATCATTACCG ATAACCTCAA ACAGGAAGAG CACGCCATTG TCCAGGTGGA AGAAATGCAG GCGGTGAATG CCGTGCTGTA TGGCAAATAC ACCATGGAAG GGGATCAGTT TGATACTGTC GAGGTGGATT TCGGGCGCTC TGAAGGAAAT AACATTGAGC AGGCTGACGG TAAAAAATGG TCTGAGCAGG ACCGTGATAC GTTTGATCCG ACGCATGATA TTGACCTCTA CTGCGATCAG GCCAGCGGCC TTGTGAATAT CGCCATTATG GACGGTACGG TCTGGCGTCT GCTGAATGGC TTTAAGCTGT TCCGCGAAAA ACTGGATACC CGTCGCGGCT CAAATTCACA ACTCGAAACG GCAGTGAAAG ATCTGGGCGC AGTGGTGTCC TTCAAGGGGT ATTACGGCGA TCTGGCCATT GTGGTGGCGA AAACGTCTTA TGTGGCAGAG GACGGTACCG AAAAACGTTA TCTGCCGGAG GGCATGCTGG TGTTGGGGAA TACGGCGGCA GAGGGGATTC GTTGCTATGG TGCCATTAAG GATGCACAGG CGTTGTCTGA AGGAGTGGTG GCTTCTTCCC GTTACCCGAA ACACTGGCTG ACCGTGGGCG ATCCGTCCTG TGAATTCACC ATGACGCAGT CCGCTCCGCT GATGGTGCTG CCGGATCCGG ATGAGTTTGT GGTGGTACAG GTGAAATAA
|
Protein sequence | MGLFTTRQLL GYTEQKVKFR ALFLELFFRR TVNFHTEEVM LDKITGKTPV AAYVSPIVEG KVLRHRGGET RVLRPGYVKP KHEFNYQQAV ERLLGEDPAQ LNDPAYRRLR IITDNLKQEE HAIVQVEEMQ AVNAVLYGKY TMEGDQFDTV EVDFGRSEGN NIEQADGKKW SEQDRDTFDP THDIDLYCDQ ASGLVNIAIM DGTVWRLLNG FKLFREKLDT RRGSNSQLET AVKDLGAVVS FKGYYGDLAI VVAKTSYVAE DGTEKRYLPE GMLVLGNTAA EGIRCYGAIK DAQALSEGVV ASSRYPKHWL TVGDPSCEFT MTQSAPLMVL PDPDEFVVVQ VK
|
| |