Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1521 |
Symbol | |
ID | 6269279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1387983 |
End bp | 1389188 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641725619 |
Product | hypothetical protein |
Protein accession | YP_001880125 |
Protein GI | 187732223 |
COG category | [S] Function unknown |
COG ID | [COG4950] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTATCGC CGATCCGTCT TTCTCCCCTT CCCGCCTTGC GTCAGGATAA CGATTTCCTT TACGACCAAG GAGCGCCCAT GGAACAACGC CACATCACCG GCAAAAGCCA CTGGTATCAT GAAACGCAAT CCAGTACTGC GGAGTATGAC GTTCTGCCTC TGGTCCCGGA AGCCGCAAAG GTCAGCGATC CCTTTCTGCT CGACGTGATC CTTGATGAAG AAACGCTGGC TCCCTTCCTT TCATGGCTGG TCCCTGCGCG CGTTCTTGCA GTGGAATTGT TCCCTGACCA GCTTACCGTG ACCCGTTCAC AGACCTTCAC CGCTTATGAA CGCTTGTCTA CGGCCCTGAC GGTTGCTCAG GTTTGCGGCG TCCAGCGGTT ATGTAACTAC TATTCGGCGC GACTTACGCC GCTCCCCGGG CCTGATTCCT CTAGGGAAAG CAATCATAGG TTGGCACAAA TCACGCAATA TGCCCGCCAA CTGGCTAGCT CGCCTTCTAT TATCGACAAC CGATCGCGCC AGCATCTGAA TGACGTCGAT CTTACTGCCT GGGACTGTGT GATCATTAAC CAAATCATTG GTTTTATTGG CTTTCAGGCG CGGACCATTG CGACATTTCA GGCTTATCTC GGGCACCCGG TACGCTGGTT ACCCGGTCTG GAAATACAAA ACTACGCCGA CGCGTCACTG TTTGCTGATG AATCAATACG CTGGCGAAGC AGCTATGAGG TGGAAAAACT ACCTGAAGAG TACACAAAAA GTTCAACAGC AGAACTTTGC CAACTGGCCG AAACACTCTC TCTCCACCCT ATTTCACTTT CCCTTCTCGA AAGGTTGTTA AACAGTACAC GGGTTAATAC ACAGCCGGAT AATAAGCTTG CGGCGTTGTT ATGCGCGCGG ATAAATGGCA GTCCTGCTTG TTTTGCCGCC TGTATGGATT CATCAAATGA ATATAAAAAA ATCAGCCCCC TTCTGCGCAA GGGCGAAAAT GAAATTAACC AATGGGCTGA CCGTCATTCT GTTGAGCACG CTACCGTTCA GGCGATACAA TGGCTGACCC GAGCACCCGA TCGCTTTAGC ACCGCCCAGT TCAGCCCTTT ACTCGAACAC GAACAATCAT CAACGCAGAT TATTAATCTG CTGGTATGGA GCGGGCTGTG TGGCTGGATA AATCGCTTAA AAATCGCGTT GGGTGAGACA TATTAA
|
Protein sequence | MLSPIRLSPL PALRQDNDFL YDQGAPMEQR HITGKSHWYH ETQSSTAEYD VLPLVPEAAK VSDPFLLDVI LDEETLAPFL SWLVPARVLA VELFPDQLTV TRSQTFTAYE RLSTALTVAQ VCGVQRLCNY YSARLTPLPG PDSSRESNHR LAQITQYARQ LASSPSIIDN RSRQHLNDVD LTAWDCVIIN QIIGFIGFQA RTIATFQAYL GHPVRWLPGL EIQNYADASL FADESIRWRS SYEVEKLPEE YTKSSTAELC QLAETLSLHP ISLSLLERLL NSTRVNTQPD NKLAALLCAR INGSPACFAA CMDSSNEYKK ISPLLRKGEN EINQWADRHS VEHATVQAIQ WLTRAPDRFS TAQFSPLLEH EQSSTQIINL LVWSGLCGWI NRLKIALGET Y
|
| |