Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2572 |
Symbol | |
ID | 6269055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2372942 |
End bp | 2373850 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641726554 |
Product | hypothetical protein |
Protein accession | YP_001881034 |
Protein GI | 187733953 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000000058678 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAATC GTACGCTGGC TGATCTTGAT CGTGTCGTTG CTCTCGGCGG AGGGCATGGA CTGGGACGCG TTCTCTCATC ACTTTCGTCT TTGGGTTCTC GTTTAACGGG TATCGTCACC ACCACGGATA ATGGTGGCTC GACGGGGCGT ATTCGCCGTT CAGAAGGCGG CATTGCCTGG GGCGATATGC GCAACTGCCT CAACCAACTG ATAACGGAAC CGAGCGTCGC CTCCGCGATG TTTGAATACC GTTTTGGCGG CAATGGCGAA CTTTCCGGGC ACAACCTTGG AAACTTGATG TTAAAGGCGC TGGATCACCT TAGCGTGCGG CCTCTGGAAG CCATCAATTT AATTCGTAAT CTGCTGAAAG TGGATACGCA TTTGATTCCA ATGTCAGAGC ATCCCGTCGA TCTGATGGCG ATTGACGATC AGGGGCATGA AGTTTACGGC GAGGTCAATA TCGACCAGTT AACTACGCCG ATTCAAGAGT TATTATTAAC GCCGAATGTA CCCGCAACGC GTGAGGCAGT TCACGCTATC AATGAAGCGG ATCTCATCAT TATTGGGCCT GGCAGTTTTT ATACCAGCCT GATGCCAATT CTGCTGCTGA AGGAAATCGC CCAGGCATTA CGCCGCACGC CAGCGCCGAT GGTTTATATC GGCAATCTGG GGCGTGAGTT GAGTTTACCT GCGGCTAATT TGAAGCTGGA AAGCAAGCTG GCAATTATGG AGCAGTATGT TGGTAAAAAA GTCATTGATG CGGTCATCGT CGGGCCAAAA GTGGATGTCT CGGCGGTGAA AGAGCGGATT GTGATCCAGG AGGTACTGGA GGCCAGCGAT ATTCCGTATC GTCATGACCG CCAGTTGTTA CATAACGCGC TGGAAAAGGC GTTACAGGCT TTAGGTTAA
|
Protein sequence | MRNRTLADLD RVVALGGGHG LGRVLSSLSS LGSRLTGIVT TTDNGGSTGR IRRSEGGIAW GDMRNCLNQL ITEPSVASAM FEYRFGGNGE LSGHNLGNLM LKALDHLSVR PLEAINLIRN LLKVDTHLIP MSEHPVDLMA IDDQGHEVYG EVNIDQLTTP IQELLLTPNV PATREAVHAI NEADLIIIGP GSFYTSLMPI LLLKEIAQAL RRTPAPMVYI GNLGRELSLP AANLKLESKL AIMEQYVGKK VIDAVIVGPK VDVSAVKERI VIQEVLEASD IPYRHDRQLL HNALEKALQA LG
|
| |