Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2627 |
Symbol | |
ID | 6270973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2429107 |
End bp | 2430009 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641726596 |
Product | hypothetical protein |
Protein accession | YP_001881076 |
Protein GI | 187731476 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAT CAACAACCTC CTCCCCGCAT GATGCGGTAT TTAAAACCTT TATGTTCACA CCCGAAACCG CACGGGATTT TCTCGAAATA CATTTACCAG AACCACTGCG CAAGCTTTGC AACCTGCAAA CCTTACGCCT GGAACCCACT AGTTTTATTG AAAAAAGTTT ACGCGCTTAC TACTCGGATG TTTTGTGGTC CGTGGAAACC AGCGACGGTG ACGGCTATAT CTACTGCGTG ATTGAACATC AAAGCTCTGC AGAAAAGAAT ATGGCTTTTC GGCTAATGCG CTATGCCACT GCCGCCATGC AGCGTCACCT GGATAAAGGC TATGACAGAG TTCCGCTGGT GGTGCCATTG CTGTTTTATC ATGGCGAAAC ATCGCCCTAC CCGTACTCAC TTAACTGGCT GGATGAGTTT GACGATCCGC AACTTGCCCG GCAGTTGTAC ACCGAAGCTT TTCCGTTGGT GGATATCACC ATCGTACCTG ACGATGAGAT CATGCAACAT CGGCGTATAG CTCTGCTGGA ACTGATTCAA AAGCATATTC GCGACCGCGA TTTAATCGGC ATGGTCGACA GGATCACCAC GCTTTTGGTT AGAGGCTTCA CTAATGACAG CCAGCTACAA ACACTGTTTA ATTATCTGCT GCAATGCGGC GATACCTCCC GTTTCACCCG TTTTATTGAG GAGATTGCCG AACGTTCACC ACTACAAAAG GAGAGATTAA TGACTATTGC TGAACGGCTA CGGCAGGAAG GGCATCAAAT TGGCTGGCAG GAAGGTATGC ATGAACAAGC CATTAAAATT GCTTTGCGCA TGCTGGAACA GGGCATTGAT CGTGACCAGG TGCTCGCGGC CACCCAGCTA AGCGAAGCCG ATCTGGCAGC GAATAACCAC TAA
|
Protein sequence | MTESTTSSPH DAVFKTFMFT PETARDFLEI HLPEPLRKLC NLQTLRLEPT SFIEKSLRAY YSDVLWSVET SDGDGYIYCV IEHQSSAEKN MAFRLMRYAT AAMQRHLDKG YDRVPLVVPL LFYHGETSPY PYSLNWLDEF DDPQLARQLY TEAFPLVDIT IVPDDEIMQH RRIALLELIQ KHIRDRDLIG MVDRITTLLV RGFTNDSQLQ TLFNYLLQCG DTSRFTRFIE EIAERSPLQK ERLMTIAERL RQEGHQIGWQ EGMHEQAIKI ALRMLEQGID RDQVLAATQL SEADLAANNH
|
| |