Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4293 |
Symbol | |
ID | 6268407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4010887 |
End bp | 4011687 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641728104 |
Product | putative sugar phosphatase |
Protein accession | YP_001882524 |
Protein GI | 187733143 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACCAGG TTGTTGCGTC TGATTTAGAT GGCACGTTAC TTTCTCCCGA CCATACGTTA TCCCCTTACG CCAAAGAAAC TCTGAAGCTG CTCACCGCGC GCGGCATCAA CTTTGTGTTT GCGACCGGTC GTCACCACGT TGATGTGGGG CAAATTCGCG ATAATCTGGA GATTAAGTCT TACATGATTA CCTCCAATGG TGCGCGCGTT CACGATCTGG ATGGTAATCT GATTTTTGCT CATAACCTGG ATCGCGACAT TGCCAGCGAT CTGTTTGGCG TAGTCAACGA CAATCCGGAC ATCATTACTA ACGTTTATCG CGACGACGAA TGGTTTATGA ATCGCCATCG CCCGGAAGAG ATGCGCTTTT TTAAAGAAGC GGTGTTCAAA TATGCGCTGT ATGAGCCTGG ATTACTGGAG CCGGAAGGCG TCAGCAAAGT GTTCTTCACC TGCGATTCCC ATGAACAACT GCTGCCGCTG GAGCAGGCGA TTAACGCTCG TTGGGGCGAT CGCGTCAACG TCAGTTTCTC TACCTTAACC TGTCTGGAAG TGATGGCGGG CGGCGTTTCA AAAGGCCATG CGCTGGAAGC GGTGGCGAAG AAACTGGGCT ACAGCCTGAA GGATTGTATT GCGTTTGGTG ACGGGATGAA CGACGCCGAA ATGCTGTCGA TGGCGGGGAA AGGCTGCATT ATGGGCAGTG CGCACCAGCG TCTGAAAGAC CTTCATCCCG AGCTGGAAGT GATTGGTACT AATGCCGACG ACGCGGTGCC GCATTATCTG CGTAAACTCT ATTTATCGTA A
|
Protein sequence | MYQVVASDLD GTLLSPDHTL SPYAKETLKL LTARGINFVF ATGRHHVDVG QIRDNLEIKS YMITSNGARV HDLDGNLIFA HNLDRDIASD LFGVVNDNPD IITNVYRDDE WFMNRHRPEE MRFFKEAVFK YALYEPGLLE PEGVSKVFFT CDSHEQLLPL EQAINARWGD RVNVSFSTLT CLEVMAGGVS KGHALEAVAK KLGYSLKDCI AFGDGMNDAE MLSMAGKGCI MGSAHQRLKD LHPELEVIGT NADDAVPHYL RKLYLS
|
| |