Gene Information |
Locus tag | SbBS512_E2526 |
Symbol | supH |
ID | 6269333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2326684 |
End bp | 2327499 |
Gene Length | 816 bp |
Protein Length | 271 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641726510 |
Product | sugar phosphatase SupH |
Protein accession | YP_001880990 |
Protein GI | 187733293 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily; [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
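The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2326684, 2327499)
protein = protein_length_aa(length)
print(length)   # 816
print(protein)  # 271
```

Both results agree with the Gene Length (816 bp) and Protein Length (271 aa) rows.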
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.0998065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTAA AAGTTATCGT CACAGACATG GACGGTACTT TTCTTAACGA CGCCAAAACG TACAACCAAC CACGTTTTAT GGCGCAATAT CAGGAACTGA AAAAGCGCGG CATTGAGTTC GTTGTCGCCA GCGGTAATCA GTATTACCAG CTTATTTCGT TCTTTCCTGA GCTAAAGGAT GAGATCTCTT TTGTCGCGGA AAACGGCGCA CTGGTTTACG AACATGGCAA GCAACTGTTC CACGGCGAAC TGACCCGACA TGAATCGCGG ATTGTTATTG GCGAGTTGCT AAAAGATAAG CAACTCAATT TTGTCGCCTG CGGTCTGCAA AGTGCATATG TCAGCGAAAA TGCCCCCGAA GCATTTGTCG CACTGATGGC AAAACACTAC CATCGCCTGA AAGCTGTAAA AGATTATCAG GAGATTGACG ACGTACTGTT CAAGTTTTCG CTCAACCTGC CAGATGAACA AATCCCGTTA GTGATCGACA AACTGCACAT AGCGTTCGAT GGCATTATGA AACCCGTCAC CAGTGGTTTT GGCTTTATCG ACCTGATTAT TCCCGGTCTA CATAAAGCAA ACGGTATTTC GCGGTTACTG AAACGCTGGG ATCTGTCACC GCAAAATGTG GTAGCGATTG GCGACAGCGG TAACGATGCG GAGATGCTGA AAATGGCGCG TTATTCCTTT GCGATGGGCA ATGCTGCGGA AAACATTAAA CAAATCGCCC GTTACGCTAC CGATGATAAT AATCATGAAG GCGCGCTGAA TGTGATTCAG GCGGTGCTGG ATAACACATC CCCTTTTAAC AGCTGA
|
Protein sequence | MSVKVIVTDM DGTFLNDAKT YNQPRFMAQY QELKKRGIEF VVASGNQYYQ LISFFPELKD EISFVAENGA LVYEHGKQLF HGELTRHESR IVIGELLKDK QLNFVACGLQ SAYVSENAPE AFVALMAKHY HRLKAVKDYQ EIDDVLFKFS LNLPDEQIPL VIDKLHIAFD GIMKPVTSGF GFIDLIIPGL HKANGISRLL KRWDLSPQNV VAIGDSGNDA EMLKMARYSF AMGNAAENIK QIARYATDDN NHEGALNVIQ AVLDNTSPFN S
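As a rough consistency check between the two sequence rows, the sketch below translates short excerpts of the gene sequence and computes a GC fraction. It assumes the codon assignments of translation table 11, which match the standard genetic code for elongation codons. The helper names `translate` and `gc_fraction` are hypothetical, and only the first 30 and last 15 nucleotides of the gene are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


head = "ATGAGCGTAA AAGTTATCGT CACAGACATG"  # first 30 nt of the gene
tail = "CCTTTTAAC AGCTGA"                   # last 15 nt, in frame

print(translate(head))    # MSVKVIVTDM
print(translate(tail))    # PFNS
print(gc_fraction(head))  # 0.4
```

The translated excerpts match the start and end of the protein sequence row. Passing the full gene sequence to the same functions would allow the whole protein and the reported GC content to be checked.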