Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4223 |
Symbol | |
ID | 6871108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4067252 |
End bp | 4068097 |
Gene Length | 846 bp |
Protein Length | 281 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642787156 |
Product | sugar phosphatase |
Protein accession | YP_002217782 |
Protein GI | 198242544 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATCA AACTTATTGC TATCGACATG GATGGCACCC TTCTGCTGCC CGATCACACC ATTTCTCCGG CGGTTAAAAA CGCGATTGCC GCTGCGCGTG AAAAAGGGGT AAATGTGGTG CTGACCACAG GCCGTCCGTA TGCGGGTGTG CACAGTTACC TGAAAGAACT TCACATGGAA CAGCCCGGCG ATTATTGCAT CACCTATAAC GGGGCGCTGG TGCAGAAAGC AGGGGACGGC AGTACGGTTG CGCAAACGGC GCTCAGCTAT GATGACTACC GTTACCTGGA AAAACTGTCC CGTGAGGTGG GTTCTCACTT CCACGCATTA GACCGAAATA CGCTTTATAC CGCTAACCGC GATATCAGCT ACTACACGGT GCATGAGTCG TATGTGGCGA CCATTCCGCT GGTATTTTGT GAAGCGGAGA AGATGGACCC GAACACCCAG CTCCTGAAAG TTATGATGAT CGATGAGCCT GCCGTTCTCG ACCGGGCGAT TGCGCGTATA CCGGCAGAGG TGAAGGAAAA GTACACCGTG CTGAAAAGTG CGCCGTACTT CCTTGAAATC CTCGATAAAC GGGTTAATAA AGGCACTGGC GTAAAATCAC TGGCCGAGGC GCTGGGTATT AAGCCAGAGG AGGTGATGGC GATTGGCGAT CAGGAAAACG ACATTGCGAT GATCGAATAC GCCGGCATGG GCGTGGCAAT GGACAACGCC ATTCCGTCGG TCAAAGAGGT GGCTAACTTT GTGACTAAAT CGAACCTTGA AGATGGTGTT GCCTGGGCGA TTGAAAAATT TGTGCTGAAC CCCGATCACT CATCCGGCCA TTTCCCCGCC CGATAA
|
Protein sequence | MAIKLIAIDM DGTLLLPDHT ISPAVKNAIA AAREKGVNVV LTTGRPYAGV HSYLKELHME QPGDYCITYN GALVQKAGDG STVAQTALSY DDYRYLEKLS REVGSHFHAL DRNTLYTANR DISYYTVHES YVATIPLVFC EAEKMDPNTQ LLKVMMIDEP AVLDRAIARI PAEVKEKYTV LKSAPYFLEI LDKRVNKGTG VKSLAEALGI KPEEVMAIGD QENDIAMIEY AGMGVAMDNA IPSVKEVANF VTKSNLEDGV AWAIEKFVLN PDHSSGHFPA R
|
| |