Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_0372 |
Symbol | |
ID | 7089510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 419079 |
End bp | 419759 |
Gene Length | 681 bp |
Protein Length | 226 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643459293 |
Product | nucleotidase |
Protein accession | YP_002356330 |
Protein GI | 217971579 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E [TIGR02254] HAD superfamily (subfamily IA) hydrolase, TIGR02254 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000751956 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000283623 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCTTGC CTTACCAATG GATTTTGTTC GATGCCGATG AAACCCTATT TTATTTTGAT GCCTTAAAAG GGCTTAAGTT GATGTTTAGT GAGTTTGGGG TCGATTTTAC CCAAGCCGAT TTTGATGAGT ATCAGTTGGT TAACAAACCG CTTTGGGTCG ATTATCAAGA TGGCAAAATA ACTGCCGCCG AGTTGCAGAC CATACGTTTC GAACCTTGGG CAGCCAAATT ATCAGTCACG GCCATGACGC TCAATAGTGC ATTTTTATCA GCAATGGCCG AAATTTGTTC GCCGCTACCT GGTGCTCGCG AGTTATTAGC CGCGCTGCAA GGTAAAGCTA AACTAGGCAT CATTACCAAC GGTTTCACTG AACTACAAAC CGTGCGATTA GAGCGTACAG GCTTGCAGCA TCATTTTGAT ATTTTAGTGA TTTCAGAAAA AGTCGGTATC GCTAAACCCG ATGTGGGTAT CTTCGATCAC GCCTTCGAAC TCATGGGCCA TCCTGAGCGC GATACTGTGT TGATGGTCGG CGATAACCCG CATTCAGATA TCCAAGGCGG CATCAATGCT GGTATTCATA CATGCTGGTA TAACGTCCAC GGCCACGACG TACCTGCCGG TATCACCCCG CACTATCAAG TGGGCTCGCA CCAAGAGCTG CAAAAAATCC TGTTACCGTA A
|
Protein sequence | MSLPYQWILF DADETLFYFD ALKGLKLMFS EFGVDFTQAD FDEYQLVNKP LWVDYQDGKI TAAELQTIRF EPWAAKLSVT AMTLNSAFLS AMAEICSPLP GARELLAALQ GKAKLGIITN GFTELQTVRL ERTGLQHHFD ILVISEKVGI AKPDVGIFDH AFELMGHPER DTVLMVGDNP HSDIQGGINA GIHTCWYNVH GHDVPAGITP HYQVGSHQEL QKILLP
|
| |