Gene Information |
Locus tag | Sputcn32_2072 |
Symbol | |
ID | 5079498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | + |
Start bp | 2369994 |
End bp | 2370662 |
Gene Length | 669 bp |
Protein Length | 222 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640499234 |
Product | HAD family hydrolase |
Protein accession | YP_001183592 |
Protein GI | 146293168 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
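
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2369994, 2370662)
print(length, protein_length(length))  # 669 222
```

Both values agree with the Gene Length and Protein Length fields of this record.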
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000098787 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTGT CACAGGTTAA AGCAGTACTG TTTGATCTTG ATGGTACGCT TGCCGATACC GCGCCAGATC TTGTGCAGGC ACTGAACTTA AGTCTCTGTG ACGCGGGGAT TGAAGCAAAA CCCTTCGAGC TATTACGCAG CGCCGCTTCC CACGGCAGTT TTGCCTTAGT TGATGCCGCG CTCCCCAATG CTGATGAGTC CCTGCGTATT CAAATCCAGC AAGGGCTACT CGCCCACTAT CAACGTATTA ATGGCGACTA TTGTCGATTA TTTACCGGTA TGGATGTATT GCTCGACTGG CTTGAACTAC AGCAACTGCC ATTTGGCGTT ATTACCAATA AACCAGCGCG CTTTACTCGC CCGCTGGTTA AAAAACTGAA TTTGCACCAG CGGATGCAAG TTGTTATCAG CGGCGATTCA ACCCGCTATG CTAAACCCCA TACGGCGCCT ATGTTACTGG GTGCGCAGCA GCTTAACTGC GCCCCTGAAC ATATACTGTA CTTAGGAGAT GCTGAACGAG ATTTACTCGC CGCCAAAGCG GCAGGTATGG TCGGCGGTGT GGCATTATGG GGTTATCTAG GGGAAGAAGA TAAACCACAA AATTGGCCAG CATTGGCACA ATTTAGTTCA CCTTTGGCTG TCCATCAGGC TTTAGTTGCA ACCCGTTAG
|
Protein sequence | MSLSQVKAVL FDLDGTLADT APDLVQALNL SLCDAGIEAK PFELLRSAAS HGSFALVDAA LPNADESLRI QIQQGLLAHY QRINGDYCRL FTGMDVLLDW LELQQLPFGV ITNKPARFTR PLVKKLNLHQ RMQVVISGDS TRYAKPHTAP MLLGAQQLNC APEHILYLGD AERDLLAAKA AGMVGGVALW GYLGEEDKPQ NWPALAQFSS PLAVHQALVA TR
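
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string might be cleaned, checked for GC content, and translated is given below. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted); the helper names are hypothetical, and only the first two blocks of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Assumption: table 11 shares
# these codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "ATGAGTTTGT CACAGGTTAA"  # first two blocks of the gene sequence
print(translate(head))  # MSLSQV
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.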
|
| |