Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_2219 |
Symbol | |
ID | 4252792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2650105 |
End bp | 2651895 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638118845 |
Product | sulfatase |
Protein accession | YP_734349 |
Protein GI | 113970556 |
COG category | [R] General function prediction only |
COG ID | [COG3083] Predicted hydrolase of alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.198637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.459897 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGAGC GAAAAAAGCA AATGAGCCGC GATCGCGTGT CACGACTCAT CAACTGGGGA CATTGGTTCG CCTTCTTTAA TGGCCTGTTG GCCATGATTG TCGGCACACG CTATCTGAGC AGTGTGGGTT ATCCCGAAAC CTGGTTTGGC TGGGGCTACC TCGCCGTCAG CACCATTGGC CAGTTCAGTT TTCTTGCTTT TATCGCTTAC TTGATCTGCC TATTCCCGCT GACCTTAATC TTGCCTTACT CCAAGATTTT AAGGGGCTTA GCGGCAGTGA CCGCCACCTT AAGCCTGTGT ATCTTATTGT ATGACACCAT AGTGTATGCC GATTATGGCA TGCACTTGAG CCCCTTCGCC TTTGACTTAG CCTGGGCCGA TTTAAATGCC CTGCTCCACG GCACCTCTTA TATTGTCACG CCCATTGCCA TTTTGGTGAT TGAGCTAACG GCGGCTAACT TCCTGTGGAA ACGGATTGAG AAAATCCAAA AGCTGAATCT TGGCAATAAA GTGATTACCT TTATTGGGGT GTGTTTTGTC AGCAGCCATT TGATCCACAT TTGGGCCGAC GCGGCCGATA TCACTGAAAT TACCCGTTTC GATGATACTT ATCCGCTGTC ATACCCCGCC ACGGCTCGCT CCTTTATGGA AAGCCATGGG ATTGATGGTT CTTCGCAATC GGATGATGAA GCCAATCATG CGACCAGCAC GCTCAGTTAT CCCGCACAGC CACTACAATG CCAAGCCGAC AGCAAACCCA ATGTGTTAAT GCTGACCATC GATAGCTTAC GTGCCGACAT GGTGGACGCT AAGACCATGC CGTTTTTGCA TCAATACACT GAGCAGAATC AGAGCTTTAC TCAGCATTAC AGTGGCGGTA ATCAATTTAG AACCGGCATG TTCTCCCTGC TCTATGGCTT ACAAGGCAGC TATGGCGATG CGCGCATCTT CAATAGCACT AGCCCAATCA TGACCCAAAG CTTTAAACAG GCGGGTTATC AGCTTGGCTT ATTTATCCCC GAAACCAATC TGAATTTACG CTCGGCGCAG GCCATGTTTA ATGATTTTAC CCCTGTCATC GCCAAAGAAA CCAATGGCAG TGCCGATGCG GATTTACGCA GCGTAGGCCA CTTTAAACAA TGGCAAAGCG AGCAACAGAG CCCATGGTTT GCCCTCGTCA ACCTGAAGGC GCCGGAGAAT TTTGATACCC CAGTCGGCTT CCTTGGCATC GAAACCGTCA AGGCCGATGC GAATTTGAAA CCGGCCCAAA AGGTGCTGTT TAACCAATAT CGCCAATCGT TGAATTTTAT TGATAAGCAA ATCCAAGCGA TAGTGAGTGA GTTGCCGAGC GATACCTTAG TGGTGATCAC CGGCGTTAAT GGCAAAATTT TCACCAGCAA CAGCGACGAA GCCCAGCGCA ATCTGTCTCC CGAGAGTGTC AGAGTGCCTA TGGTCATTCA TTGGCCCAAT GTCGGCGCCA GTAAGGTTAA ATACCGCACG AGTCACTATG GTGTAGTGCC TACCTTGATG ACCCATATCT TAGGTTGCAC CAATAACACC ACGGACTACA GCGCGGGCCG TAGCCTGTTG CAACCGAACC AAGAGACCTG GATTTACATC GGCGACAGTC GCATTTTTGC CATTTACCAA CAGTCGGAAA TCACCGTCAT CGACCGCCAT GGTAAATACC GCATTTACGA TGAAAACTTT GAGCACAGAC TGCATAAGAA GATGAGCGCG CCTGAGCTTA TCCAAGTGAT GCGAGAGGGA CGTCGCCTCT ACAATCATTA A
|
Protein sequence | MVERKKQMSR DRVSRLINWG HWFAFFNGLL AMIVGTRYLS SVGYPETWFG WGYLAVSTIG QFSFLAFIAY LICLFPLTLI LPYSKILRGL AAVTATLSLC ILLYDTIVYA DYGMHLSPFA FDLAWADLNA LLHGTSYIVT PIAILVIELT AANFLWKRIE KIQKLNLGNK VITFIGVCFV SSHLIHIWAD AADITEITRF DDTYPLSYPA TARSFMESHG IDGSSQSDDE ANHATSTLSY PAQPLQCQAD SKPNVLMLTI DSLRADMVDA KTMPFLHQYT EQNQSFTQHY SGGNQFRTGM FSLLYGLQGS YGDARIFNST SPIMTQSFKQ AGYQLGLFIP ETNLNLRSAQ AMFNDFTPVI AKETNGSADA DLRSVGHFKQ WQSEQQSPWF ALVNLKAPEN FDTPVGFLGI ETVKADANLK PAQKVLFNQY RQSLNFIDKQ IQAIVSELPS DTLVVITGVN GKIFTSNSDE AQRNLSPESV RVPMVIHWPN VGASKVKYRT SHYGVVPTLM THILGCTNNT TDYSAGRSLL QPNQETWIYI GDSRIFAIYQ QSEITVIDRH GKYRIYDENF EHRLHKKMSA PELIQVMREG RRLYNH
|
| |