Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_2083 |
Symbol | |
ID | 4252656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2481206 |
End bp | 2483143 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 638118707 |
Product | 3-phytase |
Protein accession | YP_734213 |
Protein GI | 113970420 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.829168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACAG CATTTACCCA AATTAGTGCC ATAGCACTGA CCTTAGGCCT TATCAGTGCG CCCTTGAGCG CCTTAGCTCA ACCCGTCTCA GCCCAAGCTT ATAAAGGCAG CCAAGCCCAA GCCTTGCTGT TTAACAACGA TAACAACGCA GTAATCCCTG CAGTCGATTG GCTGTGGGTG AGTGAAAAGC AAGGTTTGAT GCGACAAAGC ATGCCAAGCG CAGATAAAAC GCCCCTGCCC GCCGCGAAAA CCTTAGTTAA GGGTGAGTTT GAGCTGTTAG CCTTTAGCGA TCCCTATGCT TTAACGCTCG ATCGCCGCGC CGATCGCATT CGCCCTATCA TGATCAAACA GATAGAAGGC AAAGTCGCGA CCAATGTATT GCCATTGCTG CCGATGGCAT CGTTTGAAAT CAATTGGATC TGCATTCAGC CAAGGCCGCA GGATGGCAAT ATCTATGCGT GGTTTGGCGG CGAAGACGGT TATAGCGAGC AATGGCTGTT AGGCGATGCC GGACACTTTA TGCCGAAAAA GCTGCGCAGT CAGGCGATTC CCGTCAACAG CACCAGCTGC GCCATCGATG GCGACAAACT GCTAGTCAGT GAGCCAGAAG CCGGTGTCTG GCAATTTGAT GCCAGCCCCT TTGCCGACAA TAGCGCTAAG TTGGTTCTCG CCGCACTCAA TAATGATATC GCGGGCATGC AAGTCATCGA TGGCAAGTTA TTGTTAAGCG ATAAAAAAGG CGCCATTACC TTAGATAAAC AGGCTATTGC GAACTACGAC TTAGGGAAGG TACAGGGATT CAGTGGTTAT CTAAGCGGAA CCAGCACCAA CAAGGCAATC CAGTTTGCTC TCTATGATGA TAAAACCGAT CAGTACTTAT TTAGCCAAGC AGTCTTACCT AAAGACCTGA GTCAATCAAA TAACGAGTCT AGCGATAATA TTATCGAAAT CCCTGCTTGG GTTGAAAGCG CGCCATCCGA TAGGCCCGGC GATACCATGG ACGACCCTGC GATTTGGGTG CATCCAACTC AACCAGAAAA AAGCCTAGTG CTTGGCACCA ACAAACGTTG GGGACTCTTG AGCTTTAATA TGCGTGGCGA GCAGGTTCAA GCCCTGCCAT CGGGGCGTAT CAATAACGTC GATCTGCGCC AGCAAGTGAT GCTGGGCGGT AAAAAACGCG ATATCGCGGT GGCGACCTTA AGAGATAACG ACAGCCTCGC CTTCTATGAA ATCGACGCTG AAGGGAAGCT CAACGAGTAT TCCAATCAAG CCACCAATAT GGTGGATATT TATGGTCTGT GCCTGTATCA AGATGCCGAT ACCCTGTATG TGTTCGCCAA TGAAAAGTCT GGCCGCATCG CCCAATACCG CGTCGATTGG CAGGCAAATG GCCCAAGTAT CGCGCTGGTG CGCGATATTC ATACTCCAAG CCAAGTCGAA GGTTGTGTGG TCGATGAGGC TCAGCACGCG CTATTTATCG GTGAAGAGGA CAAAGGCATT TGGCGCTTTA ATGCCAAGGC CAATGGCGGC ACCCAAGGCG AACTCATCAT TAAAGCCGAG GGCGACTTAG TCCCTGATGT CGAAGGGATT TCACTCTACC AAGGCGCAAC TATTCACGGT AAAAAGCAGG ATCTGCTGGT GGTCTCCAGC CAAGGCGATA ACAGCTACTT GCTGTATCAG GCGCAGCCGC CCTACGCCCA ATTAGGCAAA TTCCGCATTG GAATGAACCT TAACGGGATG GAAAATGGCC GTGAAACCAG TATCGATGCG AGCAGTGAAA CCGATGGCTT AGCCGTGACT CACTTAAGCG TCGGCACGGG AGCTTGGCAA CAGGGAATGC TAGTAGTACA GGATGGGCAT AATCATTTAC CCGATAATAA TCAATCCTTT AAATGGCTGC CTTGGCGCAG TATTGTCGAG AAGTTATCGC TTAACTGA
|
Protein sequence | MTTAFTQISA IALTLGLISA PLSALAQPVS AQAYKGSQAQ ALLFNNDNNA VIPAVDWLWV SEKQGLMRQS MPSADKTPLP AAKTLVKGEF ELLAFSDPYA LTLDRRADRI RPIMIKQIEG KVATNVLPLL PMASFEINWI CIQPRPQDGN IYAWFGGEDG YSEQWLLGDA GHFMPKKLRS QAIPVNSTSC AIDGDKLLVS EPEAGVWQFD ASPFADNSAK LVLAALNNDI AGMQVIDGKL LLSDKKGAIT LDKQAIANYD LGKVQGFSGY LSGTSTNKAI QFALYDDKTD QYLFSQAVLP KDLSQSNNES SDNIIEIPAW VESAPSDRPG DTMDDPAIWV HPTQPEKSLV LGTNKRWGLL SFNMRGEQVQ ALPSGRINNV DLRQQVMLGG KKRDIAVATL RDNDSLAFYE IDAEGKLNEY SNQATNMVDI YGLCLYQDAD TLYVFANEKS GRIAQYRVDW QANGPSIALV RDIHTPSQVE GCVVDEAQHA LFIGEEDKGI WRFNAKANGG TQGELIIKAE GDLVPDVEGI SLYQGATIHG KKQDLLVVSS QGDNSYLLYQ AQPPYAQLGK FRIGMNLNGM ENGRETSIDA SSETDGLAVT HLSVGTGAWQ QGMLVVQDGH NHLPDNNQSF KWLPWRSIVE KLSLN
|
| |