Gene Shewmr4_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2083 
Symbol 
ID4252656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2481206 
End bp2483143 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content50% 
IMG OID638118707 
Product3-phytase 
Protein accessionYP_734213 
Protein GI113970420 
COG category[I] Lipid transport and metabolism 
COG ID[COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.829168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAG CATTTACCCA AATTAGTGCC ATAGCACTGA CCTTAGGCCT TATCAGTGCG 
CCCTTGAGCG CCTTAGCTCA ACCCGTCTCA GCCCAAGCTT ATAAAGGCAG CCAAGCCCAA
GCCTTGCTGT TTAACAACGA TAACAACGCA GTAATCCCTG CAGTCGATTG GCTGTGGGTG
AGTGAAAAGC AAGGTTTGAT GCGACAAAGC ATGCCAAGCG CAGATAAAAC GCCCCTGCCC
GCCGCGAAAA CCTTAGTTAA GGGTGAGTTT GAGCTGTTAG CCTTTAGCGA TCCCTATGCT
TTAACGCTCG ATCGCCGCGC CGATCGCATT CGCCCTATCA TGATCAAACA GATAGAAGGC
AAAGTCGCGA CCAATGTATT GCCATTGCTG CCGATGGCAT CGTTTGAAAT CAATTGGATC
TGCATTCAGC CAAGGCCGCA GGATGGCAAT ATCTATGCGT GGTTTGGCGG CGAAGACGGT
TATAGCGAGC AATGGCTGTT AGGCGATGCC GGACACTTTA TGCCGAAAAA GCTGCGCAGT
CAGGCGATTC CCGTCAACAG CACCAGCTGC GCCATCGATG GCGACAAACT GCTAGTCAGT
GAGCCAGAAG CCGGTGTCTG GCAATTTGAT GCCAGCCCCT TTGCCGACAA TAGCGCTAAG
TTGGTTCTCG CCGCACTCAA TAATGATATC GCGGGCATGC AAGTCATCGA TGGCAAGTTA
TTGTTAAGCG ATAAAAAAGG CGCCATTACC TTAGATAAAC AGGCTATTGC GAACTACGAC
TTAGGGAAGG TACAGGGATT CAGTGGTTAT CTAAGCGGAA CCAGCACCAA CAAGGCAATC
CAGTTTGCTC TCTATGATGA TAAAACCGAT CAGTACTTAT TTAGCCAAGC AGTCTTACCT
AAAGACCTGA GTCAATCAAA TAACGAGTCT AGCGATAATA TTATCGAAAT CCCTGCTTGG
GTTGAAAGCG CGCCATCCGA TAGGCCCGGC GATACCATGG ACGACCCTGC GATTTGGGTG
CATCCAACTC AACCAGAAAA AAGCCTAGTG CTTGGCACCA ACAAACGTTG GGGACTCTTG
AGCTTTAATA TGCGTGGCGA GCAGGTTCAA GCCCTGCCAT CGGGGCGTAT CAATAACGTC
GATCTGCGCC AGCAAGTGAT GCTGGGCGGT AAAAAACGCG ATATCGCGGT GGCGACCTTA
AGAGATAACG ACAGCCTCGC CTTCTATGAA ATCGACGCTG AAGGGAAGCT CAACGAGTAT
TCCAATCAAG CCACCAATAT GGTGGATATT TATGGTCTGT GCCTGTATCA AGATGCCGAT
ACCCTGTATG TGTTCGCCAA TGAAAAGTCT GGCCGCATCG CCCAATACCG CGTCGATTGG
CAGGCAAATG GCCCAAGTAT CGCGCTGGTG CGCGATATTC ATACTCCAAG CCAAGTCGAA
GGTTGTGTGG TCGATGAGGC TCAGCACGCG CTATTTATCG GTGAAGAGGA CAAAGGCATT
TGGCGCTTTA ATGCCAAGGC CAATGGCGGC ACCCAAGGCG AACTCATCAT TAAAGCCGAG
GGCGACTTAG TCCCTGATGT CGAAGGGATT TCACTCTACC AAGGCGCAAC TATTCACGGT
AAAAAGCAGG ATCTGCTGGT GGTCTCCAGC CAAGGCGATA ACAGCTACTT GCTGTATCAG
GCGCAGCCGC CCTACGCCCA ATTAGGCAAA TTCCGCATTG GAATGAACCT TAACGGGATG
GAAAATGGCC GTGAAACCAG TATCGATGCG AGCAGTGAAA CCGATGGCTT AGCCGTGACT
CACTTAAGCG TCGGCACGGG AGCTTGGCAA CAGGGAATGC TAGTAGTACA GGATGGGCAT
AATCATTTAC CCGATAATAA TCAATCCTTT AAATGGCTGC CTTGGCGCAG TATTGTCGAG
AAGTTATCGC TTAACTGA
 
Protein sequence
MTTAFTQISA IALTLGLISA PLSALAQPVS AQAYKGSQAQ ALLFNNDNNA VIPAVDWLWV 
SEKQGLMRQS MPSADKTPLP AAKTLVKGEF ELLAFSDPYA LTLDRRADRI RPIMIKQIEG
KVATNVLPLL PMASFEINWI CIQPRPQDGN IYAWFGGEDG YSEQWLLGDA GHFMPKKLRS
QAIPVNSTSC AIDGDKLLVS EPEAGVWQFD ASPFADNSAK LVLAALNNDI AGMQVIDGKL
LLSDKKGAIT LDKQAIANYD LGKVQGFSGY LSGTSTNKAI QFALYDDKTD QYLFSQAVLP
KDLSQSNNES SDNIIEIPAW VESAPSDRPG DTMDDPAIWV HPTQPEKSLV LGTNKRWGLL
SFNMRGEQVQ ALPSGRINNV DLRQQVMLGG KKRDIAVATL RDNDSLAFYE IDAEGKLNEY
SNQATNMVDI YGLCLYQDAD TLYVFANEKS GRIAQYRVDW QANGPSIALV RDIHTPSQVE
GCVVDEAQHA LFIGEEDKGI WRFNAKANGG TQGELIIKAE GDLVPDVEGI SLYQGATIHG
KKQDLLVVSS QGDNSYLLYQ AQPPYAQLGK FRIGMNLNGM ENGRETSIDA SSETDGLAVT
HLSVGTGAWQ QGMLVVQDGH NHLPDNNQSF KWLPWRSIVE KLSLN