Gene Shewmr4_2219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2219 
Symbol 
ID4252792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2650105 
End bp2651895 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content49% 
IMG OID638118845 
Productsulfatase 
Protein accessionYP_734349 
Protein GI113970556 
COG category[R] General function prediction only 
COG ID[COG3083] Predicted hydrolase of alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.198637 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.459897 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGAGC GAAAAAAGCA AATGAGCCGC GATCGCGTGT CACGACTCAT CAACTGGGGA 
CATTGGTTCG CCTTCTTTAA TGGCCTGTTG GCCATGATTG TCGGCACACG CTATCTGAGC
AGTGTGGGTT ATCCCGAAAC CTGGTTTGGC TGGGGCTACC TCGCCGTCAG CACCATTGGC
CAGTTCAGTT TTCTTGCTTT TATCGCTTAC TTGATCTGCC TATTCCCGCT GACCTTAATC
TTGCCTTACT CCAAGATTTT AAGGGGCTTA GCGGCAGTGA CCGCCACCTT AAGCCTGTGT
ATCTTATTGT ATGACACCAT AGTGTATGCC GATTATGGCA TGCACTTGAG CCCCTTCGCC
TTTGACTTAG CCTGGGCCGA TTTAAATGCC CTGCTCCACG GCACCTCTTA TATTGTCACG
CCCATTGCCA TTTTGGTGAT TGAGCTAACG GCGGCTAACT TCCTGTGGAA ACGGATTGAG
AAAATCCAAA AGCTGAATCT TGGCAATAAA GTGATTACCT TTATTGGGGT GTGTTTTGTC
AGCAGCCATT TGATCCACAT TTGGGCCGAC GCGGCCGATA TCACTGAAAT TACCCGTTTC
GATGATACTT ATCCGCTGTC ATACCCCGCC ACGGCTCGCT CCTTTATGGA AAGCCATGGG
ATTGATGGTT CTTCGCAATC GGATGATGAA GCCAATCATG CGACCAGCAC GCTCAGTTAT
CCCGCACAGC CACTACAATG CCAAGCCGAC AGCAAACCCA ATGTGTTAAT GCTGACCATC
GATAGCTTAC GTGCCGACAT GGTGGACGCT AAGACCATGC CGTTTTTGCA TCAATACACT
GAGCAGAATC AGAGCTTTAC TCAGCATTAC AGTGGCGGTA ATCAATTTAG AACCGGCATG
TTCTCCCTGC TCTATGGCTT ACAAGGCAGC TATGGCGATG CGCGCATCTT CAATAGCACT
AGCCCAATCA TGACCCAAAG CTTTAAACAG GCGGGTTATC AGCTTGGCTT ATTTATCCCC
GAAACCAATC TGAATTTACG CTCGGCGCAG GCCATGTTTA ATGATTTTAC CCCTGTCATC
GCCAAAGAAA CCAATGGCAG TGCCGATGCG GATTTACGCA GCGTAGGCCA CTTTAAACAA
TGGCAAAGCG AGCAACAGAG CCCATGGTTT GCCCTCGTCA ACCTGAAGGC GCCGGAGAAT
TTTGATACCC CAGTCGGCTT CCTTGGCATC GAAACCGTCA AGGCCGATGC GAATTTGAAA
CCGGCCCAAA AGGTGCTGTT TAACCAATAT CGCCAATCGT TGAATTTTAT TGATAAGCAA
ATCCAAGCGA TAGTGAGTGA GTTGCCGAGC GATACCTTAG TGGTGATCAC CGGCGTTAAT
GGCAAAATTT TCACCAGCAA CAGCGACGAA GCCCAGCGCA ATCTGTCTCC CGAGAGTGTC
AGAGTGCCTA TGGTCATTCA TTGGCCCAAT GTCGGCGCCA GTAAGGTTAA ATACCGCACG
AGTCACTATG GTGTAGTGCC TACCTTGATG ACCCATATCT TAGGTTGCAC CAATAACACC
ACGGACTACA GCGCGGGCCG TAGCCTGTTG CAACCGAACC AAGAGACCTG GATTTACATC
GGCGACAGTC GCATTTTTGC CATTTACCAA CAGTCGGAAA TCACCGTCAT CGACCGCCAT
GGTAAATACC GCATTTACGA TGAAAACTTT GAGCACAGAC TGCATAAGAA GATGAGCGCG
CCTGAGCTTA TCCAAGTGAT GCGAGAGGGA CGTCGCCTCT ACAATCATTA A
 
Protein sequence
MVERKKQMSR DRVSRLINWG HWFAFFNGLL AMIVGTRYLS SVGYPETWFG WGYLAVSTIG 
QFSFLAFIAY LICLFPLTLI LPYSKILRGL AAVTATLSLC ILLYDTIVYA DYGMHLSPFA
FDLAWADLNA LLHGTSYIVT PIAILVIELT AANFLWKRIE KIQKLNLGNK VITFIGVCFV
SSHLIHIWAD AADITEITRF DDTYPLSYPA TARSFMESHG IDGSSQSDDE ANHATSTLSY
PAQPLQCQAD SKPNVLMLTI DSLRADMVDA KTMPFLHQYT EQNQSFTQHY SGGNQFRTGM
FSLLYGLQGS YGDARIFNST SPIMTQSFKQ AGYQLGLFIP ETNLNLRSAQ AMFNDFTPVI
AKETNGSADA DLRSVGHFKQ WQSEQQSPWF ALVNLKAPEN FDTPVGFLGI ETVKADANLK
PAQKVLFNQY RQSLNFIDKQ IQAIVSELPS DTLVVITGVN GKIFTSNSDE AQRNLSPESV
RVPMVIHWPN VGASKVKYRT SHYGVVPTLM THILGCTNNT TDYSAGRSLL QPNQETWIYI
GDSRIFAIYQ QSEITVIDRH GKYRIYDENF EHRLHKKMSA PELIQVMREG RRLYNH