Gene Shewmr4_3391 details

Gene Information

Locus tag: Shewmr4_3391
Symbol:
ID: 4253957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 4045472
End bp: 4046989
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 47%
IMG OID: 638120029
Product: sulfatase
Protein accession: YP_735514
Protein GI: 113971721
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
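
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4045472
END_BP = 4046989


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1518
print(expected_protein_length(length_bp))  # 505
```

Both results match the listed Gene Length (1518 bp) and Protein Length (505 aa).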


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.169259
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACAG GAAAAAAACC GCGCTCAAAT AAGTTTGTGC TAAATGCTTG CACACTTGCT 
TTGGGCGCCG CGTCAGTGAC AGCCTATGCT GCAGATAAAC CAAATATTCT GGTAATTTTT
GGTGATGATG TGGGTTACTG GAACTTAAGT ACCTACAACA ATGGAATGTT GGCATATAGC
ACGCCCAATA TTGACAGTAT TGCCGCAGAA GGAACCAAGT TTACTAACTT TTATGCTCAG
CAGAGCTCTA CCGCGGGTCG TAGTGCCTTT ATTACTGGCC AGATGCCTAA GCGTACAGGT
CTATCTAAAG TCGGTATGCC AGGCGCCCCC GAAGGGATCA GCGAAAAAGA TCCCACGATT
GCAACTATGT TAAAAAATTT AGGTTATGCA ACGGGTCAAT TTGGTAAAAA CCACTTAGGC
GACCGCGACG AGCATTTACC GACTAACCAT GGTTTCGACG AATTCTTTGG TAACTTGTAC
CACCTCAATG CCGAAGAAGA GCCTGAAAAC GTCGATTATC CTAAAGATCC TGCCTTCCGT
AAGAAGTTTG GCCCACGTGG CGTCATCCAT TCCTATGCCG ATGGCAAAAT CGAAGATACA
GGTCCACTGA CCCGTAAGCG TATGGAAACC GTCGATGGCG AGTTCCTCGA TGCTGCCGAA
ACCTTTATCG AGAAGCAAGT TAAGGCGGAT AAACCCTTCT TCACTTGGTT TAACACCACG
CGTATGCACA ACTATACCCA TGTGCCTGAC TCCTATGCCG GTAAAACCGG TGCCGGATTC
TATGCCGATG GTATGAAGCA GCACGACGAT GAGGTGGGTA AGTTACTGAA GAAGATCAAA
GATCTGGGTA TTGATGATAA CACTATCATC ATCTACACCT CGGATAACGG TCCCATGGTC
GACATGTGGC CTGATGCGGG TGTGACGCCC TTCCGCAGTG AGAAAAACAC CGGTTGGGAA
GGTGCATTCC GTGTGCCTGG CATGATCAAA TGGCCTGGAC ACATCAAGCC TGGAACCACT
AAAAACGGTA TGGTCTCTTT GGAAGACTTT TTCCCAACCT TAGTCGCCGC AGCGGGCGGT
GAAGGCGTTG AGAAAGAATT ACTCAAGGGT AAAAAGGTCG GTAAACAAAC CTACAAAGTG
CACCTCGATG GTTACAACCA ACTGCCTTAC TTCACCGATA AGACTCAAGA GTCGGCTCGT
AAAGAGTTCG TCTACTGGAG TGACGACGGT GACTTATTGG CATTGCGTTA TAACCAATAC
AAGTTCCACT TCATGATCCA AGAACATGAA ACGGGCTTTG CGGTTTGGCA ATATCCATTC
ACTAAGTTGC GTGTACCTTT GATTTTCGAC CTCAGTGTCG ATCCATTCGA GAAGGGTGAC
AAAGGTATGG GCTACAACAC TTGGATGTAC GAACGTGCAT TCTTGATGGG CCCTGCCATG
GCTAAGGTTG CTGAAGTGAT GGAAAGCTTT AAAGAGTTCC CACCGCGGAT GGAAGCTGGT
ACGTTTGTTC CTAGATAA
 
Protein sequence
MSTGKKPRSN KFVLNACTLA LGAASVTAYA ADKPNILVIF GDDVGYWNLS TYNNGMLAYS 
TPNIDSIAAE GTKFTNFYAQ QSSTAGRSAF ITGQMPKRTG LSKVGMPGAP EGISEKDPTI
ATMLKNLGYA TGQFGKNHLG DRDEHLPTNH GFDEFFGNLY HLNAEEEPEN VDYPKDPAFR
KKFGPRGVIH SYADGKIEDT GPLTRKRMET VDGEFLDAAE TFIEKQVKAD KPFFTWFNTT
RMHNYTHVPD SYAGKTGAGF YADGMKQHDD EVGKLLKKIK DLGIDDNTII IYTSDNGPMV
DMWPDAGVTP FRSEKNTGWE GAFRVPGMIK WPGHIKPGTT KNGMVSLEDF FPTLVAAAGG
EGVEKELLKG KKVGKQTYKV HLDGYNQLPY FTDKTQESAR KEFVYWSDDG DLLALRYNQY
KFHFMIQEHE TGFAVWQYPF TKLRVPLIFD LSVDPFEKGD KGMGYNTWMY ERAFLMGPAM
AKVAEVMESF KEFPPRMEAG TFVPR
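
The sequences above are printed in space-separated groups of ten, so they need cleaning before any downstream use. The following is a minimal sketch, not the site's own method: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first two groups of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Drop all whitespace from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two groups of the gene sequence, used only as a short demo.
demo = clean_sequence("ATGAGTACAG GAAAAAAACC")
print(len(demo))          # 20
print(gc_fraction(demo))  # 0.35
```

Applied to the full gene sequence, `gc_fraction` gives a value that can be compared with the GC content listed in the record (47%). How the record rounds that figure is not stated, so a small difference would not necessarily indicate an error.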