Gene Information |
Locus tag | Shewmr4_3391 |
Symbol | |
ID | 4253957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 4045472 |
End bp | 4046989 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 638120029 |
Product | sulfatase |
Protein accession | YP_735514 |
Protein GI | 113971721 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
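The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Shewmr4_3391.
start_bp, end_bp = 4045472, 4046989
length_bp = gene_length(start_bp, end_bp)

print(length_bp)                  # 1518, matches "Gene Length"
print(protein_length(length_bp))  # 505, matches "Protein Length"
```

Under these assumptions the computed values agree with the reported Gene Length (1518 bp) and Protein Length (505 aa).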
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.169259 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACAG GAAAAAAACC GCGCTCAAAT AAGTTTGTGC TAAATGCTTG CACACTTGCT TTGGGCGCCG CGTCAGTGAC AGCCTATGCT GCAGATAAAC CAAATATTCT GGTAATTTTT GGTGATGATG TGGGTTACTG GAACTTAAGT ACCTACAACA ATGGAATGTT GGCATATAGC ACGCCCAATA TTGACAGTAT TGCCGCAGAA GGAACCAAGT TTACTAACTT TTATGCTCAG CAGAGCTCTA CCGCGGGTCG TAGTGCCTTT ATTACTGGCC AGATGCCTAA GCGTACAGGT CTATCTAAAG TCGGTATGCC AGGCGCCCCC GAAGGGATCA GCGAAAAAGA TCCCACGATT GCAACTATGT TAAAAAATTT AGGTTATGCA ACGGGTCAAT TTGGTAAAAA CCACTTAGGC GACCGCGACG AGCATTTACC GACTAACCAT GGTTTCGACG AATTCTTTGG TAACTTGTAC CACCTCAATG CCGAAGAAGA GCCTGAAAAC GTCGATTATC CTAAAGATCC TGCCTTCCGT AAGAAGTTTG GCCCACGTGG CGTCATCCAT TCCTATGCCG ATGGCAAAAT CGAAGATACA GGTCCACTGA CCCGTAAGCG TATGGAAACC GTCGATGGCG AGTTCCTCGA TGCTGCCGAA ACCTTTATCG AGAAGCAAGT TAAGGCGGAT AAACCCTTCT TCACTTGGTT TAACACCACG CGTATGCACA ACTATACCCA TGTGCCTGAC TCCTATGCCG GTAAAACCGG TGCCGGATTC TATGCCGATG GTATGAAGCA GCACGACGAT GAGGTGGGTA AGTTACTGAA GAAGATCAAA GATCTGGGTA TTGATGATAA CACTATCATC ATCTACACCT CGGATAACGG TCCCATGGTC GACATGTGGC CTGATGCGGG TGTGACGCCC TTCCGCAGTG AGAAAAACAC CGGTTGGGAA GGTGCATTCC GTGTGCCTGG CATGATCAAA TGGCCTGGAC ACATCAAGCC TGGAACCACT AAAAACGGTA TGGTCTCTTT GGAAGACTTT TTCCCAACCT TAGTCGCCGC AGCGGGCGGT GAAGGCGTTG AGAAAGAATT ACTCAAGGGT AAAAAGGTCG GTAAACAAAC CTACAAAGTG CACCTCGATG GTTACAACCA ACTGCCTTAC TTCACCGATA AGACTCAAGA GTCGGCTCGT AAAGAGTTCG TCTACTGGAG TGACGACGGT GACTTATTGG CATTGCGTTA TAACCAATAC AAGTTCCACT TCATGATCCA AGAACATGAA ACGGGCTTTG CGGTTTGGCA ATATCCATTC ACTAAGTTGC GTGTACCTTT GATTTTCGAC CTCAGTGTCG ATCCATTCGA GAAGGGTGAC AAAGGTATGG GCTACAACAC TTGGATGTAC GAACGTGCAT TCTTGATGGG CCCTGCCATG GCTAAGGTTG CTGAAGTGAT GGAAAGCTTT AAAGAGTTCC CACCGCGGAT GGAAGCTGGT ACGTTTGTTC CTAGATAA
|
Protein sequence | MSTGKKPRSN KFVLNACTLA LGAASVTAYA ADKPNILVIF GDDVGYWNLS TYNNGMLAYS TPNIDSIAAE GTKFTNFYAQ QSSTAGRSAF ITGQMPKRTG LSKVGMPGAP EGISEKDPTI ATMLKNLGYA TGQFGKNHLG DRDEHLPTNH GFDEFFGNLY HLNAEEEPEN VDYPKDPAFR KKFGPRGVIH SYADGKIEDT GPLTRKRMET VDGEFLDAAE TFIEKQVKAD KPFFTWFNTT RMHNYTHVPD SYAGKTGAGF YADGMKQHDD EVGKLLKKIK DLGIDDNTII IYTSDNGPMV DMWPDAGVTP FRSEKNTGWE GAFRVPGMIK WPGHIKPGTT KNGMVSLEDF FPTLVAAAGG EGVEKELLKG KKVGKQTYKV HLDGYNQLPY FTDKTQESAR KEFVYWSDDG DLLALRYNQY KFHFMIQEHE TGFAVWQYPF TKLRVPLIFD LSVDPFEKGD KGMGYNTWMY ERAFLMGPAM AKVAEVMESF KEFPPRMEAG TFVPR
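The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for stripping that grouping and computing a GC fraction is shown below; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGAGTACAG GAAAAAAACC")

print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.35
```

Applying the same two functions to the complete gene sequence would give a value to compare against the reported GC content field.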