Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swoo_3650 |
Symbol | |
ID | 6117984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella woodyi ATCC 51908 |
Kingdom | Bacteria |
Replicon accession | NC_010506 |
Strand | - |
Start bp | 4443330 |
End bp | 4444778 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641635201 |
Product | sulfatase |
Protein accession | YP_001762007 |
Protein GI | 170727981 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00127194 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00847064 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGAAAC ACTATCCTCT TTTATTAGTA AGCCTTGTCA GCTTCTGTGC GTTTGCTGAC TCTAAACCTG AATTGCAACT TGAGAAGTCA TCTAATGGCG CTGATCAAAA CGTGCTTGTC CTGCTGATAG ATGATCTTGG ATGGACAGAT TTGGGCGCCT ATGGCAGTCA ATATTACGAG TCTCCAAACA TAGATGCTTT AGCATCACAG AGTCGGCTCT ATACTCAAGC TTACTCCTCA TCCCCTGTTT GTTCTCCCTC TCGTGCAGCC TTAATGACGG GAAAACACCC CAGTAAACTC AAAATTACGA CCCATTTTCC AGGTTATAAA GCTAAGTCTC CTAAGTTGAA GGAACCGTGG AAAGCAGATC ACTTGGCATT AACTGAACTC ACCTTAGCCG AAGCCTTTAA ATCCCAAGGC TACGAGACTT TTTTTGCGGG GAAGTGGCAT ATGGGAGGTG AGGGCTACCT ACCGACAGAT CAAGGTTTTG ATATCAATAT TGGTGGCATG CATCGTGGCT CTCCACCTGG TGGGTATTAT GATCCCTATA AAAACCCTAA TCTTCCGAAT CGTAACAAAG GTGAGCACCT GACTAAGCGG CTGACCGATG AAACCATAGA CTTTTTATCT CAAAAGCATG AGAAACCTTT TTTTGCCCTC TTGTCATATT ACGGGGTGCA TACTCCCCTG CAGGCGGGGC CTGATAAGTT GGCTTACTTT AAGGAGAAAA CCAATACCGT GGCAGGTGAG AAAGCTTTTC TAATCGATAA GGGCCATCAA AGCCGAACTC AGATAAATCA AGTCGATGCC AACTACGCCT CCATGATCTG GGCGGTAGAC AAGTCTGTTG GCCGTATACT TGAGTCGCTT GAAAAGCAGG GGCTAGATAA GAACACTTTG GTGGTACTAA CCTCTGATAA TGGTGGTTTC TCAACACGTC ATCAGGGGGA TGAAAGAGTA ACATCGACTG CCAATCTACC ACTGCGCTCT GGTAAAGGTT GGGTCTATGA AGGGGGAGTG CGTATTCCTC TGCTTATTCA TCAACCCGGT CAGCAAATTC AGAGTCAGCA CGATACCTTG ACGACATCGG CTGATCTCTA TCCAACATTG GCTAATGTTG CTGGTGCTAA GATCCCAGAG GGGATTGATG GCTCAGATAT CTTCTTATTG GATGAGGAGC CTGAACTTGC TAAGCAGCGA GTCATAGTCT GGCATCATCC GCATTATCAT GGCAGCGGTA ACAAGCCAAG TGCCGCTATT CGTGTGGGAG ATTGGAAACT ATTGCATTTT TACGAGCAAG ATAGGGTGGA GTTATACAAC CTAAGCAATG ACATCGCTGA GCAGGTCAAT CTTGAGCAGT TAGAGCCTAA GCGACGAGCG CATTTACTTG CACTACTGGA TGAGTGGTAT CGTGATAATG ATATCGAGCA AGTCAGCTTG CTTGAGTAA
|
Protein sequence | MKKHYPLLLV SLVSFCAFAD SKPELQLEKS SNGADQNVLV LLIDDLGWTD LGAYGSQYYE SPNIDALASQ SRLYTQAYSS SPVCSPSRAA LMTGKHPSKL KITTHFPGYK AKSPKLKEPW KADHLALTEL TLAEAFKSQG YETFFAGKWH MGGEGYLPTD QGFDINIGGM HRGSPPGGYY DPYKNPNLPN RNKGEHLTKR LTDETIDFLS QKHEKPFFAL LSYYGVHTPL QAGPDKLAYF KEKTNTVAGE KAFLIDKGHQ SRTQINQVDA NYASMIWAVD KSVGRILESL EKQGLDKNTL VVLTSDNGGF STRHQGDERV TSTANLPLRS GKGWVYEGGV RIPLLIHQPG QQIQSQHDTL TTSADLYPTL ANVAGAKIPE GIDGSDIFLL DEEPELAKQR VIVWHHPHYH GSGNKPSAAI RVGDWKLLHF YEQDRVELYN LSNDIAEQVN LEQLEPKRRA HLLALLDEWY RDNDIEQVSL LE
|
| |