Gene Information |
Locus tag | Swoo_1752 |
Symbol | |
ID | 6116015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella woodyi ATCC 51908 |
Kingdom | Bacteria |
Replicon accession | NC_010506 |
Strand | + |
Start bp | 2177122 |
End bp | 2178882 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641633256 |
Product | arylsulfotransferase |
Protein accession | YP_001760132 |
Protein GI | 170726106 |
COG category | |
COG ID | |
TIGRFAM ID | |
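
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length of an unspliced CDS includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical and only illustrate the arithmetic.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length includes one terminal stop codon,
    which does not code for an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Swoo_1752.
length_bp = gene_length(2177122, 2178882)
print(length_bp)                  # 1761
print(protein_length(length_bp))  # 586
```

Under these assumptions the computed values match the listed gene length (1761 bp) and protein length (586 aa).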
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00146407 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00673142 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATTTAA AACCTTTAGC TATCTGCACT TTGCTTGTTT CAGGCGCTTT GCTATCGAGT GTTTCACATA CTGTTATTGC TGGAGGGGAT CTTCCTGGTA CCTCTTCATC AGCTGTTGCC CAAGGTCAAT TAGGCTTTGT GTATGTTGAC CCCTATCATT TCAGTCCTCT AGTTGCCTTA ATCGATTTAG GTGGTAAGCA GATATCTGAT GTGTCAGTTG AGGTGAAAGG GAGTGGCAAG AAGGGAGTCG ATATTCGATA TCAAGTCAGT CAAAACCAGC TTAATACCCA TGATGGGATA CCTGTGTTTG GGCTCTACCC TGATCTGCTA AATACAGTCT ATGTTCGTTA TAAGCTCAAT GGAGCTTTTG TTAACGAAAC CTATAGGATC AGAACCAGTG CATTGCCACA TATCCATATT GATGGACAAG AGCGTAGTCA GTACGAGTAT GAAGCAGTAA CAGTTGCTGA TGGATTTCAG AATCGGCTCT ATCTGGTTGA CGGACAGGTG ATTCCGCCCA ATGTACGTGA TCACTGGGAT TGGCTGCCTA CCCAAAATAT CATAGATACC AATGGTGATG TACGTTGGCA TCTCAATGCT GGGGTGATCT ATGACCGTCC AGGTGGCTCT ATGTCATTTA AGCAAACATC AGATGGAAAA TTGATATTTG GTCAGGGAGT GATGTTTGAG CAATATGCAG GTTTTAACTC AGCTTATTAC GCAAAGTATG ACTTTATTGG TAAGCCCATC TTAAAGCGTA CTTTACCTCG TGGTTTTATC GGTTTATCCC ATGAGATAAC AGAGATGCCT AATGGACACC TGTTATTACG AGTGGGGAAA AAAGATTATC TGACAAAAGA GGGCCTTAAG CTTGATACTG TGCGTGACCA TGTGATTGAG GTCGATAGTA ATGGCGATGT GGTTAAGGTG TGGGATTTCA ACAAAATCTT AGATCCGCTT CGAGATACGG TTCTTAAATC TCTGGATATG GGAGCTGTGT GCCTTAATGT TGATGTTAAA GAAGCTGGGA AGACTAGAAG TGCTGAACAA CTGGCAAATG AGCCGTTTGG CGATGTAGCA GGTGTTGGAA CTGGCCGTAA TTGGATCCAT ATTAACTCCA TTGGTTATGA TGCAAGTGAT GACAGCATTA TCATCAGCAG TCGACATCAA TCAGCTGTGG TGAAAGTTGG ACGGGATGAT GAGGTTAAGT GGATCTTGGG AACACCTGAA GGTTGGAGCG ATGCTTTGGC TGCTAAAGTA TTAACCCCTG TTGATAGCCA AGGTGAGCAG CTTAGGTGTG ATAAAGGGGG CTGTGATGGC GAATTTGATT GGAGCTGGAC ACAACATACG GCATGGCCAG TCCCTGAGCG CGGCACTGTG ACAGTGTTTG ATAACGGTGA TGGACGTGGG CTTAAGCAAC CATTTTTTGC TACCGATAAA TATTCTCGCG GTGTTGAGTA TAAAGTCGAC ATGGATAAGA TGACAGTACA ACAGGTGTGG GAGTACGGCA AAGAGCGAGG TTATGAGTGG TATAGTCCGA TAACATCGAT AACCCAGTGG CAAGCAGACC ATCAAACCAT GTTTATGGCC TCAGCTTCTG CAGGGCTGCT AGAGAGTGAG AAAGCCCCTG AACACTGGAT AACTGAAGTG GATCCTAAGA CTAATGAGGT GAAGGTTGAA ATTAAGGTTA AGACTTTAAC TAAGCATGAG CCTGGCTACA GAAGCACCGT TGTTCACCCT GAGAAGATGT TCAGTCAGTA A
|
Protein sequence | MNLKPLAICT LLVSGALLSS VSHTVIAGGD LPGTSSSAVA QGQLGFVYVD PYHFSPLVAL IDLGGKQISD VSVEVKGSGK KGVDIRYQVS QNQLNTHDGI PVFGLYPDLL NTVYVRYKLN GAFVNETYRI RTSALPHIHI DGQERSQYEY EAVTVADGFQ NRLYLVDGQV IPPNVRDHWD WLPTQNIIDT NGDVRWHLNA GVIYDRPGGS MSFKQTSDGK LIFGQGVMFE QYAGFNSAYY AKYDFIGKPI LKRTLPRGFI GLSHEITEMP NGHLLLRVGK KDYLTKEGLK LDTVRDHVIE VDSNGDVVKV WDFNKILDPL RDTVLKSLDM GAVCLNVDVK EAGKTRSAEQ LANEPFGDVA GVGTGRNWIH INSIGYDASD DSIIISSRHQ SAVVKVGRDD EVKWILGTPE GWSDALAAKV LTPVDSQGEQ LRCDKGGCDG EFDWSWTQHT AWPVPERGTV TVFDNGDGRG LKQPFFATDK YSRGVEYKVD MDKMTVQQVW EYGKERGYEW YSPITSITQW QADHQTMFMA SASAGLLESE KAPEHWITEV DPKTNEVKVE IKVKTLTKHE PGYRSTVVHP EKMFSQ
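
The sequences above are displayed in space-separated blocks of 10 residues, so they need to be joined before any computation. The following is a minimal sketch of that cleanup, together with a GC fraction calculation of the kind that could underlie the GC content field. The helper names are hypothetical, and how the database itself computes or rounds GC content is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a short example.
example = "ATGAATTTAA AACCTTTAGC"
seq = clean_sequence(example)
print(len(seq))                       # 20
print(round(gc_fraction(seq) * 100))  # 25
```

Applying the same two functions to the full gene sequence field would give the length and GC percentage for the whole gene.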