Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sputcn32_1979 |
Symbol | |
ID | 5078427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | - |
Start bp | 2265107 |
End bp | 2267098 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640499138 |
Product | sulfatase |
Protein accession | YP_001183499 |
Protein GI | 146293075 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGCG GAATTTTCAA ACATTTGTTT ACGGGCCACG TCTGGGGACC CTACGTACAT TTGGTGCGGA TGATTTTTAT TGGCATGTTT GTGCTTAGTA TGAGTCGGCT CTCACTAATA CTCTGGTTAT ACGACCGGGT GGAACCTACC AAAATGTTGG GGAAGATTTT ACTCCAAGGT GTGAGAGCTG ATTTTATCTT GATGTGTCTT CTGGCTGCGA TTCCGGTATT GCTGTCCATT GTTTTGGTGT GGTCACCGCT GAAAAGAGTA TGGTTTCGTC TCACATTTAT ATGGAGCTTA CTAGCGCTGA TATTACTGAC CTTTCTCGAA CTGTCGACAC CGTCATTTGT GTTGCAATAT GACATCAGGC CCAATCGCCT GTACATTGAA TATCTGAAAT ATCCAAAAGA AGTCTTTGCG ACGCTTTGGA ATGGATTTAG GTTGCCGTTG TTGGTGGGAG TGTCACTGAC AGTGGTATCC GCATTAATAT TTAGGAGACA ATTGCAAGTC TCTGCAGATC AACACCGCCT ATGGCCAATC AAAACCCACT TATTGGGTTG GCTACTTGCT GTAATGGTTG TTGTTGCGGG GATCCGTTCG ACAACGCAAC ACCGGCCCGC AAACCCTGCT ATGTTTGCAA TTACTGCCGA TGCCATGGTG AATTCGTTAG TCATCAACTC TGGGTATTCT GTGCTGTATG CCTTGTATAG TCTCAAGCAC GAGGCGCGCA GTACTGAGAT TTACGGCAAA CTGTCTGAAG CTGACATGAT AGTTCAGACT CGTGATTGGC CATGGCTCAA AGACTATGAA TATCCAAACC CTGAATATCC AACACTTCAT TGGCAAACCG CAAAGGTCCA ACGTCAAAAG TCATTAAATA TCGTTATTGT TCTGCAGGAA AGCTTGGGGG CGACTTTTGT ACAGTCGCTA GGAGGTGCGC CGGTCACTCC TCAGCTTGAA CAATTGAAAA CTCAAGGGAT CTGGTTTAAC AATTTATATG CAACAGGCAC CCGGTCAGTT CGGGGTATTG AAGCCGTTTT GGCAGGCTTT ATGCCAACGC CTGCACAGAG TGTTGTGAAG TTATCCAACA GCCAACAAGG TTTCAGCACG CTTGCATCTG TACTTCAGCA ACAAGGTTAC CATACTCAGT TTGTATATGG TGGTGAATCT CATTTTGACA ACATGCGTAG TTTCTTTACT GGTAACGGTT TTAATGATGT CGTTGACTTG CCGAGAATTA AGGCTCCCAA GTTCGTTGGC AGTTGGGGGG CCAGTGACGA AGACTTGTTT GATACTGCAC ATCAGCAGCT CATTCAAATG CACCAGACCG GTAAACCATT TTTCAGCCTT ATCTTTACAT CAACCAATCA CGAACCTTTT GAGTATCCTG ATGGCAGGAT TGAGCTATAC GAGCAACCCA AATCGACTGC CTATAATGCT GTCAAATACG CCGACTGGGC TATGGGAGAG TTTTTCCGTA AGGCAAAAGG CAGTGCTTAC TGGCAGGACA CCATCTTTCT TGTCGTTGCT GATCATGATA ACAGGGTGTA TGGCAGTAAC TTAATTCCGG TTGAGAAATT TCAGATCCCC GGTTTGATTC TGGGAGGTTC AGTTAGACCT GCCACGATTG AACCACTAGC CAGCCAAATA GATTTAGCGC CGACATTGTT AAGCATTGCA GGCGTATCTT CTTGTCACCC ATTTGATGGC CGTGATTTCA TCGCAGATCC GATAAGTCCG GGACGTGCGA TGATGCAATT TGATAATCTG TTTGCTCTGA TGACGGAGCA GGAATTAACG ATTTTGCGAC CGAACGATAC ACCTGTAGGT GCCGACTACG ACAGAATAAA TCGCATTTTG ACATTGAAAC AGGGGGATGT CGCTGAAGCT TCTCGGCAAA AAGCTCTTGC CCATGTACAA CTACCGTCCT TCTTATATAG AGAACGGAAG TACAGCAATA AAGCCAAGTG TCAGACATCG CATCAACAGT GA
|
Protein sequence | MQRGIFKHLF TGHVWGPYVH LVRMIFIGMF VLSMSRLSLI LWLYDRVEPT KMLGKILLQG VRADFILMCL LAAIPVLLSI VLVWSPLKRV WFRLTFIWSL LALILLTFLE LSTPSFVLQY DIRPNRLYIE YLKYPKEVFA TLWNGFRLPL LVGVSLTVVS ALIFRRQLQV SADQHRLWPI KTHLLGWLLA VMVVVAGIRS TTQHRPANPA MFAITADAMV NSLVINSGYS VLYALYSLKH EARSTEIYGK LSEADMIVQT RDWPWLKDYE YPNPEYPTLH WQTAKVQRQK SLNIVIVLQE SLGATFVQSL GGAPVTPQLE QLKTQGIWFN NLYATGTRSV RGIEAVLAGF MPTPAQSVVK LSNSQQGFST LASVLQQQGY HTQFVYGGES HFDNMRSFFT GNGFNDVVDL PRIKAPKFVG SWGASDEDLF DTAHQQLIQM HQTGKPFFSL IFTSTNHEPF EYPDGRIELY EQPKSTAYNA VKYADWAMGE FFRKAKGSAY WQDTIFLVVA DHDNRVYGSN LIPVEKFQIP GLILGGSVRP ATIEPLASQI DLAPTLLSIA GVSSCHPFDG RDFIADPISP GRAMMQFDNL FALMTEQELT ILRPNDTPVG ADYDRINRIL TLKQGDVAEA SRQKALAHVQ LPSFLYRERK YSNKAKCQTS HQQ
|
| |