Gene Information |
Locus tag | BAS2737 |
Symbol | |
ID | 2852778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | - |
Start bp | 2711593 |
End bp | 2713566 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637505982 |
Product | sulfatase |
Protein accession | YP_028995 |
Protein GI | 49185743 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
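The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag BAS2737.
length = gene_length_bp(2711593, 2713566)
print(length, protein_length_aa(length))  # prints: 1974 657
```

Under these assumptions the computed values agree with the Gene Length (1974 bp) and Protein Length (657 aa) fields.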
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000322483 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACAAT TCTTATTAAA GAGCAAAAGT GTGCTAAGCA ATCATTTTGG ATTCTTTCTG TTTGCCGTTA TTTTATTTTG GCTCAAAACA TATGCGGCTT ATGTAACAGA ATTTAATTTA GGTATTTCAA ATACAATTCA AAAATTCTTG CTGTTTTTCA ACCCGCTTAG TTCAGCAGTG TTATTTTTAG GACTTGCATT ATTTGCAAAA GGGAAACGAT CTTATATTTG GTTAATTGTT ATCAACTTGT TATTGTCGAT TCTTTTATAT GCAAACGTCG TATACTATCG CTTTTTCAGT GACTTTATTA CGTTCCCGAC GTTGACACAA ACGAATAACT TTGGAGATTT AGGTGGTAGT ATTGTTGCGT TGCTACATCT TTATGATCCG CTATACTTCT TAGATACAAT TATTTTAATT GTGTTAGTTG CAACGAAATT TGCAAATCCA AAACCAATTC GTGTTGCGAA ATATAAAGTA TCACTAGTAT TTGTAGCAGG TATTTTATTA TTCAGTGTTA ACTTAGGACT TGCAGAATCT GACCGTCCTG AATTATTAAC AAGAACGTTT GATCGTAATT ATATTGTGAA ATATTTAGGG GCATATAACT ATACGATTTA TGATGGTATT CAAAGTGCGA AAGCATCAAC GGAAAGAGCG TTAGCTGATG GAGATAATAT GACGGAAGTA AGAAATTATT TAACATCAAC TTACGCAAGT CCAAATCCTG AGTATTTCGG TAAAGGTAAG GGAATGAACG TAATTTATAT TCATTTAGAG TCATTCCAAA ACTTCTTAAT TGATTATAAA TTAAACGGTC AAGAAGTTAC GCCGTTCTTA AACTCATTTA CAAAAGATGC GAATACGCTT TACTTTGATA ACTTCTTCCA TCAAACAGGA CAAGGGAAAA CGTCTGATGC GGAGTTTATG TTAGAGAATT CTATGTTTGG TTTACCACAA GGATCTGTAT TTACAACGAA ATCTCATAAC ACGTATCAAT CAGCACCAGC AATTTTAGGA CAACAAGGAT ACACATCAGC TGTATTCCAT GGTAACTATA AAACATTTTG GAACCGTGAT GATATTTATA AATCATTTGG TTTTAATAAA TTCTTTGATG CTTCATACTA TGATATGAAC GAAAAAGATG TAGTAAACTA TGGATTAAAA GATAAACCAT TCTTTAATGA ATCCATTCCG TTATTAGAAA CATTGAAACA ACCGTTCTAT ACGAAGTTTA TTACGTTATC GAATCACTTC CCGTATCCAA TTGATAAAGA GGAAGCGACA ATTGAACCAG CGAACACAGG TGACTCATCT GTAGATACGT ATTTCCAAAC AGCACGTTAT TTAGACGAAT CTGTAAAAGG TTTCATCGAT TACTTGAAGC AATCTGGTTT ATATGATAAT TCTATTATCG TTATGTACGG AGACCATTAC GGTATTTCAG ATAATCATAA CGCAGCAATG TCAAAAGTAA TGGGTAAAGA AATTAACTCG TTTGAAAATG CACAGTTACA GCGTGTACCT TTAATCGTTC GTGTACCAGG TGTGAAAGGT GGCGTACAAC ATCAATACGG TGGTGAAATT GATGTTCTTC CAACTCTTTT ACACTTACTA GGAACAGATA CAAAAAATTA TGTTCAATTT GGTTCAGATT TATTATCACC AGATCATAAA CAAGTTGTTG CGTTCCGTAA CGGCAACTTC GTAAGCCCAA CAGTTACTGC ACTAAACGGC AAATATTATG ATACAACAAC TGGAAAACCT GTAGAATTTA CAGATGAAAT AAAACAAAAT 
GAACAAATGG TTCAAAACTC GTTAAAATAC TCTGACCAAG TCGTAAATGG TGACTTATTA CGATTCTACA CACCGGAAGG ATTTACACCG ATAGATCGTT CGAAGTATAA CTATAACAAT CGTGATAAAA ACAAAACGAA AGTAAAAACG GCTCCGGAAG GGGAAGCTAA ATAA
|
Protein sequence | MKQFLLKSKS VLSNHFGFFL FAVILFWLKT YAAYVTEFNL GISNTIQKFL LFFNPLSSAV LFLGLALFAK GKRSYIWLIV INLLLSILLY ANVVYYRFFS DFITFPTLTQ TNNFGDLGGS IVALLHLYDP LYFLDTIILI VLVATKFANP KPIRVAKYKV SLVFVAGILL FSVNLGLAES DRPELLTRTF DRNYIVKYLG AYNYTIYDGI QSAKASTERA LADGDNMTEV RNYLTSTYAS PNPEYFGKGK GMNVIYIHLE SFQNFLIDYK LNGQEVTPFL NSFTKDANTL YFDNFFHQTG QGKTSDAEFM LENSMFGLPQ GSVFTTKSHN TYQSAPAILG QQGYTSAVFH GNYKTFWNRD DIYKSFGFNK FFDASYYDMN EKDVVNYGLK DKPFFNESIP LLETLKQPFY TKFITLSNHF PYPIDKEEAT IEPANTGDSS VDTYFQTARY LDESVKGFID YLKQSGLYDN SIIVMYGDHY GISDNHNAAM SKVMGKEINS FENAQLQRVP LIVRVPGVKG GVQHQYGGEI DVLPTLLHLL GTDTKNYVQF GSDLLSPDHK QVVAFRNGNF VSPTVTALNG KYYDTTTGKP VEFTDEIKQN EQMVQNSLKY SDQVVNGDLL RFYTPEGFTP IDRSKYNYNN RDKNKTKVKT APEGEAK
|
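The sequences in this record are displayed in space-separated blocks of ten characters and may wrap across lines. The sketch below shows one way to normalize such a string before analysis and to compute a GC percentage from it. The helper names are hypothetical, and this is not necessarily how the database derived its GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three display blocks of the gene sequence in this record.
raw = "ATGAAACAAT TCTTATTAAA GAGCAAAAGT"
seq = clean_sequence(raw)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` first also makes it straightforward to confirm that its length matches the Gene Length field.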