Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_2100 |
Symbol | |
ID | 8597565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | + |
Start bp | 2227270 |
End bp | 2228712 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003308885 |
Protein GI | 269120708 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAA ACATAATATT AATAATGGTA GATCAGATGA GGGGAGACTG TTTGGGGATA AACGGACATC CGGTGGTAGA GACGCCAAAT CTGGATATGA TGGCAGGCGA AGGATATAAT TTTAAGAATG CCTATTCGGC AGTACCAAGC TGTATTGCAG CAAGAGCAGC ACTTATGACA GGAATGAATC AGAGGAATCA TGGAAGAGTG GGCTATAAAA ACAATGTAAC GTGGAATTAT AAAAATATGC TTGCAGAAAC TTTTGCTAAA AATGATTACT ATACACAATG TGTGGGAAAA ATGCATGTAC ATCCTGAAAG AAGCCTGTGC GGTTTTCACA ATATTCTTCT GCATAACGGA TACTCAAATA ACAGCAGAAA CAGCAGAAAA ACATATGAAT CAGTATTTTA TAATGTAGAT GATTATTTAT ACTGGCTTAA AGAGAAGAAG GGAATTTCGG CAGAGCTTAC AGACAGCGGA CTTGACTGTA ATTCATGGGT AGCAAGATCC TGGCCTCATG AAGAACAGTA TCATCCTACT AACTGGGTAG TCAATGAAGG AATAAATTTT CTGAGAAGAA GGGATAAAAG AAAAAATTTC TTTTTGAAGC TGTCCTTTAT CAGACCGCAT TCACCGCTTG ATCCGCCTGA ATATTACTAT AATATGTATA TTAACAGAGA GATTGATAAT CCGATACCTG CAGAAGAAGA GAATATAAAG GAAGCTTATA ATATCAATGC AGCAGAGGGA CAAATATCAA AAGAAGCAAT GAAAAGAGCA AAGGTAGCGT ATTATGGAAG CATAACACAT ATTGACCATC AGATAGGACG ATTTTTGATG GTGTTGAAAG AAAATGACCT GCTAAAAGAA AGTATAGTGC TCTTTGTTTC AGATCACGGA GATTTGATGG GAGATCACGG TTTATTCAGA AAATCCATGC CGTATCAGGG GAGCATACAT GTTCCGTTTA TAGTTTATGA TCCGGGGAAT TTTCTTAACG GCGGAGTGAT GAGAGAACCG GATGAGCTCG TAGAGCTGAG AGATATCATG CCGTCACTGC TGGATTTCTG TAATATTGAA ATTCCTGATA CTGTAGACGG AAAAAGCATA AAAGAAATAA TAGAAAATAA GCCGGTAAAA TGGCGTGAAT ATCTGCACGG CGAGCATTTT AACCATGAAA AATCAAATCA GTATATAGTT GATAAAAAAA TGAAATATAT GTGGTTTTCT CAGACAGGAG CGGAAAAGCT TTTTGATCTT GAAAATGATC CGAAGGAGCT GAATGATCTC TCAGAAAAAG CGGAATATAC AGATGTAATA GAAAAATACA GAAAAATTCT GGTTAAGGAA TTGGAAGGCA GGGAAGAAGG ATATTCGGAC GGTATAAATC TTATTGCAGG AAAAGAAGCC AGAGAGTGCC TGAGCCATAT TCTGAATGAG TGA
|
Protein sequence | MKPNIILIMV DQMRGDCLGI NGHPVVETPN LDMMAGEGYN FKNAYSAVPS CIAARAALMT GMNQRNHGRV GYKNNVTWNY KNMLAETFAK NDYYTQCVGK MHVHPERSLC GFHNILLHNG YSNNSRNSRK TYESVFYNVD DYLYWLKEKK GISAELTDSG LDCNSWVARS WPHEEQYHPT NWVVNEGINF LRRRDKRKNF FLKLSFIRPH SPLDPPEYYY NMYINREIDN PIPAEEENIK EAYNINAAEG QISKEAMKRA KVAYYGSITH IDHQIGRFLM VLKENDLLKE SIVLFVSDHG DLMGDHGLFR KSMPYQGSIH VPFIVYDPGN FLNGGVMREP DELVELRDIM PSLLDFCNIE IPDTVDGKSI KEIIENKPVK WREYLHGEHF NHEKSNQYIV DKKMKYMWFS QTGAEKLFDL ENDPKELNDL SEKAEYTDVI EKYRKILVKE LEGREEGYSD GINLIAGKEA RECLSHILNE
|
| |