Gene Sterm_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_2100 
Symbol 
ID8597565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp2227270 
End bp2228712 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content38% 
IMG OID 
Productsulfatase 
Protein accessionYP_003308885 
Protein GI269120708 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAA ACATAATATT AATAATGGTA GATCAGATGA GGGGAGACTG TTTGGGGATA 
AACGGACATC CGGTGGTAGA GACGCCAAAT CTGGATATGA TGGCAGGCGA AGGATATAAT
TTTAAGAATG CCTATTCGGC AGTACCAAGC TGTATTGCAG CAAGAGCAGC ACTTATGACA
GGAATGAATC AGAGGAATCA TGGAAGAGTG GGCTATAAAA ACAATGTAAC GTGGAATTAT
AAAAATATGC TTGCAGAAAC TTTTGCTAAA AATGATTACT ATACACAATG TGTGGGAAAA
ATGCATGTAC ATCCTGAAAG AAGCCTGTGC GGTTTTCACA ATATTCTTCT GCATAACGGA
TACTCAAATA ACAGCAGAAA CAGCAGAAAA ACATATGAAT CAGTATTTTA TAATGTAGAT
GATTATTTAT ACTGGCTTAA AGAGAAGAAG GGAATTTCGG CAGAGCTTAC AGACAGCGGA
CTTGACTGTA ATTCATGGGT AGCAAGATCC TGGCCTCATG AAGAACAGTA TCATCCTACT
AACTGGGTAG TCAATGAAGG AATAAATTTT CTGAGAAGAA GGGATAAAAG AAAAAATTTC
TTTTTGAAGC TGTCCTTTAT CAGACCGCAT TCACCGCTTG ATCCGCCTGA ATATTACTAT
AATATGTATA TTAACAGAGA GATTGATAAT CCGATACCTG CAGAAGAAGA GAATATAAAG
GAAGCTTATA ATATCAATGC AGCAGAGGGA CAAATATCAA AAGAAGCAAT GAAAAGAGCA
AAGGTAGCGT ATTATGGAAG CATAACACAT ATTGACCATC AGATAGGACG ATTTTTGATG
GTGTTGAAAG AAAATGACCT GCTAAAAGAA AGTATAGTGC TCTTTGTTTC AGATCACGGA
GATTTGATGG GAGATCACGG TTTATTCAGA AAATCCATGC CGTATCAGGG GAGCATACAT
GTTCCGTTTA TAGTTTATGA TCCGGGGAAT TTTCTTAACG GCGGAGTGAT GAGAGAACCG
GATGAGCTCG TAGAGCTGAG AGATATCATG CCGTCACTGC TGGATTTCTG TAATATTGAA
ATTCCTGATA CTGTAGACGG AAAAAGCATA AAAGAAATAA TAGAAAATAA GCCGGTAAAA
TGGCGTGAAT ATCTGCACGG CGAGCATTTT AACCATGAAA AATCAAATCA GTATATAGTT
GATAAAAAAA TGAAATATAT GTGGTTTTCT CAGACAGGAG CGGAAAAGCT TTTTGATCTT
GAAAATGATC CGAAGGAGCT GAATGATCTC TCAGAAAAAG CGGAATATAC AGATGTAATA
GAAAAATACA GAAAAATTCT GGTTAAGGAA TTGGAAGGCA GGGAAGAAGG ATATTCGGAC
GGTATAAATC TTATTGCAGG AAAAGAAGCC AGAGAGTGCC TGAGCCATAT TCTGAATGAG
TGA
 
Protein sequence
MKPNIILIMV DQMRGDCLGI NGHPVVETPN LDMMAGEGYN FKNAYSAVPS CIAARAALMT 
GMNQRNHGRV GYKNNVTWNY KNMLAETFAK NDYYTQCVGK MHVHPERSLC GFHNILLHNG
YSNNSRNSRK TYESVFYNVD DYLYWLKEKK GISAELTDSG LDCNSWVARS WPHEEQYHPT
NWVVNEGINF LRRRDKRKNF FLKLSFIRPH SPLDPPEYYY NMYINREIDN PIPAEEENIK
EAYNINAAEG QISKEAMKRA KVAYYGSITH IDHQIGRFLM VLKENDLLKE SIVLFVSDHG
DLMGDHGLFR KSMPYQGSIH VPFIVYDPGN FLNGGVMREP DELVELRDIM PSLLDFCNIE
IPDTVDGKSI KEIIENKPVK WREYLHGEHF NHEKSNQYIV DKKMKYMWFS QTGAEKLFDL
ENDPKELNDL SEKAEYTDVI EKYRKILVKE LEGREEGYSD GINLIAGKEA RECLSHILNE