Gene Sterm_1723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_1723 
Symbol 
ID8597192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp1841819 
End bp1843183 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content37% 
IMG OID 
Productsulfatase 
Protein accessionYP_003308512 
Protein GI269120335 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTT TATACATACA CACACATGAT TCCGGAAGAT TTTTGAAACC GTACGGATAT 
AACGTGCCGA CTGACTATTT ATTGGAATTT GCCAAGGATG CCGTTGTTTT CAGAAAGGCA
TTCTGCGGGG CACCGACATG TTCGCCCAGC CGGTCAGTCC TGCTGACAGG AATGTATGCA
CATAATAACG GTATGCTGGG GCTTGCTCAC AGAGGTTTTA AAATAAATGA TTACAGCAAA
CACCTTGCAA GCTACCTGAA AAATTATGAT TATGAAACTG TTTTATCCGG TGTACAGCAT
GAGGCAGATT CTTGGCTGAA TTATGATAAA GCTGCAAAGG TAATAGGCTA CAGCTGTGAT
ATTACTACTG TGCCTGAAAA AGACAATGAA GAAGAGCTTG TTTACTGGGA CAGAAATAAT
GCTGCCGAAA CAGCAGAATA CTTTAAGAAA GCTGCTAAAA CCGATAAGAA ATTTTTTATG
TCATTTGGTA TGTTCAGTAC TCACAGAAAA TATCCCGTCA TTCCGGAAAA TAATACTGAT
CCTGATTATG TAGAGTTGCC GCCGAGAACT TATGATAATG AAAATAACAG GGCGGATACT
GCCAGATACA TGGATTCGGC AAGGATGGCA GATGATTGCA TAAAAACTGT AATAGAGGCA
TTAAAAGATG CAGGACTTTA TGAAAAGACA ATAATAATCT TTACTACCGA TCATGGTGTA
GCCAATCCGT TTGACAAATG TTTTTTGAAT GACAGCGGAA TAGGAGTAGC TTTGATAATC
AGAGATCCGA ATCAGAAGAA ACAGGGAAGA GCAATAGATG CTATGGTTTC GCATATTGAT
ATTTTCCCCA CACTGTGTGA GCTGACAGGA GTGGAGAAGC CGGAGTGGCT TCAGGGGAAA
TCACTGGTTC CCCTTCTTTA TGAAAATAAA AAGGTAAGGG AAGAGATATA TGCAGAGATT
AATTATCACA CATCATATGA ACCTGCAAGA TGTGTAAGGA ATGAAAGATA TAAATATATA
AAGTATTTTG ATAAGACATA TGACAAATAT AACTATTCCA ATATGGATGA TTCCGAAGTA
AAAGGATTTC TCATGAAAAA TGGTCTTTTG GATATGAAAA AAGAAATGGA AATACTTTAT
GATCTGTACT TTGATCCGGG TGAAAGCAAT AATGTAGCAG GAAAAGCTGA ATACAGCGAA
ATTCTTGAAG AAATGAGAAT AAAGCTTCAA AAGTGGCAGA AGCAAACTGA TGATCCTGTG
CTGGAAGGAA GAATAAAAGC ACCTGAAGGA GCAAAAATAA ATAATAAAGA GTGTATGTCA
GCTGGTTCTA AAAATAAAAA TGACTACGAA AAGTTTCCGG ATTAA
 
Protein sequence
MNILYIHTHD SGRFLKPYGY NVPTDYLLEF AKDAVVFRKA FCGAPTCSPS RSVLLTGMYA 
HNNGMLGLAH RGFKINDYSK HLASYLKNYD YETVLSGVQH EADSWLNYDK AAKVIGYSCD
ITTVPEKDNE EELVYWDRNN AAETAEYFKK AAKTDKKFFM SFGMFSTHRK YPVIPENNTD
PDYVELPPRT YDNENNRADT ARYMDSARMA DDCIKTVIEA LKDAGLYEKT IIIFTTDHGV
ANPFDKCFLN DSGIGVALII RDPNQKKQGR AIDAMVSHID IFPTLCELTG VEKPEWLQGK
SLVPLLYENK KVREEIYAEI NYHTSYEPAR CVRNERYKYI KYFDKTYDKY NYSNMDDSEV
KGFLMKNGLL DMKKEMEILY DLYFDPGESN NVAGKAEYSE ILEEMRIKLQ KWQKQTDDPV
LEGRIKAPEG AKINNKECMS AGSKNKNDYE KFPD