Gene Sterm_2095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_2095 
Symbol 
ID8597560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp2222860 
End bp2224644 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content36% 
IMG OID 
Productsulfatase 
Protein accessionYP_003308880 
Protein GI269120703 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTG TAATGTTAAT GTTTGATACA TTAAACAGGA GAAGTCTGTC AGCTTACGGG 
AATAAATGGA TAAAAACACC TAATTTTGAC AGGCTGGCAG AAAAAACAGT AATGTTCAAT
AATTTTTTTT CGGGAAGTCT GCCGTGTATG CCTGCAAGGA GGGAACTGCA TACAGGAAGA
TACAATTTTT TACACAGAAG CTGGGGACCT ATGGAGCCGT TTGATTTTTC CATGCCGGAA
ACATTAAAAA ATAACGGGAT ATATACACAT CTGGTGACAG ATCATTCACA TTATTTTGAA
GACGGAGGTG CTACCTATCA TAACAGATAT AATACCTGGG AAGGCTTTCG GGGGCAGGAG
GGCGACCGCT GGAAAGGAAA GATAGGGGAT ATTGATATTC CGGAACAGAT AGAAACCGGA
AAAAAAGGAA TATCTTTTAA GCAAAACTGG ATAAACAGAA ATTATCAGAA AAATGAAGAA
GAATTTTCGG GAACAAAGGT AATAAATGCA GGAATAGAAT TTATTACAGA GAACATAAAT
GAAGATAAGT GGTTTTTGCA GATAGAATGC TTTGATCCTC ATGAGCCGTT TTATTCACCG
GAAAAATATA AAGAGCTGTA TAAGCATGAA TATAACGGAA AATTTTTTGA CTGGCCTTCA
TATAAACCGG TAACTGAAAG TGAAGAAGAA ATAGAGCATC TTAATTATGA GTATGCTGCA
CTGCTCAGCA TGTGTGATGC ACAGCTTGGC AAGGTTCTGG ATACTATGGA TAAATACAAT
ATGTGGAAAG ATACAATGCT GATAGTAAAT ACAGATCACG GGTTCTTACT TGGAGAGCAT
GGCTGGCTGG GAAAAAATAT GGAGCCGGTA TATAACGAAG TAGCGCATAT TCCGTTTTTT
ATCTGGGATC CGAGATTTGA AATAAAAAAT GAAACAAGAA ATTCACTGGC TCAGACAATA
GATCTTCCGG CAACAATATT AGAATATTTT AATGTAGAAC TTCCTGAAAC AATGCAGGGA
AAACCGCTGA GAAAGGCTAT AGAAAAAAAG GAGGATATCA GAAAAGCAGG TTTGTTCGGC
ATATACGGTG GGCATATAAA TGTAGTAAAT AATGAGTATA TTTATATGAG AGCTCCGATA
TGTCCTGAAA ACACTCCTTT GTATGAATAT ACCCTAATGC CGGCAAAAAT GAGAGGATTT
TTCAGCAAGA AACAGCTGGA AAATACAGAA TTAGTGAATG GATTTAAGTT TACAAACGGA
ATAAGCGTTC TAAAAACCTT CGGAGAACTG GAATCCTCGC TTTACAGATT TGGAAATAAA
TTATTTCACA GAAAAAATGA TCCGCTTCAG GAAAAAAATC TGGATAACAT AGAGGCAGAG
GAAAAGCTGA CAGAAATAAT GCGGGAGCTG ATGCTCGAAT CAGAAGCTCC GGATGAGCAG
TATGAGAGAA TAGGAATCTA TAAAGACAGA AAAATTACCG CAGAAGAATT AACGGTTCAG
AAAGAAGCAC GAATAAAACG TGAAAAATCA GGTATAAATG AAAATATAAT TATTTCTGAT
AAAGTTCTTG CCCAGATAAA CATAATCAAA GGAATTATAA GAAATAAAGA AGACAGGAAA
TATTTCCTGA AGGAAATAAA CAGCATGTAT GAAGAAAAAA AAGTGATGGA ACTGAAAGAG
GAGGATATAT TAAAGATTGC AGACAGTGTA ACCGGGAAGC TGAATCTCGG GGATAAAAAG
AAAGTTTTAA TGGATAGTAT AAAATATGCC GATGTAAAAG AATAA
 
Protein sequence
MKVVMLMFDT LNRRSLSAYG NKWIKTPNFD RLAEKTVMFN NFFSGSLPCM PARRELHTGR 
YNFLHRSWGP MEPFDFSMPE TLKNNGIYTH LVTDHSHYFE DGGATYHNRY NTWEGFRGQE
GDRWKGKIGD IDIPEQIETG KKGISFKQNW INRNYQKNEE EFSGTKVINA GIEFITENIN
EDKWFLQIEC FDPHEPFYSP EKYKELYKHE YNGKFFDWPS YKPVTESEEE IEHLNYEYAA
LLSMCDAQLG KVLDTMDKYN MWKDTMLIVN TDHGFLLGEH GWLGKNMEPV YNEVAHIPFF
IWDPRFEIKN ETRNSLAQTI DLPATILEYF NVELPETMQG KPLRKAIEKK EDIRKAGLFG
IYGGHINVVN NEYIYMRAPI CPENTPLYEY TLMPAKMRGF FSKKQLENTE LVNGFKFTNG
ISVLKTFGEL ESSLYRFGNK LFHRKNDPLQ EKNLDNIEAE EKLTEIMREL MLESEAPDEQ
YERIGIYKDR KITAEELTVQ KEARIKREKS GINENIIISD KVLAQINIIK GIIRNKEDRK
YFLKEINSMY EEKKVMELKE EDILKIADSV TGKLNLGDKK KVLMDSIKYA DVKE