Gene Sterm_0130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_0130 
Symbol 
ID8595626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp136661 
End bp138166 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content37% 
IMG OID 
Productsulfatase 
Protein accessionYP_003306946 
Protein GI269118769 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCAA TAATCATATT TTTTGACAGT CTGAACAGGC ATTTTCTTCC GAATTACGGT 
AATGACTGGG TTAATGCCCC TAATTTCAAA AGACTGGATG AAAAAACACT TACTTTTGAC
AGAAGTTATG TAGGGAGTAT GCCCTGTATG CCTGCAAGAA GAGAACTTCA TACCGGAAGA
CATAATTTTC TCCACAGAGA ATGGGGACCT TTGGAGCCTT TCGATGATTC AATGCCGGAA
ATTCTGAAAA AAGCCGGAGT TTATACTCAT ATGATAACAG ACCATTTTCA TTACTGGGAA
GACGGCGGTG CCACTTTCCA TAACAGATTT TCCTCATATG AAATGATCAG GGGACAGGAA
GGGGACCACT GGAAAGGTGA AGCAGAATAC AAAGAAGATA AAGAATTTTT AAGCATTCCC
GAACCGCACA GCGGCAGCGG AAAAGTTTCT TCCTTATGGA GATATGACAG GATTAACAGA
AAATATATGG ATACAGAGGA AAAACAGCCC CAAAGTAAAG TTTTTTCTCT TAGCTGTGAA
TTTATAGAAA AAAACAGCTC CTATAATAAC TGGCTTCTGC ATATAGAAAC ATTCGATCCC
CATGAACCGT TTTTTGTAAA AGACAAATAT CTGGAACAGT ACAAAGATAC TTACAGCGGA
CCTGAATTTG ACTGGCCCAG AGGTGAGGTC AAAGAATCTC CGGAAGCTGT GGAACATATA
AGAAAAAAAT ATGCTGCTCT GGTTTCCATG TGTGATAAAA ATCTGGGAAT GATTCTTGAT
TTAATGGATA AACACAATAT GTGGGAAGAT ACTATGCTGA TCGTGGGAAC AGACCATGGT
TTTCTTCTCG GAGAGCACGG ATGGTGGGGA AAAAATCTTA TGCCGTATTA TAATGAAATA
GCAAATACAC CTTTATTTAT ATGGGACCCA AGGTCAAAAA AGAAAAATGA AAGAAGAAAT
GCTATTGTGC AGATGATAGA CTGGGCCCCG ACATTGCTTG ATTACTTTGA TGTTGCCATT
CCGGAAACAA TGAAAGGAAA ATCTCTGAAA GAGACTATAG AAACCGATGT TCCTGTCCGC
AAGGAATGTA TTTACGGAGT TCACGGAGGA CATGTAAATA TGTATGACGG AAATTATACC
TATATGAGAG CACCTGCATT CAAAGAAAAC AAACCGCTCT ATAATTATAC ACTTATGCCT
ATGCATATGA ATAAGCTTTT TAGTGTTGAT GAAATAAAAG ATGCCGAGCT TTCAGAGCCG
GTAAATTATT CCAAGAATGT TCCGGTATTA AAATTTCGTG CGGAAGATAA ATATAAAATA
TATAAATATG GTACTTTAAT ATTCGATATC AATAATGATC CGAAACAGCT TTACCCTGTA
AAAGATAAAG CTCTGGAACA GAACCTTACG GAAAAACTGA TTAAAAATAT GGAATTTCAC
GAGTCGCCAA AGGATCAATA TACAAGACTC GGACTAAATA TGCCTAAGGA GAAGAAAAAT
GTATAA
 
Protein sequence
MKAIIIFFDS LNRHFLPNYG NDWVNAPNFK RLDEKTLTFD RSYVGSMPCM PARRELHTGR 
HNFLHREWGP LEPFDDSMPE ILKKAGVYTH MITDHFHYWE DGGATFHNRF SSYEMIRGQE
GDHWKGEAEY KEDKEFLSIP EPHSGSGKVS SLWRYDRINR KYMDTEEKQP QSKVFSLSCE
FIEKNSSYNN WLLHIETFDP HEPFFVKDKY LEQYKDTYSG PEFDWPRGEV KESPEAVEHI
RKKYAALVSM CDKNLGMILD LMDKHNMWED TMLIVGTDHG FLLGEHGWWG KNLMPYYNEI
ANTPLFIWDP RSKKKNERRN AIVQMIDWAP TLLDYFDVAI PETMKGKSLK ETIETDVPVR
KECIYGVHGG HVNMYDGNYT YMRAPAFKEN KPLYNYTLMP MHMNKLFSVD EIKDAELSEP
VNYSKNVPVL KFRAEDKYKI YKYGTLIFDI NNDPKQLYPV KDKALEQNLT EKLIKNMEFH
ESPKDQYTRL GLNMPKEKKN V