Gene Sterm_3201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3201 
Symbol 
ID8598654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp3353488 
End bp3354900 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content36% 
IMG OID 
ProductBeta-glucosidase 
Protein accessionYP_003309973 
Protein GI269121796 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.281249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAGA AAATAGTATT TCCTGAGGAT TTTTGGTGGG GCTCAGCATG GTCCGCAGAA 
CAGTCGGAAG GAACAGGGGA TACCGGAAAA GCAGAAACTA TATGGAAACG CTGGTTTAGC
GAAGAACCAA ACAGGTTTTA TGACAGAATA GGACCAGATG TAACTACTGA TCATTTTAAC
CGTTACAAGG ATGACATAAG ACTTATGAAA GAAACAGGGC ATAATTCTTT CAGACTTTCC
TTATCATGGG CACGTTTATT TCCTGACGGA GGAAGAGGAG AAATAAACAG AAAAGCTGTT
GATTTTTACA GGGACTTAAT GAGTGAAATG ATAAAAAATG ATATAAAGCC CTTTATAAAT
CTATATCACT TTGATATGCC GATGAAATTA CAGGAGCTGG GAGGCTGGGC ATCACGTGAA
ACAGTGGAAG CTTATGTTAA TTATGCAGTT TCATGCTTTA AGGAATTTGG TGATCTTGGT
TATCACTGGT TTACATCCAA CGAACCTCTC GGTCCTATTT TAGGAACATA TCTCGAAGAC
TTTCATTATC CGAATTTTAT AGATTTTAAA GAAGGAGCAC AGGCAGCTTT TTACACAATA
CTGGCACATG CAAAGGCTAT AAAAGAATTT AAGAAGCTGA ATTTGAAGTC AAAAATAGGA
GTTATATTAA ACCTTAGTCC TACATATCCG AGAAGCAGCA ATAAATGGGA TACAGAAGCA
GCAGAAACCG CAGATGCATT TTATACCAGA AGCTTTCTTG ATCCTATGGT AAAAGGGACT
TTTAATAAAA AACTGGTATC TATACTGAAA GAATACGATC AGATGCCTGA TTATACAGAG
GAAGATCTGA AAATAATATC TGAAAATACT GCACAGATTC TGGGGCTTAA TTATTATGAG
CCGAGAAGGG TGAAAGCCAG ATTAACAGCT GTTAATAAAA ACAGTCCTTT TCTGCCTGAA
TGGTTTTTTG AACTTCATAA TATGCCCGGA AAGAGAATGA ATATATACAG AGGCTGGGAA
ATATATGAAA AAGGAATTTA TGATTTATGT ATGGATATAA AGGAAAACTA CGGCAATATA
GAATCATTTA TTTCTGAAAA CGGAATGGGA GTAGCAGATG AAGAGAGATT TCTCGGAGAA
AACGGACAGA TTATTGATGA ATACAGAATA AATTATATAA AAGACCATCT GGCATATCTG
TATAAAGCAG TAAATGAAGG ATGCAATATA AAGGGATACC ACCTGTGGAC ATTTATAGAC
TGCTGGTCAT GGATAAATGC ATATAAAAAC AGATACGGGC TCGTTTCGCT GGATCTTGCT
ACACAAAAAA GAACAATAAA AAAGAGCGGG GAATTTTTTA AAAAGATGAC GGAAGAAAAC
GGCTTTCTAT ATGATACTGA TAAGTTAGTA TAA
 
Protein sequence
MEKKIVFPED FWWGSAWSAE QSEGTGDTGK AETIWKRWFS EEPNRFYDRI GPDVTTDHFN 
RYKDDIRLMK ETGHNSFRLS LSWARLFPDG GRGEINRKAV DFYRDLMSEM IKNDIKPFIN
LYHFDMPMKL QELGGWASRE TVEAYVNYAV SCFKEFGDLG YHWFTSNEPL GPILGTYLED
FHYPNFIDFK EGAQAAFYTI LAHAKAIKEF KKLNLKSKIG VILNLSPTYP RSSNKWDTEA
AETADAFYTR SFLDPMVKGT FNKKLVSILK EYDQMPDYTE EDLKIISENT AQILGLNYYE
PRRVKARLTA VNKNSPFLPE WFFELHNMPG KRMNIYRGWE IYEKGIYDLC MDIKENYGNI
ESFISENGMG VADEERFLGE NGQIIDEYRI NYIKDHLAYL YKAVNEGCNI KGYHLWTFID
CWSWINAYKN RYGLVSLDLA TQKRTIKKSG EFFKKMTEEN GFLYDTDKLV