Gene Sterm_3852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3852 
Symbol 
ID8599298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4092719 
End bp4094329 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content36% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003310617 
Protein GI269122440 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000990708 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAAC TTTTGTTAGT AATTTTGAGT TTAGTTATTG TTGTTTCGTT AGTTTCTTGC 
GGAGGAGGTA AGTCTGGAGG AAATGCAGGG GATAAAGGTA CTGTCAGCTT CAATATAGAA
GTAGAGCCTA CATCACTGGA TCCACAGGTA CTTACTGATG AAGCAGGTCT TAACGTTGCA
CAGTTTTTAT ATGAAAGTCT TGTAAGACTA AACGAAAAGA GTGAAATAGT TCCTGCGGGA
GCTGAAAGAT GGGATATAAG TGAAGACGGG CTGAAGTGGA CTTTTTATAT CAGAAAAGAC
ATGAAGTGGT CGAACGGAGA TCCTGTTACA GCCAAAGATT ATTACAACGG TGTAAAAAGA
GGACTTGATC CTGAGCTTGC AGCAGAATAT GCTTATTTAA CATATTACAT AAAAAATGCG
CAGAGTTACA GTGAAAAGAA AATAACGGAT TTTGAACAGG TAGGGGTAAA AGTTATTGAT
GATTACACAC TGGAATTTGA ATTACAGGAT CCTACAGCTT ATTTCGGGAA ACTGCTTGTA
ATGCCTATAT TCTATCCTGT AAATGAAAAA GCTCTTGCAG AATTCGGGGA TCAGTATGCA
CTTGATCCTA AAAAATCTGT TTATTCAGGA CCGTATATAA TGACAGAATG GAGTCACGGA
AGTAAAGTAG TTCTGGAGAG AAACCCTAAT TACTGGACTA AAGATAAGTT TAAAATCGAA
AAGCTTATTG CGGTAATAAC TGCAGATTTA GATTCGGCAG CAAATTCTTA TGAAAACGGC
GAGCTTACTA TTACTAAAAT TTCTCCTGAA AAGCTAAAGG CTTATAAAGA CAAACCTGAA
TTAGTAAGTT ATTCAGACGG AAGAGTTTAC TATTTTTCAT TTAACCTGAA AAATGATATA
CTTAAAAATC AAAAAGTAAG ACAGGCTTTA TCACTTGCAA TAGACAGAGA CAAGCTGGTA
AATGAAGTGC TGGCAAACGG TTCTGAAAAA GGAAGCGGAA TAGTAGCCTC GGGAATGCCT
GGAATAAAGG ATGACTTTAG AAAAGAAAAC GGTGACTTAT ACGCACAGTA TAAAGATGAA
GATATAAAGA AACTTTTTGA AGAAGGACTT CAGGAATTAG GAAAAACTCC TGCTGATGTA
AAGCTGTCAC TTCTTATAGA CGAACAGGGA ACTGCAAAGA AAGAAGCAGA ATTCTATCAG
GCACAGTGGA GAGAAAAGTT AGGGCTTGAT GTTTCTGTTG ACCAAACTAC TAAAAAAGAC
AGAATAGCAA GATCAAGATC AGGAGACTAT GATATAGTAA GATACTCATG GGGACCTGAC
TTCGCAGATG CTATGACTTA TCTGGAATTG TTCTTCTCGA ATACTGAAAT GAATATTCCT
AGATATGTAA ATCCTGAGTA CGACGAGCTT TTATCAATCG GAAGAAAAAG CAATAATCAT
GATGAAAGAA CTGAAGCTAT GGAGAAAGCT GAAAAGATAG TTACTGAGTC ATTTGCTTAT
TCAGGGCTTT ATTACCAGAC TGTTAATATA CTGGTAAATT CAAAAGTTAA AAATGTTCAT
TTCAGATCTG TTGGTGCACC GATAGATCTT ATAGATGCTA CACTGGATTA A
 
Protein sequence
MRKLLLVILS LVIVVSLVSC GGGKSGGNAG DKGTVSFNIE VEPTSLDPQV LTDEAGLNVA 
QFLYESLVRL NEKSEIVPAG AERWDISEDG LKWTFYIRKD MKWSNGDPVT AKDYYNGVKR
GLDPELAAEY AYLTYYIKNA QSYSEKKITD FEQVGVKVID DYTLEFELQD PTAYFGKLLV
MPIFYPVNEK ALAEFGDQYA LDPKKSVYSG PYIMTEWSHG SKVVLERNPN YWTKDKFKIE
KLIAVITADL DSAANSYENG ELTITKISPE KLKAYKDKPE LVSYSDGRVY YFSFNLKNDI
LKNQKVRQAL SLAIDRDKLV NEVLANGSEK GSGIVASGMP GIKDDFRKEN GDLYAQYKDE
DIKKLFEEGL QELGKTPADV KLSLLIDEQG TAKKEAEFYQ AQWREKLGLD VSVDQTTKKD
RIARSRSGDY DIVRYSWGPD FADAMTYLEL FFSNTEMNIP RYVNPEYDEL LSIGRKSNNH
DERTEAMEKA EKIVTESFAY SGLYYQTVNI LVNSKVKNVH FRSVGAPIDL IDATLD