Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_3852 |
Symbol | |
ID | 8599298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | - |
Start bp | 4092719 |
End bp | 4094329 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003310617 |
Protein GI | 269122440 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000990708 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAAC TTTTGTTAGT AATTTTGAGT TTAGTTATTG TTGTTTCGTT AGTTTCTTGC GGAGGAGGTA AGTCTGGAGG AAATGCAGGG GATAAAGGTA CTGTCAGCTT CAATATAGAA GTAGAGCCTA CATCACTGGA TCCACAGGTA CTTACTGATG AAGCAGGTCT TAACGTTGCA CAGTTTTTAT ATGAAAGTCT TGTAAGACTA AACGAAAAGA GTGAAATAGT TCCTGCGGGA GCTGAAAGAT GGGATATAAG TGAAGACGGG CTGAAGTGGA CTTTTTATAT CAGAAAAGAC ATGAAGTGGT CGAACGGAGA TCCTGTTACA GCCAAAGATT ATTACAACGG TGTAAAAAGA GGACTTGATC CTGAGCTTGC AGCAGAATAT GCTTATTTAA CATATTACAT AAAAAATGCG CAGAGTTACA GTGAAAAGAA AATAACGGAT TTTGAACAGG TAGGGGTAAA AGTTATTGAT GATTACACAC TGGAATTTGA ATTACAGGAT CCTACAGCTT ATTTCGGGAA ACTGCTTGTA ATGCCTATAT TCTATCCTGT AAATGAAAAA GCTCTTGCAG AATTCGGGGA TCAGTATGCA CTTGATCCTA AAAAATCTGT TTATTCAGGA CCGTATATAA TGACAGAATG GAGTCACGGA AGTAAAGTAG TTCTGGAGAG AAACCCTAAT TACTGGACTA AAGATAAGTT TAAAATCGAA AAGCTTATTG CGGTAATAAC TGCAGATTTA GATTCGGCAG CAAATTCTTA TGAAAACGGC GAGCTTACTA TTACTAAAAT TTCTCCTGAA AAGCTAAAGG CTTATAAAGA CAAACCTGAA TTAGTAAGTT ATTCAGACGG AAGAGTTTAC TATTTTTCAT TTAACCTGAA AAATGATATA CTTAAAAATC AAAAAGTAAG ACAGGCTTTA TCACTTGCAA TAGACAGAGA CAAGCTGGTA AATGAAGTGC TGGCAAACGG TTCTGAAAAA GGAAGCGGAA TAGTAGCCTC GGGAATGCCT GGAATAAAGG ATGACTTTAG AAAAGAAAAC GGTGACTTAT ACGCACAGTA TAAAGATGAA GATATAAAGA AACTTTTTGA AGAAGGACTT CAGGAATTAG GAAAAACTCC TGCTGATGTA AAGCTGTCAC TTCTTATAGA CGAACAGGGA ACTGCAAAGA AAGAAGCAGA ATTCTATCAG GCACAGTGGA GAGAAAAGTT AGGGCTTGAT GTTTCTGTTG ACCAAACTAC TAAAAAAGAC AGAATAGCAA GATCAAGATC AGGAGACTAT GATATAGTAA GATACTCATG GGGACCTGAC TTCGCAGATG CTATGACTTA TCTGGAATTG TTCTTCTCGA ATACTGAAAT GAATATTCCT AGATATGTAA ATCCTGAGTA CGACGAGCTT TTATCAATCG GAAGAAAAAG CAATAATCAT GATGAAAGAA CTGAAGCTAT GGAGAAAGCT GAAAAGATAG TTACTGAGTC ATTTGCTTAT TCAGGGCTTT ATTACCAGAC TGTTAATATA CTGGTAAATT CAAAAGTTAA AAATGTTCAT TTCAGATCTG TTGGTGCACC GATAGATCTT ATAGATGCTA CACTGGATTA A
|
Protein sequence | MRKLLLVILS LVIVVSLVSC GGGKSGGNAG DKGTVSFNIE VEPTSLDPQV LTDEAGLNVA QFLYESLVRL NEKSEIVPAG AERWDISEDG LKWTFYIRKD MKWSNGDPVT AKDYYNGVKR GLDPELAAEY AYLTYYIKNA QSYSEKKITD FEQVGVKVID DYTLEFELQD PTAYFGKLLV MPIFYPVNEK ALAEFGDQYA LDPKKSVYSG PYIMTEWSHG SKVVLERNPN YWTKDKFKIE KLIAVITADL DSAANSYENG ELTITKISPE KLKAYKDKPE LVSYSDGRVY YFSFNLKNDI LKNQKVRQAL SLAIDRDKLV NEVLANGSEK GSGIVASGMP GIKDDFRKEN GDLYAQYKDE DIKKLFEEGL QELGKTPADV KLSLLIDEQG TAKKEAEFYQ AQWREKLGLD VSVDQTTKKD RIARSRSGDY DIVRYSWGPD FADAMTYLEL FFSNTEMNIP RYVNPEYDEL LSIGRKSNNH DERTEAMEKA EKIVTESFAY SGLYYQTVNI LVNSKVKNVH FRSVGAPIDL IDATLD
|
| |