Gene Sterm_3843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3843 
Symbol 
ID8599289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4078673 
End bp4080265 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content38% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003310608 
Protein GI269122431 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0512301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC TATTTTTATT ATTTATAATG ATGATTCTTA TTTCCTGCGG AGGTGCAGGT 
AAAGAAAGCA GCTCTTCTGG AAACACTAAG ATTATAGTAA ATGAAACAGC AGAACCAAAA
TCTATAGATC CGGGACTTTT GACCGATCAA AGCGGAATAG CAGTTAATTC ACTGGTAAGT
GAGGGATTGA CAAGACAGGG AAAAGACGGG ACTCCTGAGC CGGGACTGGC TGAAAAATGG
GATGTAAGCG AAGACGGGCT GACATGGACT TTTCATTTGA GAGAAAATAT AAAGTGGTCA
AGCGGAGAAC CTGTTACTGC AGATGATTTC AAATTTGCAT GGCTGAGGGT ACTGGAGCCG
GCTACAGCTT CGGAATATGC CTACATGCTT CATTATATAA AAGGCGGTCA GGCATATAAT
GAAGGTAAAG GAAAAAAAGA GGATGTAGGA ATAAATGTAA TAGACAGCAG GACACTGGAA
GTAAAACTGG AAAGACCCAC TGCTTACTTT GCTTCACTGG CGGCATCTCC TACATATGCT
CCGATCAGGG AGAAGTTTTT TGATGAGAAA GGAAAGAATT TTGCTCTGGA AGCTGATGCC
ATGGAGTACA GCGGACCATA TAAAATAAAA AACTGGAAAC ATGATTCTAA CTTTATTATG
GTAAAAAATG AAAACTACTG GAATAAGGAT CATATAAAAA TAGATGAAGT AGAAATGGTT
CTTGTAGCTG ATTCCACAGC TGAGCTGAAT GCATTTAACA ACGGTGAAAT AGAGCTGATA
AGATTAACGG CCGAGCAGTA TAAAAGATAT GAAAAAGATC CGAGAGTAAA TGTATTCAGA
AATAATTCGG TATGGTATCT GGAATATAAT ATGGAGAATA AATTTCTGGC AAATAAGAAA
ATCAGACAGG CGCTTACTCT TGCAGTAGAT AAAGAGGAAA TGGCAAATAC CATAGTGAAA
GGAACGGGAG AAGCAGCTTA CGGTATAGTA CCTACGGGAT TTCCGGGAGA AAGTAAGACT
TTCAGGGAAG AAAACGGAGA TTCATATCCG AAGTATAACC CGGAAGAAGC AAAAAGACTT
TATAAAGAGG GTCTTGCAGA GCTTGGTGTA ACTGAACTTC CTGAACTGTC ACTGATTATA
AATGAAGCCG GAAATAATAA AAAAATAGCA GAGTATGTGC AAGAAAAAAT CAGAACTAAT
CTGGGGGCAA ATATAAGAAT AGAGCCTATT CCTTTTAAGG AAAGAATGGC AAGACTCCAG
CAGAAAGACT TTGAGATAGT TCTTTCAGGG TGGGGTTCTG ATTATGCAGA TCCTATGACA
TATATAGATT TATTCGTGAC AAACGGAGGA AATAATCATT CGTCATATTC TAATCCAAAA
TATGACGAGC TTATAAAGAC AGCAAATAAC AGCAGTGATA ATAAAGTAAG AATGCAGGCT
ATGAGAGATG CGGAGAAAAT ACTGGGTGAT GATATGCCTG TGGGAGTTAT GCTTTATTCT
ACAAGGGTTA TTATGCTTAA TCCAAAAATA AAAAATGTAT ATTTTAAAGG AATCGGAGCA
GAATATTATT TGTATGATGC TTACGTGGAA TAA
 
Protein sequence
MKKLFLLFIM MILISCGGAG KESSSSGNTK IIVNETAEPK SIDPGLLTDQ SGIAVNSLVS 
EGLTRQGKDG TPEPGLAEKW DVSEDGLTWT FHLRENIKWS SGEPVTADDF KFAWLRVLEP
ATASEYAYML HYIKGGQAYN EGKGKKEDVG INVIDSRTLE VKLERPTAYF ASLAASPTYA
PIREKFFDEK GKNFALEADA MEYSGPYKIK NWKHDSNFIM VKNENYWNKD HIKIDEVEMV
LVADSTAELN AFNNGEIELI RLTAEQYKRY EKDPRVNVFR NNSVWYLEYN MENKFLANKK
IRQALTLAVD KEEMANTIVK GTGEAAYGIV PTGFPGESKT FREENGDSYP KYNPEEAKRL
YKEGLAELGV TELPELSLII NEAGNNKKIA EYVQEKIRTN LGANIRIEPI PFKERMARLQ
QKDFEIVLSG WGSDYADPMT YIDLFVTNGG NNHSSYSNPK YDELIKTANN SSDNKVRMQA
MRDAEKILGD DMPVGVMLYS TRVIMLNPKI KNVYFKGIGA EYYLYDAYVE