Gene Sterm_3850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3850 
Symbol 
ID8599296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4089375 
End bp4090943 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content33% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003310615 
Protein GI269122438 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA TTTTGATATT ATTATTTATT ATTATGGGAA TTATAGTTTA TGGAGCTAAG 
AATGAGATTA CGGTTAATTT TGAATATGAA CCGTATGAGG CAAATTTAGA TCCTCAACTA
AGTGAAAGTA TTACGGCTGC CAATATAATT CCGTTACTGT TTGAAGGACT AATCAAACAG
AATGAAAATG GGGAGCCGGT TCCAGGGATT GCTGAAAAGT GGGAAAGAAA TGCAGAAGGA
CTTGTATGGA CTTTTTATTT GAGAGAAGCC GAGTGGGAAA ATGGAGATCC TGTGACAGCA
AATGATTTTA AATTTGCATG GATAAGGGCA CTTGACAGTA AAAATGCAGC TGAGTACGCT
TACATGCTGT TTCCTATAAA AGGAGCCTAC GAATTCAATG TAGGAATGGG GAATATAGAG
GAATTAGGAA TAAAAGTTAT AGATGAGAAG ACACTAGAAG TAACACTTAA TAGTCCTACA
AGATATTTAG ATTCACTATT AACCTTTCCG GTATATTCTC CGATAAATGA AAAATATTTT
AATCTGTATA AAGATGAATA TGGAAAAGAC GCAGGGAAAA TAATGTCAAA CGGTGCGTAC
AAACTGGTAA AATGGGAGCA TTCTGATGAA TTAGTTCTTG AAAAAAATAA AAATTACTGG
AATGAGAAAG AAGTGAAAAC TGAAAAAATA AGAATAAAAT TAATAAATGA TATATCAAAA
TCATTAGAGG CTTTTGAAAA TAATGAAATT GCGTTTACTG TGATAACACC GGAACTGTAT
ACTGAATATA GGAAAGATAA GAGACTGATT TCTTATGATG ACGGATCTGC ATGGTATTTA
GAGTATAATT TGGAAAATGA TTTTCTTGCT AACAAGAAAA TAAGACAGGC ATTAACAATA
GCGATAGATA AAGAAGAGCT GGGATCAATC TTGCAGGCAA TGGGTAAACC GGCATACGGA
TATGTTCCGG GATTCGTACA GGGGGTAGAT AAATCTTTCA GAAAAGAAGC AGGAAACACA
TATCCGCATT ATAATTCCAA GAAAGCAAAA AAATTATTTG AAGAAGGACT AAAAGAACTC
AATTTCAATG AAGCACCAGA AATAACACTA ATATTTAACG ATCAGGGAAA TAATAAAAAA
ATAGCGGAAT ATATACAGGG AAAAATAAAA AAGGAACTTG GATATAATTT GAAGATAAAA
TCTTTACCAT TCAAAGAAAG ACTTGAAAGA ATGACTCAAA AGACTTTTGA AATAGTTCTT
GCAGGATGGA GCGGAGATTT TAATGATGCA TTGAGCTATA TGGATATATG GACAACTGGC
GGCGGGAATA ACCATGCCTC ATACTCTAAT CCTAAATATG ACGAATTAAT ACAAATAGCA
CAAACAAGCT CTGATCAAAA AGCAAGAATA AAGGCTATGA TAGAAGCAGA AAAAATTCTT
GGTGATGATA TGCCTATAGG AATGCTCTAT TTTAGACAAA AAGTATTTTT GGTAAATCCG
GAGCTTAAAA ATATGAAATT TAAGCCGGTC GGATCTGAAT ATTATTTAAT AGATGCGTAC
ATAGAATAA
 
Protein sequence
MKKILILLFI IMGIIVYGAK NEITVNFEYE PYEANLDPQL SESITAANII PLLFEGLIKQ 
NENGEPVPGI AEKWERNAEG LVWTFYLREA EWENGDPVTA NDFKFAWIRA LDSKNAAEYA
YMLFPIKGAY EFNVGMGNIE ELGIKVIDEK TLEVTLNSPT RYLDSLLTFP VYSPINEKYF
NLYKDEYGKD AGKIMSNGAY KLVKWEHSDE LVLEKNKNYW NEKEVKTEKI RIKLINDISK
SLEAFENNEI AFTVITPELY TEYRKDKRLI SYDDGSAWYL EYNLENDFLA NKKIRQALTI
AIDKEELGSI LQAMGKPAYG YVPGFVQGVD KSFRKEAGNT YPHYNSKKAK KLFEEGLKEL
NFNEAPEITL IFNDQGNNKK IAEYIQGKIK KELGYNLKIK SLPFKERLER MTQKTFEIVL
AGWSGDFNDA LSYMDIWTTG GGNNHASYSN PKYDELIQIA QTSSDQKARI KAMIEAEKIL
GDDMPIGMLY FRQKVFLVNP ELKNMKFKPV GSEYYLIDAY IE