Gene Sterm_1524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_1524 
Symbol 
ID8596994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp1619315 
End bp1620634 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content38% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003308314 
Protein GI269120137 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TAAGTATGAG GTTAATGATT TTAATTACTG GAATTATTTT TCTGACTGCC 
TGCGGAAAAA AAGATGATAC AGGAACAACT GAAACAGGTG CAGCAGCCGG CGGGCAGGTA
ACTGTGGAAT TTATGCATTC AATGGTAGAA CAGGAAAGAC TTGATCAGAT AAATGAAATA
ATAGCAGAAT TTGAGAAAGA AAATCCGGAT ATAAAAATAA AACAAATACC GGTAGATGAA
GACTCTTACC AGACTAAAGT AACTACACTC GGATCAAGCG GAAAGCTTCC GGCAATAATA
GAAGTGAGCA ATGATTATGC AAAAGTAATG GCTAAAAATG AGTTTGTAGA TTATGAGGCA
GTAAATAAAG TAATACAGGA TAAAGGTGCT GACAGTTTTT ATGACGGGGC ATTAAGAGTA
TTAAAGACAG AAGACGGGGC TAATTATGCA GCTGTTCCTA TCAGCGGATG GGTTCAGGGT
GTATGGTATA ACAAAAAAGC CTTTCAGGAA AAAGGACTAA AAGAGCCTGA AACATGGGAA
GATATTCTGG CAGCAGCCAA AGCATTCAAT GATCCTGCGA ATAAAAAATA CGGGATAGCA
CTTGCTACTG CAAAAAGTGT AATGACAGAA CAGGTATTTT CACAGTTTGC ATTATCAAAC
GGGGCAAATG TACTGAACGG AGAAGGAAAA GCAGCACTTG ACACTCCTGA AATGAAAGAA
GCGATAGAAT ATTACCAGGA GCTTGCAAAA TATACAATGC CCGGATCAAA TGACGTATCA
CAGGTAAAGG ATGCGTTTTT AAACGGATCG GCTCCTATGG TGATTTATTC TACTTATATA
CTTCCGGCTG CATATAAAGA AGGAATAACA GATGATCTTG GGTATGCTGT TCCTACAAAA
AAACAAGGAG CGGCTTTCGG GGTAGTTTCT GCACTTACTA TTACAAATGG TCTTGATGAT
AATCAAAAGG CAGCGGCAGA AAAATTTGTA GCATTTATGT TAAAAGATCA GTCTAATGCA
AAATGGATAC TTATGTCTCC GGGCGGATTA CAGCCTGTAA TAAAATCTGT AGCTACGAGT
CCTGAATATA CATCTAACGA AGTAATAAAA ACATTTTCGG CATTCAGTAG TGATCTGACA
TCTTCATTTA ATAACCTTCA GATGTTTGGT GTAGTGGACG GGAAAAATTT CATAGTAATG
GGAGATATTA CCAATGCAGG AATTATAGGC GGAATGATTA ATGAAATAGT GGTAAATAAC
AAGGATATAG CTGCCGGTAT GAAAGCAGCA CAGGAGCAGA TAGCTGAAAT TGCACAATAA
 
Protein sequence
MKKISMRLMI LITGIIFLTA CGKKDDTGTT ETGAAAGGQV TVEFMHSMVE QERLDQINEI 
IAEFEKENPD IKIKQIPVDE DSYQTKVTTL GSSGKLPAII EVSNDYAKVM AKNEFVDYEA
VNKVIQDKGA DSFYDGALRV LKTEDGANYA AVPISGWVQG VWYNKKAFQE KGLKEPETWE
DILAAAKAFN DPANKKYGIA LATAKSVMTE QVFSQFALSN GANVLNGEGK AALDTPEMKE
AIEYYQELAK YTMPGSNDVS QVKDAFLNGS APMVIYSTYI LPAAYKEGIT DDLGYAVPTK
KQGAAFGVVS ALTITNGLDD NQKAAAEKFV AFMLKDQSNA KWILMSPGGL QPVIKSVATS
PEYTSNEVIK TFSAFSSDLT SSFNNLQMFG VVDGKNFIVM GDITNAGIIG GMINEIVVNN
KDIAAGMKAA QEQIAEIAQ