Gene Sterm_4038 details


Gene Information

Locus tag: Sterm_4038
Symbol:
ID: 8599482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 4296194
End bp: 4297411
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003310801
Protein GI: 269122624
COG category:
COG ID:
TIGRFAM ID:
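
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4296194
END_BP = 4297411


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1218 405
```

Under these assumptions the computed values agree with the listed Gene Length (1218 bp) and Protein Length (405 aa).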


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATT TAAAAGTACT CATACTTATG GTATTTTCAA TATTCATTTT AGTGTCATGC 
GGAGGGGGAG ACTCCAAGGT AAAAACAATA GATTTTATAA TATCTGATGA TTCCCTTGAA
GGGGGAGCCA TGGCAAAGGC AGTGGAAAGA TATAATAATT CACAGGACGA AATAAAAATA
AATCTTATAG AACTTCCTTA TGACAGTGTA AGAGCAAAAG TAAAGACAAT GGTAGCAGGA
GGAAAGGCGC CGGCTCTTAT GAGAACATCA AATATAGATG AATTTGAAAC AGTTTTGGCA
GATCTCTCAG ATACAGTGAA TCCTGCTGAT TTTACAGATA AAATGGAAGA AAATCTTATG
GACGGGAAAT TTTTGGGAGT ACCTTTGAAT TTAACAGTAA ACGGGCTTAT TTATAATAAA
ACATTATTTG ATAAGGCAGG AGTAAAGGTA CCTGATTCAC AGGATAATAT CTGGACATGG
GATGAATTTG TGCAGGCATT AAATACCGTG AAAGAAAAGA ACAGTTTAAA ATACGGAATG
GTAATGGACT TTTCTCAGAA CAGATATCAG ACAATGCTTT ATCAGTTTGG AGGAAGAATA
TTTGATGAGA ACGGGGATAT AGTGGTAGAT CAGCCTGACA GTATAAGAAC TCTTGATTAT
TTTATAAAGC TTCATAAAGA TAAGGTAATG CCTGATGATG TATGGCTTGG CGGGGAAGAT
GCAAGCAATC TTTTCAAAAC AGGAACTATT CCTGCATATT ACTCGGGAAG CTGGAAAATA
AATGAATATA AAAATGATAT AAAAGATTTT GAGTGGGGAA TAGCATATAT GCCGAAAGAA
AAAAACAGAT CTTCAATAGC AGGCGGAAAT TTTCTTGTAG CATTTACTAA GTCACCTGAT
CTTGAAGAGG CAAAAAAATT CATTAAGTGG TTTTATCAGG ACGAGAATTA TAAGCAGTAT
TGTGAAGACG GAGCCTATAT ATCCGGGAAA TTAAGCGTAC ACCCGTCTTA TGACTACGGT
CAGGAATTTT TTGATATTCT TGATAATGAA ATTGTGAATA CACCTGAATT AAGTCCTAAT
GACAAAAAAA TGATAAAAAA ATATAAAGCA GCAGGAAATA TGATGAGAGA TTATATCGTA
TATGCTATTC AGGGAGAAAG AACACCAGTA CAGGCAATGA CTGAACTAAA GGAAAAATGG
TCTGAGCTGA AAAAATAA
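
The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the whole sequence, and that the spaces and line breaks in the listing are only layout; `clean_sequence` and `gc_content` are hypothetical helper names. It is shown on the first 20 bases of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to lay the sequence out in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above.
fragment = clean_sequence("ATGAAAAATT TAAAAGTACT")
print(len(fragment), gc_content(fragment))  # prints: 20 15.0
```

Passing the complete gene sequence through the same two functions gives a value to compare with the listed GC content.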
 
Protein sequence
MKNLKVLILM VFSIFILVSC GGGDSKVKTI DFIISDDSLE GGAMAKAVER YNNSQDEIKI 
NLIELPYDSV RAKVKTMVAG GKAPALMRTS NIDEFETVLA DLSDTVNPAD FTDKMEENLM
DGKFLGVPLN LTVNGLIYNK TLFDKAGVKV PDSQDNIWTW DEFVQALNTV KEKNSLKYGM
VMDFSQNRYQ TMLYQFGGRI FDENGDIVVD QPDSIRTLDY FIKLHKDKVM PDDVWLGGED
ASNLFKTGTI PAYYSGSWKI NEYKNDIKDF EWGIAYMPKE KNRSSIAGGN FLVAFTKSPD
LEEAKKFIKW FYQDENYKQY CEDGAYISGK LSVHPSYDYG QEFFDILDNE IVNTPELSPN
DKKMIKKYKA AGNMMRDYIV YAIQGERTPV QAMTELKEKW SELKK
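
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming the sense codon assignments of the standard genetic code (which table 11 shares) and ignoring the alternative start codons of table 11, since this gene begins with ATG. The `translate` helper is hypothetical and is shown on the first and last codons of the gene only.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(raw: str) -> str:
    """Translate a nucleotide string codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence above.
print(translate("ATGAAAAATT TAAAAGTACT CATACTTATG"))  # prints: MKNLKVLILM
print(translate("TCTGAGCTGA AAAAATAA"))               # prints: SELKK
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence.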