Gene Sterm_3784 details

Gene Information

Locus tag: Sterm_3784
Symbol:
ID: 8599230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 4022931
End bp: 4024223
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003310549
Protein GI: 269122372
COG category:
COG ID:
TIGRFAM ID:
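
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4022931
END_BP = 4024223
GENE_LENGTH_BP = 1293
PROTEIN_LENGTH_AA = 430


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare a derived value with the value stated in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 1293 bp span corresponds to 431 codons: 430 amino acids plus the stop codon.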


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGTA AAATTTGTTT ATTGATATTA ACATTATTAT TTTTAATTGG ATGCGGAGAG 
AAAAAAAGCA GCGGTACCGG GGGAGACGGA GGAAAAGAAG TAGAATTAAG AGTAATGTGG
TGGGGGTCTG ATGCAAGACA TAAAGCTACA TTAGATGCAA TAAAGCTTTT TGAGGAAAAA
AATCCCGGAA TAAAAATAAA GGCTGAATAT TCAGGGTATG AAGGGTATCT GGAAAAACTG
TCTACACAGA TGAGCGGAAA AACAGCTCCT GATGTTATGC AGGTAGACTG GAACTGGCTG
TATATTTTCT CTAAAAACGG AGACGGTTTT TATAATGTGA AAGACCTGAA AGACTTTGAT
CTCAGTAATT ATGACGAGGC AATTCTGAAC CATACTGTTA TAAAAGATAA ACTTAATGCT
GTTCCGGTAG GGCTGAACGG AATGGGCTTT TATTATAACC AAAGCGTATT TGATAAAGCA
GGGGCAAAAT TTCCTGAAAC AGCAGATGAA CTTTTTGCAG TAAATAAAAT GTTTAAAGAA
AAGCTCGGTG ACGATTACTA TCCTCTTGAT GTATCAGATG AAAAAGTAAA CTATTATTTT
ATAAATTATT ATCTTCTGCA AAAAACAGGA AATAATCTGA TAAATGAGGA AAATAAGATA
GGGATCACAA AAGAAGAGCT TGCAGATGCT TTGAGATTTT ATAAGAGAAT GGTTGATGAA
AAGGTTACAT TATCTACGAA AGACAGAGCG GGACTTGGAA ATGTTCCCGG AGATCAGAAT
CCTCTGTGGG TAGACGGGCA TATCGGCGGA ACCTATGAAT GGTCTTCGAG AACAGGGGTA
TTTCAGGACA CTCTTAAGGA AGGACAGGTG GCTTCCGGGA ACTATATTAA AGGTTTAGGC
GATCATAATG CAGCATTCAT AAGACTGAAT ATGGCATTTG CAGTAAATAA AAATACAAAA
CATCCCGAAG CAGCAGCAAA ATTTTTGAAT TTTATGCTTT CTGATCCTGA GGCTACTAAA
ATACTCGGAA TGACAAGAGG AATACCTTCA AATAAAAAAG CTCTTGAGGT ATTGGAGCAG
GAAAATATGA TAACAGGAAT ATCAAAAGAA GTACTTGATA AGGCTCTGGC ATATCAGGGG
AAAATGATAA GCCCTTATTA TGAGGACGAA AGACTTATTA AGATATTTGC AGGTTATGTG
CAAAAAATTG ATTATGGTCA GATTGACATA GACAAGGCGG CAGAAGGGAT TATTGCTGAT
ATGGAAGCAG CATTAAAGAA TATTTTGAGA TAA
 
Protein sequence
MKSKICLLIL TLLFLIGCGE KKSSGTGGDG GKEVELRVMW WGSDARHKAT LDAIKLFEEK 
NPGIKIKAEY SGYEGYLEKL STQMSGKTAP DVMQVDWNWL YIFSKNGDGF YNVKDLKDFD
LSNYDEAILN HTVIKDKLNA VPVGLNGMGF YYNQSVFDKA GAKFPETADE LFAVNKMFKE
KLGDDYYPLD VSDEKVNYYF INYYLLQKTG NNLINEENKI GITKEELADA LRFYKRMVDE
KVTLSTKDRA GLGNVPGDQN PLWVDGHIGG TYEWSSRTGV FQDTLKEGQV ASGNYIKGLG
DHNAAFIRLN MAFAVNKNTK HPEAAAKFLN FMLSDPEATK ILGMTRGIPS NKKALEVLEQ
ENMITGISKE VLDKALAYQG KMISPYYEDE RLIKIFAGYV QKIDYGQIDI DKAAEGIIAD
MEAALKNILR
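
The relationship between the gene sequence and the protein sequence can be illustrated with a small, self-contained translation sketch. It is not the tool that produced this record. It assumes the sequence is pasted in the 10-base grouped layout shown above, and it uses the amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs only in its permitted start codons). The names translate and gc_content are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the grouped layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Alternative start codons are not converted to M here; this gene
    starts with ATG, so that case does not arise.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # Last line of the gene sequence above; it encodes the last line
    # of the protein sequence.
    print(translate("ATGGAAGCAG CATTAAAGAA TATTTTGAGA TAA"))
```

Passing the full gene sequence to translate and gc_content would allow the listed protein sequence and GC content to be checked against the nucleotide data.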