Gene Sterm_4050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_4050 
Symbol 
ID8599494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4310058 
End bp4311359 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content36% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003310813 
Protein GI269122636 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA AAATATTTTG GTTAATAATG CTTATTTTGA TTTTAACCGT AAGCTGCGGA 
GAGAAAAATA ATACTTCCGG TAAAGATGGC AGCAAGGAAA TAGTACTGAG ATTCTTATGG
TGGGGAAGTG AATCACGTCA TAAAGCAACT CTTGATGCTA TAAAGCTTTT TGAAGAGAAA
AATCCGGGAA TAAAAATAAA AGCAGAATAC GGCGGTACAG ATGGCTATTT CCAGAAGCTT
TCGACACAGC TAACAGGAAA TACAGCACCT GATATAATGC AGGTAGATTA TATATGGCTG
TTTAATTTTT CAAAAAACGG GGACGGCTTT TATGATATTA ATCTGTTAAA AGATGAATTT
AACCTTGCTA ACTATACAAA AGAAGATCTG AGCTATACTA CAATAAACGG AAAACTTAAT
GCAATACCTG TGGGAATGAA TGGAAGAGCA TTTTTCTTTA ATAAGTCTCT GTATGAAAGG
GCAGGAATAG AAATACCTAA AACATTTGAC GAACTGCTTG CTGCCGATAA AATATTAAAG
CAAAAAATAG GTCCGGATAC TAAATCACTG GATATAGTGA CTTCTGACAG CGGTGCTATG
TTCTTTGTGG AATATTATGT GGAGCAAAAA TATGGAAAAA CGCTGATAAA TACTGATAAT
AAAGTAGGAG TTACAAAAGA AGAACTTGCA GATGCATTTA GGTTTTATAA AATGCTTGCA
GACAGCGGTG CAGTAATATC GGCAAAAGAC AGGGCAGGAG CAGGAAACTA TCCTGACGAT
CAGAATCCTT TATGGATAAA TGGTGAGTTA GGCGGTGTTC TAAACTGGAA TACTATGATC
GGGGCATACG GTGATATGCT TAAAAAAGGA GATACAATGG TAGCAGGAGA TTTCCTGATA
GGAATAGGAG ATCATAAATC TGCTTATTTG AAAGTAAATA TGACTTTGGC GATAAATAAA
AATACAAAGC ATCCCAAAGA AGCAGCAAAA TTTTTGAATT TCCTTCTGTC TGATCCTGAG
GCAGCCAAAA TATTAGGGAC TGTAAGGGGA ATTCCTTTGA ATAAGAGTGC TTATGCTGAA
TTGGAAAAAG AAGGTCTTAC TACAGGACCT GTTGCAGAGG GACTTACAAA AGCCCTTAAT
TTTGCCGGAC CTAAAAAAAG CCCTTATATA GAAGATGAAA GAACAAGACA GCTTGGTCTG
GTAATTACAC AAAAACTGGA TTACAACGAA ATAACACCTG AACAGGCAGG AGAACAGATG
TATACAGAAT TGACAAAATT ATTGGAACAA ATGACAAGGT AG
 
Protein sequence
MKNKIFWLIM LILILTVSCG EKNNTSGKDG SKEIVLRFLW WGSESRHKAT LDAIKLFEEK 
NPGIKIKAEY GGTDGYFQKL STQLTGNTAP DIMQVDYIWL FNFSKNGDGF YDINLLKDEF
NLANYTKEDL SYTTINGKLN AIPVGMNGRA FFFNKSLYER AGIEIPKTFD ELLAADKILK
QKIGPDTKSL DIVTSDSGAM FFVEYYVEQK YGKTLINTDN KVGVTKEELA DAFRFYKMLA
DSGAVISAKD RAGAGNYPDD QNPLWINGEL GGVLNWNTMI GAYGDMLKKG DTMVAGDFLI
GIGDHKSAYL KVNMTLAINK NTKHPKEAAK FLNFLLSDPE AAKILGTVRG IPLNKSAYAE
LEKEGLTTGP VAEGLTKALN FAGPKKSPYI EDERTRQLGL VITQKLDYNE ITPEQAGEQM
YTELTKLLEQ MTR