Gene Sterm_4051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_4051 
Symbol 
ID8599495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4311499 
End bp4312803 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content36% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003310814 
Protein GI269122637 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAA GAATATTTTG GTTAATGATG CTTATTTTGG TTTTAACAAT AAGTTGTGGT 
GAGAAAAAAG AATCTTCAAA AGGTGCGGGA GCAGAAAAAG AAATTACGCT GAGATTTTCA
TGGTGGGGCG GAGATGCACG TCATAAAGCT ACTTTAGATG TAATAAAGCT ATATGAGGAA
AAAAATCCCG GAGTAAAAAT AAAAGCAGAA TACAGCGGAT GGGACGGACA TTTTGAGAAA
ATCTCTACAC AGATAACAGG AAATACAGCA CCGGATATAA TGCAGATAGA CTACAACTGG
TTATATAACT TTTCAAAAAA CGGAGATGGT TTTTACGATA TTAATACATT AAAAGATGAT
TTTAATCTTG ATAACTATGA TGAACAGGCT TTGAGCTATA CTATTATAAA CGGAAAACTT
AATGCAATAC CTGTGGGAAT GAACGGAAGG GCATTTTTCT TTAATAAATC TTTATATGAT
AAGGCAGGAG TGGAAATTCC AAAAACATTC GACGAATTAC TTGCCACGGA TAAGGTAATT
AAAGAAAAAG TGAGTAAGGA TGCCAAATCA CTGGATATTA CTTCTACTGA CAGCGGAGCA
TTGTTTTTTA TAGAATATTA TGTAGAGCAA AAATACGGGA AACCGATATT AACTGCTGAA
AATACAATAG GGGTCACAAA AGAAGAGCTT GCGGATGCAT TTAAGTTTTA TAAAATGCTG
GTTGACAGCG GAGCTGTTGT TTCGGCAAAA GACAGAGCGG GAGCGGGAAA TTTTCCCGAT
GATCAGAATC CTTTGTGGAT AAACGGCGAG CTGGGAGCTG TTTTAACATG GAATACAATG
GTGGGGCAAT ATGAGAATAT GCTTAAAGAG GGCGATACAT TAGTCTCGGG AGATTTTCTG
ACAGGAATCG GAGAACATAA ATCTACATTC ATAAAAGTAA ATATGGCTTT TGCAATAAAT
AAAAATACAA AATATCCGAA AGAAGCTGCA AAATTTCTGA ATTTTATGCT GTCAGATCCT
GAAGCAGCAA AAATACTGGG AACTGTAAGA GGAATTCCTT TGAATAAAAG TGCTTTTGCC
GAACTGGAAA AGGAAGGACA GACAAAAGGG CCTCTTGCGG AAGGACTTAC AAAAGCACTG
GCTTTCGCCG GACCAAAAAC AAGTCCCTAT ATAGAAGACG AAAGAACAAG AAAGCTGGGA
CTGGAAATTA CGCAAAAAGT GGATTATAAC GAATTAACAC CGGAACAGGC AGGAGAAAGA
TTATATACTG AATTGGAAAA ATTATTAAAA CAAATGACAA GATAG
 
Protein sequence
MGKRIFWLMM LILVLTISCG EKKESSKGAG AEKEITLRFS WWGGDARHKA TLDVIKLYEE 
KNPGVKIKAE YSGWDGHFEK ISTQITGNTA PDIMQIDYNW LYNFSKNGDG FYDINTLKDD
FNLDNYDEQA LSYTIINGKL NAIPVGMNGR AFFFNKSLYD KAGVEIPKTF DELLATDKVI
KEKVSKDAKS LDITSTDSGA LFFIEYYVEQ KYGKPILTAE NTIGVTKEEL ADAFKFYKML
VDSGAVVSAK DRAGAGNFPD DQNPLWINGE LGAVLTWNTM VGQYENMLKE GDTLVSGDFL
TGIGEHKSTF IKVNMAFAIN KNTKYPKEAA KFLNFMLSDP EAAKILGTVR GIPLNKSAFA
ELEKEGQTKG PLAEGLTKAL AFAGPKTSPY IEDERTRKLG LEITQKVDYN ELTPEQAGER
LYTELEKLLK QMTR