Gene Sterm_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_0472 
Symbol 
ID8595960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp516670 
End bp518076 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content35% 
IMG OID 
Productglycoside hydrolase family 4 
Protein accessionYP_003307280 
Protein GI269119103 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.507113 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACATA AACCTATTAA AATAGTAACT ATTGGTGGTG GTTCCAGTTA TACACCAGAA 
TTATTTGAAG GCTTCATTAA AAGAAGCAAA GAATTACCTA TTGGTGAAAT ATGGCTTGTT
GACATTGAAG AAGGAAAAGA AAAACTGGAA ATTGTTGGTG CCTTAGCTAA ACGTATGGTA
AAAGCAGCAG GGCTTGATTG GAAAGTACAC TTGACTTTAG ACAGAAAAGA AGCCTTAACT
GATGCAGATT TTGTCACAAC ACAATTTAGA GTGGGCTTAT TAGATGCAAG AATTAAAGAT
GAGCGTATTC CATTAAGTCA TGGTATTTTA GGACAGGAAA CTAATGGTGC AGGCGGTATA
TTTAAAGCTT ACAGAACTGT CCCTGTTATT TTAGATATTG TAGAAGATAT GAAAAAGCTT
TGTCCAGATG CTTGGTTAGT TAACTTTACA AATCCTTCAG GTATGATTAC AGAAGCTGTT
ATGCGTTATG GCAACTGGGA CAGGGTAGTC GGTCTTTGTA ACGTGCCAAT ATTATGCCAA
AAAATAGCCT CTGGATCATT AAAAATACCA GAAGAAGAAC TTTTTTTCAA ATTTGCGGGC
TTAAATCATT TTCACTGGCA TCGCGTTTGG GATAAAACAG GAGAAGAAAA AAGTAGTCAA
ATTTTAAGAG ATACATATTC TAATGCAGAA AGTATGGCTG AAGCAATGGA ATTTGTTCGA
AAATCTGCTG GAGAATCAGG AAATACAAAC GACTCTGGTG TTAAGAATAT TCCTAATATC
AGCTACCTGG CAGAACAAGT ACAGAATTTA GGTATTATTC CTTGTATGTA TCATCGTTAC
TACTATATTA CACAAGATAT GCTTGACGAA GAACTTGAGA ATTTTGCAAA AGGTGAAACA
CGTGCTGAAG TCGTTAAAAA AACAGAAAAA GAACTCTTTG AGCTTTATAA AAATCCTGAT
TTATCAATCA AGCCTCCGCA ATTAGAGCAA CGCGGCGGTA CATTTTACAG TGATGCAGCT
TGTGAACTTA TTAATGCTAT CTATAATGAT AAACGTATTC ACATGGTTGT AAGTACTAAA
AATAATGGTG CAATTTCTGA TCTTCCTAAT GATGTTGTTG TTGAAGTTTC AAGTATTATT
ACTAGTCAAG GCCCTGTGCC TATTTCGTGG GGTTCGTTTG ATCCTTCACC TAAAGGCATG
TTACAATTAA TGAAAAATAT GGAACTCATC ACTATTGAAG CAGCATATAC AGGTGATTAT
GGAAAAGCAT TACAGGCATT TACAATTAAT CCGCTTATAC CGCATGGAAA AATTACTAAA
ACATTAATGA ACGAAATGTT GATTGCACAT AAAAAGCACT TGCCCCAGTT CAAAGAAGTT
ATCGAGGAAC TTGAAAAAAT AAAATAA
 
Protein sequence
MVHKPIKIVT IGGGSSYTPE LFEGFIKRSK ELPIGEIWLV DIEEGKEKLE IVGALAKRMV 
KAAGLDWKVH LTLDRKEALT DADFVTTQFR VGLLDARIKD ERIPLSHGIL GQETNGAGGI
FKAYRTVPVI LDIVEDMKKL CPDAWLVNFT NPSGMITEAV MRYGNWDRVV GLCNVPILCQ
KIASGSLKIP EEELFFKFAG LNHFHWHRVW DKTGEEKSSQ ILRDTYSNAE SMAEAMEFVR
KSAGESGNTN DSGVKNIPNI SYLAEQVQNL GIIPCMYHRY YYITQDMLDE ELENFAKGET
RAEVVKKTEK ELFELYKNPD LSIKPPQLEQ RGGTFYSDAA CELINAIYND KRIHMVVSTK
NNGAISDLPN DVVVEVSSII TSQGPVPISW GSFDPSPKGM LQLMKNMELI TIEAAYTGDY
GKALQAFTIN PLIPHGKITK TLMNEMLIAH KKHLPQFKEV IEELEKIK