Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_0472 |
Symbol | |
ID | 8595960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | - |
Start bp | 516670 |
End bp | 518076 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003307280 |
Protein GI | 269119103 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.507113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACATA AACCTATTAA AATAGTAACT ATTGGTGGTG GTTCCAGTTA TACACCAGAA TTATTTGAAG GCTTCATTAA AAGAAGCAAA GAATTACCTA TTGGTGAAAT ATGGCTTGTT GACATTGAAG AAGGAAAAGA AAAACTGGAA ATTGTTGGTG CCTTAGCTAA ACGTATGGTA AAAGCAGCAG GGCTTGATTG GAAAGTACAC TTGACTTTAG ACAGAAAAGA AGCCTTAACT GATGCAGATT TTGTCACAAC ACAATTTAGA GTGGGCTTAT TAGATGCAAG AATTAAAGAT GAGCGTATTC CATTAAGTCA TGGTATTTTA GGACAGGAAA CTAATGGTGC AGGCGGTATA TTTAAAGCTT ACAGAACTGT CCCTGTTATT TTAGATATTG TAGAAGATAT GAAAAAGCTT TGTCCAGATG CTTGGTTAGT TAACTTTACA AATCCTTCAG GTATGATTAC AGAAGCTGTT ATGCGTTATG GCAACTGGGA CAGGGTAGTC GGTCTTTGTA ACGTGCCAAT ATTATGCCAA AAAATAGCCT CTGGATCATT AAAAATACCA GAAGAAGAAC TTTTTTTCAA ATTTGCGGGC TTAAATCATT TTCACTGGCA TCGCGTTTGG GATAAAACAG GAGAAGAAAA AAGTAGTCAA ATTTTAAGAG ATACATATTC TAATGCAGAA AGTATGGCTG AAGCAATGGA ATTTGTTCGA AAATCTGCTG GAGAATCAGG AAATACAAAC GACTCTGGTG TTAAGAATAT TCCTAATATC AGCTACCTGG CAGAACAAGT ACAGAATTTA GGTATTATTC CTTGTATGTA TCATCGTTAC TACTATATTA CACAAGATAT GCTTGACGAA GAACTTGAGA ATTTTGCAAA AGGTGAAACA CGTGCTGAAG TCGTTAAAAA AACAGAAAAA GAACTCTTTG AGCTTTATAA AAATCCTGAT TTATCAATCA AGCCTCCGCA ATTAGAGCAA CGCGGCGGTA CATTTTACAG TGATGCAGCT TGTGAACTTA TTAATGCTAT CTATAATGAT AAACGTATTC ACATGGTTGT AAGTACTAAA AATAATGGTG CAATTTCTGA TCTTCCTAAT GATGTTGTTG TTGAAGTTTC AAGTATTATT ACTAGTCAAG GCCCTGTGCC TATTTCGTGG GGTTCGTTTG ATCCTTCACC TAAAGGCATG TTACAATTAA TGAAAAATAT GGAACTCATC ACTATTGAAG CAGCATATAC AGGTGATTAT GGAAAAGCAT TACAGGCATT TACAATTAAT CCGCTTATAC CGCATGGAAA AATTACTAAA ACATTAATGA ACGAAATGTT GATTGCACAT AAAAAGCACT TGCCCCAGTT CAAAGAAGTT ATCGAGGAAC TTGAAAAAAT AAAATAA
|
Protein sequence | MVHKPIKIVT IGGGSSYTPE LFEGFIKRSK ELPIGEIWLV DIEEGKEKLE IVGALAKRMV KAAGLDWKVH LTLDRKEALT DADFVTTQFR VGLLDARIKD ERIPLSHGIL GQETNGAGGI FKAYRTVPVI LDIVEDMKKL CPDAWLVNFT NPSGMITEAV MRYGNWDRVV GLCNVPILCQ KIASGSLKIP EEELFFKFAG LNHFHWHRVW DKTGEEKSSQ ILRDTYSNAE SMAEAMEFVR KSAGESGNTN DSGVKNIPNI SYLAEQVQNL GIIPCMYHRY YYITQDMLDE ELENFAKGET RAEVVKKTEK ELFELYKNPD LSIKPPQLEQ RGGTFYSDAA CELINAIYND KRIHMVVSTK NNGAISDLPN DVVVEVSSII TSQGPVPISW GSFDPSPKGM LQLMKNMELI TIEAAYTGDY GKALQAFTIN PLIPHGKITK TLMNEMLIAH KKHLPQFKEV IEELEKIK
|
| |